194,483 results on '"Krause, A."'
Search Results
202. Adult skull bone marrow is an expanding and resilient haematopoietic reservoir
- Author
-
Koh, Bong Ihn, Mohanakrishnan, Vishal, Jeong, Hyun-Woo, Park, Hongryeol, Kruse, Kai, Choi, Young Jun, Nieminen-Kelhä, Melina, Kumar, Rahul, Pereira, Raquel S., Adams, Susanne, Lee, Hyuek Jong, Bixel, M. Gabriele, Vajkoczy, Peter, Krause, Daniela S., and Adams, Ralf H.
- Published
- 2024
- Full Text
- View/download PDF
203. Mobile learning in the classroom – Should students bring mobile devices for learning, or should these be provided by schools?
- Author
-
Laumann, Daniel, Krause, Maurice, Kremer, Fabienne E., Leibrock, Barbara, Ubben, Malte S., Forthmann, Boris, Janzik, Robin, Masemann, Dörthe, Reer, Felix, Denz, Cornelia, Greefrath, Gilbert, Heinicke, Susanne, Marohn, Annette, Quandt, Thorsten, Souvignier, Elmar, and Heusler, Stefan
- Published
- 2024
- Full Text
- View/download PDF
204. Seven strategies to leverage water for peace and foster sustainable and just water management for all
- Author
-
Krause, Stefan, Lynch, Iseult, Agarwal, Ankit, Akinsemolu, Adenike, Arheimer, Berit, Buytaert, Wouter, Floyd, Rita, Houdret, Annabelle, Saccoccia, Elizabeth, Schneidewind, Uwe, Tockner, Klement, Yasmin, Tahmina, and Hannah, David M.
- Published
- 2024
- Full Text
- View/download PDF
205. Enantioconvergent copper-catalysed difluoromethylation of alkyl halides
- Author
-
Ding, Decai, Yin, Lingfeng, Poore, Andrew T., Ho, Yeu-Shiuan, Cheng, Yu-Ho, Hsieh, Chi-Tien, Yachuw, Stephen C., Knieser, Rachael M., Krause, Jeanette A., Tian, Shiliang, Cheng, Mu-Jeng, and Liu, Wei
- Published
- 2024
- Full Text
- View/download PDF
206. Assessment of Sub-micrometer-Sized Particles with Practical Activities in an Underground Coal Mine
- Author
-
Chen, Yi-Hsuan, Munoz, Alejandro, Krause, Connor, Brune, Jürgen, and Tsai, Candace S. J.
- Published
- 2024
- Full Text
- View/download PDF
207. Enriching productive mutational paths accelerates enzyme evolution
- Author
-
Patsch, David, Schwander, Thomas, Voss, Moritz, Schaub, Daniela, Hüppi, Sean, Eichenberger, Michael, Stockinger, Peter, Schelbert, Lisa, Giger, Sandro, Peccati, Francesca, Jiménez-Osés, Gonzalo, Mutný, Mojmír, Krause, Andreas, Bornscheuer, Uwe T., Hilvert, Donald, and Buller, Rebecca M.
- Published
- 2024
- Full Text
- View/download PDF
208. Correlation between MRI utilization and therapy switches in disease-modifying treatments for multiple sclerosis
- Author
-
Naizer, Hayden, Kohl III, Harold, Krause, Trudy, Hamden, Randa, Wozny, Joseph, Charron, Odelin, and Freeman, Leorah
- Published
- 2024
- Full Text
- View/download PDF
209. Cellulose regenerated films obtained from the dissolution of cotton waste in ionic liquid
- Author
-
Knihs, Aline Ferreira, de Brito, Beatriz Barbosa, Granato, Miguel Angelo, Porto, Bruna, Siqueira Curto Valle, Rita de Cassia, and Bierhalz, Andrea Cristiane Krause
- Published
- 2024
- Full Text
- View/download PDF
210. High incidence of cassava common mosaic virus in cassava plants and complete genome sequence of a distinct isolate from Brazil
- Author
-
Watanabe, Luís Fernando Maranho, Ribeiro-Junior, Marcos Roberto, Portilho, Angélica Maria Nogueira, Marubayashi, Julio Massaharu, Barreto da Silva, Felipe, Uzan, Juliana, Favara, Gabriel Madoglio, and Krause-Sakate, Renate
- Published
- 2024
- Full Text
- View/download PDF
211. Passiflora virus Y in soybean: High susceptibility of soybean cultivars, unlikely transmission trough seeds and no detection of the virus in fields from São Paulo state, Brazil
- Author
-
da Cruz Martines, Caroline, Secler, Luana Cury, Favara, Gabriel Madoglio, de Oliveira, Cintia Sabino, Marubayashi, Julio Massaharu, Barreto da Silva, Felipe, Uzan, Juliana, and Krause-Sakate, Renate
- Published
- 2024
- Full Text
- View/download PDF
212. Traumaassoziierte Gefäßverletzungen und deren gefäßchirurgische/interventionelle Rekonstruktionsmöglichkeiten
- Author
-
Barth, U., Piatek, S., Stojkova, M., Krause, H., Meyer, F., and Halloul, Z.
- Published
- 2024
- Full Text
- View/download PDF
213. Post-anesthesia care unit hypotension in low-risk patients recovering from non-cardiac surgery: a prospective observational study
- Author
-
Flick, Moritz, Lohr, Anneke, Weidemann, Friederike, Naebian, Ashkan, Hoppe, Phillip, Thomsen, Kristen K., Krause, Linda, Kouz, Karim, and Saugel, Bernd
- Published
- 2024
- Full Text
- View/download PDF
214. Frieden fühlen? Emotionale (Be)Deutungen von innerem Frieden nach Konflikt und Flucht
- Author
-
Edler, Hannah and Krause, Ulrike
- Published
- 2024
- Full Text
- View/download PDF
215. Electronic nudges for sustained influenza vaccination uptake in older adults: the nationwide randomized NUDGE-FLU-2 trial
- Author
-
Johansen, Niklas Dyrby, Vaduganathan, Muthiah, Bhatt, Ankeet S., Modin, Daniel, Chatur, Safia, Claggett, Brian L., Janstrup, Kira Hyldekær, Larsen, Carsten Schade, Larsen, Lykke, Wiese, Lothar, Dalager-Pedersen, Michael, Køber, Lars, Solomon, Scott D., Sivapalan, Pradeesh, Jensen, Jens Ulrik Stæhr, Martel, Cyril Jean-Marie, Krause, Tyra Grove, and Biering-Sørensen, Tor
- Published
- 2024
- Full Text
- View/download PDF
216. Self-reported benzodiazepine use among adults with chronic spinal cord injury in the southeastern USA: associations with demographic, injury, and opioid use characteristics
- Author
-
DiPiro, Nicole D., Dismuke-Greer, Clara E., and Krause, James S.
- Published
- 2024
- Full Text
- View/download PDF
217. 9,000 years of genetic continuity in southernmost Africa demonstrated at Oakhurst rockshelter
- Author
-
Gretzinger, Joscha, Gibbon, Victoria E., Penske, Sandra E., Sealy, Judith C., Rohrlach, Adam B., Salazar-García, Domingo C., Krause, Johannes, and Schiffels, Stephan
- Published
- 2024
- Full Text
- View/download PDF
218. Einstellungen von Medizinischen Fachangestellten und Hausärzt:innen zum geriatrischen Assessment in der Hausarztpraxis: Eine Fragebogenerhebung in Thüringen, Berlin und Brandenburg
- Author
-
Rost, Liliana, Bleidorn, Jutta, Döpfmer, Susanne, Jung, Paul, Krause, Markus, Kümpel, Lisa, Kuschick, Doreen, Toutaoui, Kahina, and Wolf, Florian
- Published
- 2024
- Full Text
- View/download PDF
219. Doping Effect of Poly(vinylidene fluoride) on Carbon Nanofibers Deduced by Thermoelectric Analysis of Their Melt Mixed Films
- Author
-
Paleo, A. J., Serrato, V. M., Mánuel, J. M., Toledano, O., Muñoz, E., Melle-Franco, M., Krause, B., Pötschke, P., and Lozano, K.
- Published
- 2024
- Full Text
- View/download PDF
220. Komplementärmedizinische Therapieansätze bei krebsbedingter Fatigue
- Author
-
Busch, Alina, Krause, Alena, and Rostock, Matthias
- Published
- 2024
- Full Text
- View/download PDF
221. Intraoperative cone-beam computed tomography for catheter placement verification in pediatric hydrocephalus: technical note
- Author
-
Krause, Matthias, Lagumdzija, Jasmina, Enzinger, Simon, Wittig, Jörn, Gaggl, Alexander, Metzger, Roman P., and Griessenauer, Christoph J.
- Published
- 2024
- Full Text
- View/download PDF
222. Molekulare Onkologie: Schrittmacher der Präzisionsonkologie
- Author
-
Kopp, Hans-Georg, Krause, Mechthild, Röcken, Christoph, and Höffken, Klaus
- Published
- 2024
- Full Text
- View/download PDF
223. Extensive protein pyrophosphorylation revealed in human cell lines
- Author
-
Morgan, Jeremy A. M., Singh, Arpita, Kurz, Leonie, Nadler-Holly, Michal, Ruwolt, Max, Ganguli, Shubhra, Sharma, Sheenam, Penkert, Martin, Krause, Eberhard, Liu, Fan, Bhandari, Rashna, and Fiedler, Dorothea
- Published
- 2024
- Full Text
- View/download PDF
224. Boosting effects of mindfulness-based intervention with a multi-modal adaptive supplement: a preliminary investigation
- Author
-
Lucas-Thompson, Rachel G., Krause, Jill T., Rzonca, Addie, Moran, Megan J., Miller, Reagan L., Rigsby, Brock A., Najman, Jonathan I., Adams, Melanie S., Haddock, Shelley A., Zimmerman, Toni S., and Shomaker, Lauren B.
- Published
- 2024
- Full Text
- View/download PDF
225. Aktuelle Zahlen zur rheumatologischen Versorgung – Jahresbericht aus der Kerndokumentation der regionalen kooperativen Rheumazentren
- Author
-
Albrecht, Katinka, Thiele, Katja, Alexander, Tobias, Aringer, Martin, Eidner, Thorsten, Henes, Jörg, Hoese, Guido, Karberg, Kirsten, Kiltz, Uta, Krause, Andreas, Ochs, Wolfgang, Richter, Jutta G, Späthling-Mestekemper, Susanna, Steinmüller, Mirko, Wassenberg, Siegfried, Strangfeld, Anja, and Callhoff, Johanna
- Published
- 2024
- Full Text
- View/download PDF
226. Astrophysical systematics in Kinematic Lensing: quantifying an Intrinsic Alignment analog
- Author
-
Huang, Yu-Hsiu, Krause, Elisabeth, Xu, Jiachuan, Eifler, Tim, S., Pranjal R., and Huff, Eric
- Subjects
Astrophysics - Cosmology and Nongalactic Astrophysics - Abstract
Kinematic lensing (KL) is a new weak lensing technique that reduces shape noise for disk galaxies by including spectroscopically measured galaxy kinematics in addition to photometrically measured galaxy shapes. Since KL utilizes the Tully-Fisher relation, any correlation of this relation with the local environment may bias the cosmological interpretation. For the first time, we explore such a Tully-Fisher environmental dependence (TED) effect as a potential astrophysical systematic for KL. Our derivation of the TED systematic can be described in a similar analytical form as intrinsic alignment for traditional weak lensing. We demonstrate analytically that TED only impacts KL if intrinsic aligment for disk galaxies is non-zero. We further use IllustrisTNG simulations to quantify the TED effect. Our two-point correlation measurements do not yield any additional coherent signals that would indicate a systematic bias on KL, within the uncertainties set by the simulation volume., Comment: 12 pages, 5 figures, published in PRD
- Published
- 2024
- Full Text
- View/download PDF
227. GrINd: Grid Interpolation Network for Scattered Observations
- Author
-
Dulny, Andrzej, Heinisch, Paul, Hotho, Andreas, and Krause, Anna
- Subjects
Computer Science - Machine Learning - Abstract
Predicting the evolution of spatiotemporal physical systems from sparse and scattered observational data poses a significant challenge in various scientific domains. Traditional methods rely on dense grid-structured data, limiting their applicability in scenarios with sparse observations. To address this challenge, we introduce GrINd (Grid Interpolation Network for Scattered Observations), a novel network architecture that leverages the high-performance of grid-based models by mapping scattered observations onto a high-resolution grid using a Fourier Interpolation Layer. In the high-resolution space, a NeuralPDE-class model predicts the system's state at future timepoints using differentiable ODE solvers and fully convolutional neural networks parametrizing the system's dynamics. We empirically evaluate GrINd on the DynaBench benchmark dataset, comprising six different physical systems observed at scattered locations, demonstrating its state-of-the-art performance compared to existing models. GrINd offers a promising approach for forecasting physical systems from sparse, scattered observational data, extending the applicability of deep learning methods to real-world scenarios with limited data availability.
- Published
- 2024
228. Global Vegetation Modeling with Pre-Trained Weather Transformers
- Author
-
Janetzky, Pascal, Gallusser, Florian, Hentschel, Simon, Hotho, Andreas, and Krause, Anna
- Subjects
Computer Science - Machine Learning - Abstract
Accurate vegetation models can produce further insights into the complex interaction between vegetation activity and ecosystem processes. Previous research has established that long-term trends and short-term variability of temperature and precipitation affect vegetation activity. Motivated by the recent success of Transformer-based Deep Learning models for medium-range weather forecasting, we adapt the publicly available pre-trained FourCastNet to model vegetation activity while accounting for the short-term dynamics of climate variability. We investigate how the learned global representation of the atmosphere's state can be transferred to model the normalized difference vegetation index (NDVI). Our model globally estimates vegetation activity at a resolution of \SI{0.25}{\degree} while relying only on meteorological data. We demonstrate that leveraging pre-trained weather models improves the NDVI estimates compared to learning an NDVI model from scratch. Additionally, we compare our results to other recent data-driven NDVI modeling approaches from machine learning and ecology literature. We further provide experimental evidence on how much data and training time is necessary to turn FourCastNet into an effective vegetation model. Code and models will be made available upon publication., Comment: Tackling Climate Change with Machine Learning Workshop @ ICLR 2024
- Published
- 2024
229. A PAC-Bayesian Framework for Optimal Control with Stability Guarantees
- Author
-
Boroujeni, Mahrokh Ghoddousi, Galimberti, Clara Lucía, Krause, Andreas, and Ferrari-Trecate, Giancarlo
- Subjects
Electrical Engineering and Systems Science - Systems and Control - Abstract
Stochastic Nonlinear Optimal Control (SNOC) involves minimizing a cost function that averages out the random uncertainties affecting the dynamics of nonlinear systems. For tractability reasons, this problem is typically addressed by minimizing an empirical cost, which represents the average cost across a finite dataset of sampled disturbances. However, this approach raises the challenge of quantifying the control performance against out-of-sample uncertainties. Particularly, in scenarios where the training dataset is small, SNOC policies are prone to overfitting, resulting in significant discrepancies between the empirical cost and the true cost, i.e., the average SNOC cost incurred during control deployment. Therefore, establishing generalization bounds on the true cost is crucial for ensuring reliability in real-world applications. In this paper, we introduce a novel approach that leverages PAC-Bayes theory to provide rigorous generalization bounds for SNOC. Based on these bounds, we propose a new method for designing optimal controllers, offering a principled way to incorporate prior knowledge into the synthesis process, which aids in improving the control policy and mitigating overfitting. Furthermore, by leveraging recent parametrizations of stabilizing controllers for nonlinear systems, our framework inherently ensures closed-loop stability. The effectiveness of our proposed method in incorporating prior knowledge and combating overfitting is shown by designing neural network controllers for tasks in cooperative robotics.
- Published
- 2024
230. Speaker Distance Estimation in Enclosures from Single-Channel Audio
- Author
-
Neri, Michael, Politis, Archontis, Krause, Daniel, Carli, Marco, and Virtanen, Tuomas
- Subjects
Electrical Engineering and Systems Science - Audio and Speech Processing ,Computer Science - Sound - Abstract
Distance estimation from audio plays a crucial role in various applications, such as acoustic scene analysis, sound source localization, and room modeling. Most studies predominantly center on employing a classification approach, where distances are discretized into distinct categories, enabling smoother model training and achieving higher accuracy but imposing restrictions on the precision of the obtained sound source position. Towards this direction, in this paper we propose a novel approach for continuous distance estimation from audio signals using a convolutional recurrent neural network with an attention module. The attention mechanism enables the model to focus on relevant temporal and spectral features, enhancing its ability to capture fine-grained distance-related information. To evaluate the effectiveness of our proposed method, we conduct extensive experiments using audio recordings in controlled environments with three levels of realism (synthetic room impulse response, measured response with convolved speech, and real recordings) on four datasets (our synthetic dataset, QMULTIMIT, VoiceHome-2, and STARSS23). Experimental results show that the model achieves an absolute error of 0.11 meters in a noiseless synthetic scenario. Moreover, the results showed an absolute error of about 1.30 meters in the hybrid scenario. The algorithm's performance in the real scenario, where unpredictable environmental factors and noise are prevalent, yields an absolute error of approximately 0.50 meters. For reproducible research purposes we make model, code, and synthetic datasets available at https://github.com/michaelneri/audio-distance-estimation., Comment: Accepted for publication in IEEE/ACM Transactions on Audio, Speech, and Language Processing
- Published
- 2024
- Full Text
- View/download PDF
231. Continuous, Subject-Specific Attribute Control in T2I Models by Identifying Semantic Directions
- Author
-
Baumann, Stefan Andreas, Krause, Felix, Neumayr, Michael, Stracke, Nick, Hu, Vincent Tao, and Ommer, Björn
- Subjects
Computer Science - Computer Vision and Pattern Recognition ,Computer Science - Artificial Intelligence ,Computer Science - Machine Learning - Abstract
In recent years, advances in text-to-image (T2I) diffusion models have substantially elevated the quality of their generated images. However, achieving fine-grained control over attributes remains a challenge due to the limitations of natural language prompts (such as no continuous set of intermediate descriptions existing between ``person'' and ``old person''). Even though many methods were introduced that augment the model or generation process to enable such control, methods that do not require a fixed reference image are limited to either enabling global fine-grained attribute expression control or coarse attribute expression control localized to specific subjects, not both simultaneously. We show that there exist directions in the commonly used token-level CLIP text embeddings that enable fine-grained subject-specific control of high-level attributes in text-to-image models. Based on this observation, we introduce one efficient optimization-free and one robust optimization-based method to identify these directions for specific attributes from contrastive text prompts. We demonstrate that these directions can be used to augment the prompt text input with fine-grained control over attributes of specific subjects in a compositional manner (control over multiple attributes of a single subject) without having to adapt the diffusion model. Project page: https://compvis.github.io/attribute-control. Code is available at https://github.com/CompVis/attribute-control., Comment: Project page: https://compvis.github.io/attribute-control
- Published
- 2024
232. Bridging the Sim-to-Real Gap with Bayesian Inference
- Author
-
Rothfuss, Jonas, Sukhija, Bhavya, Treven, Lenart, Dörfler, Florian, Coros, Stelian, and Krause, Andreas
- Subjects
Computer Science - Robotics ,Computer Science - Machine Learning - Abstract
We present SIM-FSVGD for learning robot dynamics from data. As opposed to traditional methods, SIM-FSVGD leverages low-fidelity physical priors, e.g., in the form of simulators, to regularize the training of neural network models. While learning accurate dynamics already in the low data regime, SIM-FSVGD scales and excels also when more data is available. We empirically show that learning with implicit physical priors results in accurate mean model estimation as well as precise uncertainty quantification. We demonstrate the effectiveness of SIM-FSVGD in bridging the sim-to-real gap on a high-performance RC racecar system. Using model-based RL, we demonstrate a highly dynamic parking maneuver with drifting, using less than half the data compared to the state of the art.
- Published
- 2024
233. Sound Event Detection and Localization with Distance Estimation
- Author
-
Krause, Daniel Aleksander, Politis, Archontis, and Mesaros, Annamaria
- Subjects
Computer Science - Sound ,Computer Science - Machine Learning ,Electrical Engineering and Systems Science - Audio and Speech Processing - Abstract
Sound Event Detection and Localization (SELD) is a combined task of identifying sound events and their corresponding direction-of-arrival (DOA). While this task has numerous applications and has been extensively researched in recent years, it fails to provide full information about the sound source position. In this paper, we overcome this problem by extending the task to Sound Event Detection, Localization with Distance Estimation (3D SELD). We study two ways of integrating distance estimation within the SELD core - a multi-task approach, in which the problem is tackled by a separate model output, and a single-task approach obtained by extending the multi-ACCDOA method to include distance information. We investigate both methods for the Ambisonic and binaural versions of STARSS23: Sony-TAU Realistic Spatial Soundscapes 2023. Moreover, our study involves experiments on the loss function related to the distance estimation part. Our results show that it is possible to perform 3D SELD without any degradation of performance in sound event detection and DOA estimation., Comment: This paper has been accepted for the 32nd European Signal Processing Conference EUSIPCO 2024 in Lyon
- Published
- 2024
234. Machine Learning LSST 3x2pt analyses -- forecasting the impact of systematics on cosmological constraints using neural networks
- Author
-
Boruah, Supranta S., Eifler, Tim, Miranda, Vivian, Farah, Elyas, Motka, Jay, Krause, Elisabeth, Fang, Xiao, Rogozenski, Paul, and Collaboration, The LSST Dark Energy Science
- Subjects
Astrophysics - Cosmology and Nongalactic Astrophysics - Abstract
Validating modeling choices through simulated analyses and quantifying the impact of different systematic effects will form a major computational bottleneck in the preparation for 3$\times$2 analysis with Stage-IV surveys such as Vera Rubin Observatory's Legacy Survey of Space and Time (LSST). We can significantly reduce the computational requirements by using machine learning based emulators, which allow us to run fast inference while maintaining the full realism of the data analysis pipeline. In this paper, we use such an emulator to run simulated 3$\times$2 (cosmic shear, galaxy-galaxy lensing, and galaxy clustering) analyses for mock LSST-Y1/Y3/Y6/Y10 surveys and study the impact of various systematic effects (galaxy bias, intrinsic alignment, baryonic physics, shear calibration and photo-$z$ uncertainties). Closely following the DESC Science Requirement Document (with several updates) our main findings are: {\it a)} The largest contribution to the `systematic error budget' of LSST 3$\times$2 analysis comes from galaxy bias uncertainties, while the contribution of baryonic and shear calibration uncertainties are significantly less important. {\it b)} Tighter constraints on intrinsic alignment and photo-$z$ parameters can improve cosmological constraints noticeably, which illustrates synergies of LSST and spectroscopic surveys. {\it c)} The scale cuts adopted in the DESC SRD may be too conservative and pushing to smaller scales can increase cosmological information significantly. {\it d)} We investigate the impact of photo-$z$ outliers on 3$\times$2 pt analysis and find that we need to determine the outlier fraction to within $5-10\%$ accuracy to ensure robust cosmological analysis. We caution that these findings depend on analysis choices (parameterizations, priors, scale cuts) and can change for different settings., Comment: 16 pages, 10 Figures, To be submitted to PRD. Comments welcome
- Published
- 2024
235. Attention-Based Neural Network Emulators for Multi-Probe Data Vectors Part II: Assessing Tension Metrics
- Author
-
Saraivanov, Evan, Zhong, Kunhao, Miranda, Vivian, Boruah, Supranta S., Eifler, Tim, and Krause, Elisabeth
- Subjects
Astrophysics - Cosmology and Nongalactic Astrophysics ,Astrophysics - Instrumentation and Methods for Astrophysics - Abstract
The next generation of cosmological surveys is expected to generate unprecedented high-quality data, consequently increasing the already substantial computational costs of Bayesian statistical methods. This will pose a significant challenge to analyzing theoretical models of cosmology. Additionally, new mitigation techniques of baryonic effects, intrinsic alignment, and other systematic effects will inevitably introduce more parameters, slowing down the convergence of Bayesian analyses. In this scenario, machine-learning-based accelerators are a promising solution, capable of reducing the computational costs and execution time of such tools by order of thousands. Yet, they have not been able to provide accurate predictions over the wide prior ranges in parameter space adopted by Stage III/IV collaborations in studies employing real-space two-point correlation functions. This paper offers a leap in this direction by carefully investigating the modern transformer-based neural network (NN) architectures in realistic simulated Rubin Observatory year one cosmic shear $\Lambda$CDM inferences. Building on the framework introduced in Part I, we generalize the transformer block and incorporate additional layer types to develop a more versatile architecture. We present a scalable method to efficiently generate an extensive training dataset that significantly exceeds the scope of prior volumes considered in Part I, while still meeting strict accuracy standards. Through our meticulous architecture comparison and comprehensive hyperparameter optimization, we establish that the attention-based architecture performs an order of magnitude better in accuracy than widely adopted NN designs. Finally, we test and apply our emulators to calibrate tension metrics.
- Published
- 2024
236. Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
- Author
-
Gemini Team, Georgiev, Petko, Lei, Ving Ian, Burnell, Ryan, Bai, Libin, Gulati, Anmol, Tanzer, Garrett, Vincent, Damien, Pan, Zhufeng, Wang, Shibo, Mariooryad, Soroosh, Ding, Yifan, Geng, Xinyang, Alcober, Fred, Frostig, Roy, Omernick, Mark, Walker, Lexi, Paduraru, Cosmin, Sorokin, Christina, Tacchetti, Andrea, Gaffney, Colin, Daruki, Samira, Sercinoglu, Olcan, Gleicher, Zach, Love, Juliette, Voigtlaender, Paul, Jain, Rohan, Surita, Gabriela, Mohamed, Kareem, Blevins, Rory, Ahn, Junwhan, Zhu, Tao, Kawintiranon, Kornraphop, Firat, Orhan, Gu, Yiming, Zhang, Yujing, Rahtz, Matthew, Faruqui, Manaal, Clay, Natalie, Gilmer, Justin, Co-Reyes, JD, Penchev, Ivo, Zhu, Rui, Morioka, Nobuyuki, Hui, Kevin, Haridasan, Krishna, Campos, Victor, Mahdieh, Mahdis, Guo, Mandy, Hassan, Samer, Kilgour, Kevin, Vezer, Arpi, Cheng, Heng-Tze, de Liedekerke, Raoul, Goyal, Siddharth, Barham, Paul, Strouse, DJ, Noury, Seb, Adler, Jonas, Sundararajan, Mukund, Vikram, Sharad, Lepikhin, Dmitry, Paganini, Michela, Garcia, Xavier, Yang, Fan, Valter, Dasha, Trebacz, Maja, Vodrahalli, Kiran, Asawaroengchai, Chulayuth, Ring, Roman, Kalb, Norbert, Soares, Livio Baldini, Brahma, Siddhartha, Steiner, David, Yu, Tianhe, Mentzer, Fabian, He, Antoine, Gonzalez, Lucas, Xu, Bibo, Kaufman, Raphael Lopez, Shafey, Laurent El, Oh, Junhyuk, Hennigan, Tom, Driessche, George van den, Odoom, Seth, Lucic, Mario, Roelofs, Becca, Lall, Sid, Marathe, Amit, Chan, Betty, Ontanon, Santiago, He, Luheng, Teplyashin, Denis, Lai, Jonathan, Crone, Phil, Damoc, Bogdan, Ho, Lewis, Riedel, Sebastian, Lenc, Karel, Yeh, Chih-Kuan, Chowdhery, Aakanksha, Xu, Yang, Kazemi, Mehran, Amid, Ehsan, Petrushkina, Anastasia, Swersky, Kevin, Khodaei, Ali, Chen, Gowoon, Larkin, Chris, Pinto, Mario, Yan, Geng, Badia, Adria Puigdomenech, Patil, Piyush, Hansen, Steven, Orr, Dave, Arnold, Sebastien M. R., Grimstad, Jordan, Dai, Andrew, Douglas, Sholto, Sinha, Rishika, Yadav, Vikas, Chen, Xi, Gribovskaya, Elena, Austin, Jacob, Zhao, Jeffrey, Patel, Kaushal, Komarek, Paul, Austin, Sophia, Borgeaud, Sebastian, Friso, Linda, Goyal, Abhimanyu, Caine, Ben, Cao, Kris, Chung, Da-Woon, Lamm, Matthew, Barth-Maron, Gabe, Kagohara, Thais, Olszewska, Kate, Chen, Mia, Shivakumar, Kaushik, Agarwal, Rishabh, Godhia, Harshal, Rajwar, Ravi, Snaider, Javier, Dotiwalla, Xerxes, Liu, Yuan, Barua, Aditya, Ungureanu, Victor, Zhang, Yuan, Batsaikhan, Bat-Orgil, Wirth, Mateo, Qin, James, Danihelka, Ivo, Doshi, Tulsee, Chadwick, Martin, Chen, Jilin, Jain, Sanil, Le, Quoc, Kar, Arjun, Gurumurthy, Madhu, Li, Cheng, Sang, Ruoxin, Liu, Fangyu, Lamprou, Lampros, Munoz, Rich, Lintz, Nathan, Mehta, Harsh, Howard, Heidi, Reynolds, Malcolm, Aroyo, Lora, Wang, Quan, Blanco, Lorenzo, Cassirer, Albin, Griffith, Jordan, Das, Dipanjan, Lee, Stephan, Sygnowski, Jakub, Fisher, Zach, Besley, James, Powell, Richard, Ahmed, Zafarali, Paulus, Dominik, Reitter, David, Borsos, Zalan, Joshi, Rishabh, Pope, Aedan, Hand, Steven, Selo, Vittorio, Jain, Vihan, Sethi, Nikhil, Goel, Megha, Makino, Takaki, May, Rhys, Yang, Zhen, Schalkwyk, Johan, Butterfield, Christina, Hauth, Anja, Goldin, Alex, Hawkins, Will, Senter, Evan, Brin, Sergey, Woodman, Oliver, Ritter, Marvin, Noland, Eric, Giang, Minh, Bolina, Vijay, Lee, Lisa, Blyth, Tim, Mackinnon, Ian, Reid, Machel, Sarvana, Obaid, Silver, David, Chen, Alexander, Wang, Lily, Maggiore, Loren, Chang, Oscar, Attaluri, Nithya, Thornton, Gregory, Chiu, Chung-Cheng, Bunyan, Oskar, Levine, Nir, Chung, Timothy, Eltyshev, Evgenii, Si, Xiance, Lillicrap, Timothy, Brady, Demetra, Aggarwal, Vaibhav, Wu, Boxi, Xu, Yuanzhong, McIlroy, Ross, Badola, Kartikeya, Sandhu, Paramjit, Moreira, Erica, Stokowiec, Wojciech, Hemsley, Ross, Li, Dong, Tudor, Alex, Shyam, Pranav, Rahimtoroghi, Elahe, Haykal, Salem, Sprechmann, Pablo, Zhou, Xiang, Mincu, Diana, Li, Yujia, Addanki, Ravi, Krishna, Kalpesh, Wu, Xiao, Frechette, Alexandre, Eyal, Matan, Dafoe, Allan, Lacey, Dave, Whang, Jay, Avrahami, Thi, Zhang, Ye, Taropa, Emanuel, Lin, Hanzhao, Toyama, Daniel, Rutherford, Eliza, Sano, Motoki, Choe, HyunJeong, Tomala, Alex, Safranek-Shrader, Chalence, Kassner, Nora, Pajarskas, Mantas, Harvey, Matt, Sechrist, Sean, Fortunato, Meire, Lyu, Christina, Elsayed, Gamaleldin, Kuang, Chenkai, Lottes, James, Chu, Eric, Jia, Chao, Chen, Chih-Wei, Humphreys, Peter, Baumli, Kate, Tao, Connie, Samuel, Rajkumar, Santos, Cicero Nogueira dos, Andreassen, Anders, Rakićević, Nemanja, Grewe, Dominik, Kumar, Aviral, Winkler, Stephanie, Caton, Jonathan, Brock, Andrew, Dalmia, Sid, Sheahan, Hannah, Barr, Iain, Miao, Yingjie, Natsev, Paul, Devlin, Jacob, Behbahani, Feryal, Prost, Flavien, Sun, Yanhua, Myaskovsky, Artiom, Pillai, Thanumalayan Sankaranarayana, Hurt, Dan, Lazaridou, Angeliki, Xiong, Xi, Zheng, Ce, Pardo, Fabio, Li, Xiaowei, Horgan, Dan, Stanton, Joe, Ambar, Moran, Xia, Fei, Lince, Alejandro, Wang, Mingqiu, Mustafa, Basil, Webson, Albert, Lee, Hyo, Anil, Rohan, Wicke, Martin, Dozat, Timothy, Sinha, Abhishek, Piqueras, Enrique, Dabir, Elahe, Upadhyay, Shyam, Boral, Anudhyan, Hendricks, Lisa Anne, Fry, Corey, Djolonga, Josip, Su, Yi, Walker, Jake, Labanowski, Jane, Huang, Ronny, Misra, Vedant, Chen, Jeremy, Skerry-Ryan, RJ, Singh, Avi, Rijhwani, Shruti, Yu, Dian, Castro-Ros, Alex, Changpinyo, Beer, Datta, Romina, Bagri, Sumit, Hrafnkelsson, Arnar Mar, Maggioni, Marcello, Zheng, Daniel, Sulsky, Yury, Hou, Shaobo, Paine, Tom Le, Yang, Antoine, Riesa, Jason, Rogozinska, Dominika, Marcus, Dror, Badawy, Dalia El, Zhang, Qiao, Wang, Luyu, Miller, Helen, Greer, Jeremy, Sjos, Lars Lowe, Nova, Azade, Zen, Heiga, Chaabouni, Rahma, Rosca, Mihaela, Jiang, Jiepu, Chen, Charlie, Liu, Ruibo, Sainath, Tara, Krikun, Maxim, Polozov, Alex, Lespiau, Jean-Baptiste, Newlan, Josh, Cankara, Zeyncep, Kwak, Soo, Xu, Yunhan, Chen, Phil, Coenen, Andy, Meyer, Clemens, Tsihlas, Katerina, Ma, Ada, Gottweis, Juraj, Xing, Jinwei, Gu, Chenjie, Miao, Jin, Frank, Christian, Cankara, Zeynep, Ganapathy, Sanjay, Dasgupta, Ishita, Hughes-Fitt, Steph, Chen, Heng, Reid, David, Rong, Keran, Fan, Hongmin, van Amersfoort, Joost, Zhuang, Vincent, Cohen, Aaron, Gu, Shixiang Shane, Mohananey, Anhad, Ilic, Anastasija, Tobin, Taylor, Wieting, John, Bortsova, Anna, Thacker, Phoebe, Wang, Emma, Caveness, Emily, Chiu, Justin, Sezener, Eren, Kaskasoli, Alex, Baker, Steven, Millican, Katie, Elhawaty, Mohamed, Aisopos, Kostas, Lebsack, Carl, Byrd, Nathan, Dai, Hanjun, Jia, Wenhao, Wiethoff, Matthew, Davoodi, Elnaz, Weston, Albert, Yagati, Lakshman, Ahuja, Arun, Gao, Isabel, Pundak, Golan, Zhang, Susan, Azzam, Michael, Sim, Khe Chai, Caelles, Sergi, Keeling, James, Sharma, Abhanshu, Swing, Andy, Li, YaGuang, Liu, Chenxi, Bostock, Carrie Grimes, Bansal, Yamini, Nado, Zachary, Anand, Ankesh, Lipschultz, Josh, Karmarkar, Abhijit, Proleev, Lev, Ittycheriah, Abe, Yeganeh, Soheil Hassas, Polovets, George, Faust, Aleksandra, Sun, Jiao, Rrustemi, Alban, Li, Pen, Shivanna, Rakesh, Liu, Jeremiah, Welty, Chris, Lebron, Federico, Baddepudi, Anirudh, Krause, Sebastian, Parisotto, Emilio, Soricut, Radu, Xu, Zheng, Bloxwich, Dawn, Johnson, Melvin, Neyshabur, Behnam, Mao-Jones, Justin, Wang, Renshen, Ramasesh, Vinay, Abbas, Zaheer, Guez, Arthur, Segal, Constant, Nguyen, Duc Dung, Svensson, James, Hou, Le, York, Sarah, Milan, Kieran, Bridgers, Sophie, Gworek, Wiktor, Tagliasacchi, Marco, Lee-Thorp, James, Chang, Michael, Guseynov, Alexey, Hartman, Ale Jakse, Kwong, Michael, Zhao, Ruizhe, Kashem, Sheleem, Cole, Elizabeth, Miech, Antoine, Tanburn, Richard, Phuong, Mary, Pavetic, Filip, Cevey, Sebastien, Comanescu, Ramona, Ives, Richard, Yang, Sherry, Du, Cosmo, Li, Bo, Zhang, Zizhao, Iinuma, Mariko, Hu, Clara Huiyi, Roy, Aurko, Bijwadia, Shaan, Zhu, Zhenkai, Martins, Danilo, Saputro, Rachel, Gergely, Anita, Zheng, Steven, Jia, Dawei, Antonoglou, Ioannis, Sadovsky, Adam, Gu, Shane, Bi, Yingying, Andreev, Alek, Samangooei, Sina, Khan, Mina, Kocisky, Tomas, Filos, Angelos, Kumar, Chintu, Bishop, Colton, Yu, Adams, Hodkinson, Sarah, Mittal, Sid, Shah, Premal, Moufarek, Alexandre, Cheng, Yong, Bloniarz, Adam, Lee, Jaehoon, Pejman, Pedram, Michel, Paul, Spencer, Stephen, Feinberg, Vladimir, Xiong, Xuehan, Savinov, Nikolay, Smith, Charlotte, Shakeri, Siamak, Tran, Dustin, Chesus, Mary, Bohnet, Bernd, Tucker, George, von Glehn, Tamara, Muir, Carrie, Mao, Yiran, Kazawa, Hideto, Slone, Ambrose, Soparkar, Kedar, Shrivastava, Disha, Cobon-Kerr, James, Sharman, Michael, Pavagadhi, Jay, Araya, Carlos, Misiunas, Karolis, Ghelani, Nimesh, Laskin, Michael, Barker, David, Li, Qiujia, Briukhov, Anton, Houlsby, Neil, Glaese, Mia, Lakshminarayanan, Balaji, Schucher, Nathan, Tang, Yunhao, Collins, Eli, Lim, Hyeontaek, Feng, Fangxiaoyu, Recasens, Adria, Lai, Guangda, Magni, Alberto, De Cao, Nicola, Siddhant, Aditya, Ashwood, Zoe, Orbay, Jordi, Dehghani, Mostafa, Brennan, Jenny, He, Yifan, Xu, Kelvin, Gao, Yang, Saroufim, Carl, Molloy, James, Wu, Xinyi, Arnold, Seb, Chang, Solomon, Schrittwieser, Julian, Buchatskaya, Elena, Radpour, Soroush, Polacek, Martin, Giordano, Skye, Bapna, Ankur, Tokumine, Simon, Hellendoorn, Vincent, Sottiaux, Thibault, Cogan, Sarah, Severyn, Aliaksei, Saleh, Mohammad, Thakoor, Shantanu, Shefey, Laurent, Qiao, Siyuan, Gaba, Meenu, Chang, Shuo-yiin, Swanson, Craig, Zhang, Biao, Lee, Benjamin, Rubenstein, Paul Kishan, Song, Gan, Kwiatkowski, Tom, Koop, Anna, Kannan, Ajay, Kao, David, Schuh, Parker, Stjerngren, Axel, Ghiasi, Golnaz, Gibson, Gena, Vilnis, Luke, Yuan, Ye, Ferreira, Felipe Tiengo, Kamath, Aishwarya, Klimenko, Ted, Franko, Ken, Xiao, Kefan, Bhattacharya, Indro, Patel, Miteyan, Wang, Rui, Morris, Alex, Strudel, Robin, Sharma, Vivek, Choy, Peter, Hashemi, Sayed Hadi, Landon, Jessica, Finkelstein, Mara, Jhakra, Priya, Frye, Justin, Barnes, Megan, Mauger, Matthew, Daun, Dennis, Baatarsukh, Khuslen, Tung, Matthew, Farhan, Wael, Michalewski, Henryk, Viola, Fabio, Quitry, Felix de Chaumont, Lan, Charline Le, Hudson, Tom, Wang, Qingze, Fischer, Felix, Zheng, Ivy, White, Elspeth, Dragan, Anca, Alayrac, Jean-baptiste, Ni, Eric, Pritzel, Alexander, Iwanicki, Adam, Isard, Michael, Bulanova, Anna, Zilka, Lukas, Dyer, Ethan, Sachan, Devendra, Srinivasan, Srivatsan, Muckenhirn, Hannah, Cai, Honglong, Mandhane, Amol, Tariq, Mukarram, Rae, Jack W., Wang, Gary, Ayoub, Kareem, FitzGerald, Nicholas, Zhao, Yao, Han, Woohyun, Alberti, Chris, Garrette, Dan, Krishnakumar, Kashyap, Gimenez, Mai, Levskaya, Anselm, Sohn, Daniel, Matak, Josip, Iturrate, Inaki, Chang, Michael B., Xiang, Jackie, Cao, Yuan, Ranka, Nishant, Brown, Geoff, Hutter, Adrian, Mirrokni, Vahab, Chen, Nanxin, Yao, Kaisheng, Egyed, Zoltan, Galilee, Francois, Liechty, Tyler, Kallakuri, Praveen, Palmer, Evan, Ghemawat, Sanjay, Liu, Jasmine, Tao, David, Thornton, Chloe, Green, Tim, Jasarevic, Mimi, Lin, Sharon, Cotruta, Victor, Tan, Yi-Xuan, Fiedel, Noah, Yu, Hongkun, Chi, Ed, Neitz, Alexander, Heitkaemper, Jens, Sinha, Anu, Zhou, Denny, Sun, Yi, Kaed, Charbel, Hulse, Brice, Mishra, Swaroop, Georgaki, Maria, Kudugunta, Sneha, Farabet, Clement, Shafran, Izhak, Vlasic, Daniel, Tsitsulin, Anton, Ananthanarayanan, Rajagopal, Carin, Alen, Su, Guolong, Sun, Pei, V, Shashank, Carvajal, Gabriel, Broder, Josef, Comsa, Iulia, Repina, Alena, Wong, William, Chen, Warren Weilun, Hawkins, Peter, Filonov, Egor, Loher, Lucia, Hirnschall, Christoph, Wang, Weiyi, Ye, Jingchen, Burns, Andrea, Cate, Hardie, Wright, Diana Gage, Piccinini, Federico, Zhang, Lei, Lin, Chu-Cheng, Gog, Ionel, Kulizhskaya, Yana, Sreevatsa, Ashwin, Song, Shuang, Cobo, Luis C., Iyer, Anand, Tekur, Chetan, Garrido, Guillermo, Xiao, Zhuyun, Kemp, Rupert, Zheng, Huaixiu Steven, Li, Hui, Agarwal, Ananth, Ngani, Christel, Goshvadi, Kati, Santamaria-Fernandez, Rebeca, Fica, Wojciech, Chen, Xinyun, Gorgolewski, Chris, Sun, Sean, Garg, Roopal, Ye, Xinyu, Eslami, S. M. Ali, Hua, Nan, Simon, Jon, Joshi, Pratik, Kim, Yelin, Tenney, Ian, Potluri, Sahitya, Thiet, Lam Nguyen, Yuan, Quan, Luisier, Florian, Chronopoulou, Alexandra, Scellato, Salvatore, Srinivasan, Praveen, Chen, Minmin, Koverkathu, Vinod, Dalibard, Valentin, Xu, Yaming, Saeta, Brennan, Anderson, Keith, Sellam, Thibault, Fernando, Nick, Huot, Fantine, Jung, Junehyuk, Varadarajan, Mani, Quinn, Michael, Raul, Amit, Le, Maigo, Habalov, Ruslan, Clark, Jon, Jalan, Komal, Bullard, Kalesha, Singhal, Achintya, Luong, Thang, Wang, Boyu, Rajayogam, Sujeevan, Eisenschlos, Julian, Jia, Johnson, Finchelstein, Daniel, Yakubovich, Alex, Balle, Daniel, Fink, Michael, Agarwal, Sameer, Li, Jing, Dvijotham, Dj, Pal, Shalini, Kang, Kai, Konzelmann, Jaclyn, Beattie, Jennifer, Dousse, Olivier, Wu, Diane, Crocker, Remi, Elkind, Chen, Jonnalagadda, Siddhartha Reddy, Lee, Jong, Holtmann-Rice, Dan, Kallarackal, Krystal, Liu, Rosanne, Vnukov, Denis, Vats, Neera, Invernizzi, Luca, Jafari, Mohsen, Zhou, Huanjie, Taylor, Lilly, Prendki, Jennifer, Wu, Marcus, Eccles, Tom, Liu, Tianqi, Kopparapu, Kavya, Beaufays, Francoise, Angermueller, Christof, Marzoca, Andreea, Sarcar, Shourya, Dib, Hilal, Stanway, Jeff, Perbet, Frank, Trdin, Nejc, Sterneck, Rachel, Khorlin, Andrey, Li, Dinghua, Wu, Xihui, Goenka, Sonam, Madras, David, Goldshtein, Sasha, Gierke, Willi, Zhou, Tong, Liu, Yaxin, Liang, Yannie, White, Anais, Li, Yunjie, Singh, Shreya, Bahargam, Sanaz, Epstein, Mark, Basu, Sujoy, Lao, Li, Ozturel, Adnan, Crous, Carl, Zhai, Alex, Lu, Han, Tung, Zora, Gaur, Neeraj, Walton, Alanna, Dixon, Lucas, Zhang, Ming, Globerson, Amir, Uy, Grant, Bolt, Andrew, Wiles, Olivia, Nasr, Milad, Shumailov, Ilia, Selvi, Marco, Piccinno, Francesco, Aguilar, Ricardo, McCarthy, Sara, Khalman, Misha, Shukla, Mrinal, Galic, Vlado, Carpenter, John, Villela, Kevin, Zhang, Haibin, Richardson, Harry, Martens, James, Bosnjak, Matko, Belle, Shreyas Rammohan, Seibert, Jeff, Alnahlawi, Mahmoud, McWilliams, Brian, Singh, Sankalp, Louis, Annie, Ding, Wen, Popovici, Dan, Simicich, Lenin, Knight, Laura, Mehta, Pulkit, Gupta, Nishesh, Shi, Chongyang, Fatehi, Saaber, Mitrovic, Jovana, Grills, Alex, Pagadora, Joseph, Munkhdalai, Tsendsuren, Petrova, Dessie, Eisenbud, Danielle, Zhang, Zhishuai, Yates, Damion, Mittal, Bhavishya, Tripuraneni, Nilesh, Assael, Yannis, Brovelli, Thomas, Jain, Prateek, Velimirovic, Mihajlo, Akbulut, Canfer, Mu, Jiaqi, Macherey, Wolfgang, Kumar, Ravin, Xu, Jun, Qureshi, Haroon, Comanici, Gheorghe, Wiesner, Jeremy, Gong, Zhitao, Ruddock, Anton, Bauer, Matthias, Felt, Nick, GP, Anirudh, Arnab, Anurag, Zelle, Dustin, Rothfuss, Jonas, Rosgen, Bill, Shenoy, Ashish, Seybold, Bryan, Li, Xinjian, Mudigonda, Jayaram, Erdogan, Goker, Xia, Jiawei, Simsa, Jiri, Michi, Andrea, Yao, Yi, Yew, Christopher, Kan, Steven, Caswell, Isaac, Radebaugh, Carey, Elisseeff, Andre, Valenzuela, Pedro, McKinney, Kay, Paterson, Kim, Cui, Albert, Latorre-Chimoto, Eri, Kim, Solomon, Zeng, William, Durden, Ken, Ponnapalli, Priya, Sosea, Tiberiu, Choquette-Choo, Christopher A., Manyika, James, Robenek, Brona, Vashisht, Harsha, Pereira, Sebastien, Lam, Hoi, Velic, Marko, Owusu-Afriyie, Denese, Lee, Katherine, Bolukbasi, Tolga, Parrish, Alicia, Lu, Shawn, Park, Jane, Venkatraman, Balaji, Talbert, Alice, Rosique, Lambert, Cheng, Yuchung, Sozanschi, Andrei, Paszke, Adam, Kumar, Praveen, Austin, Jessica, Li, Lu, Salama, Khalid, Perz, Bartek, Kim, Wooyeol, Dukkipati, Nandita, Baryshnikov, Anthony, Kaplanis, Christos, Sheng, XiangHai, Chervonyi, Yuri, Unlu, Caglar, Casas, Diego de Las, Askham, Harry, Tunyasuvunakool, Kathryn, Gimeno, Felix, Poder, Siim, Kwak, Chester, Miecnikowski, Matt, Dimitriev, Alek, Parisi, Aaron, Liu, Dangyi, Tsai, Tomy, Shevlane, Toby, Kouridi, Christina, Garmon, Drew, Goedeckemeyer, Adrian, Brown, Adam R., Vijayakumar, Anitha, Elqursh, Ali, Jazayeri, Sadegh, Huang, Jin, Carthy, Sara Mc, Hoover, Jay, Kim, Lucy, Kumar, Sandeep, Chen, Wei, Biles, Courtney, Bingham, Garrett, Rosen, Evan, Wang, Lisa, Tan, Qijun, Engel, David, Pongetti, Francesco, de Cesare, Dario, Hwang, Dongseong, Yu, Lily, Pullman, Jennifer, Narayanan, Srini, Levin, Kyle, Gopal, Siddharth, Li, Megan, Aharoni, Asaf, Trinh, Trieu, Lo, Jessica, Casagrande, Norman, Vij, Roopali, Matthey, Loic, Ramadhana, Bramandia, Matthews, Austin, Carey, CJ, Johnson, Matthew, Goranova, Kremena, Shah, Rohin, Ashraf, Shereen, Dasgupta, Kingshuk, Larsen, Rasmus, Wang, Yicheng, Vuyyuru, Manish Reddy, Jiang, Chong, Ijazi, Joana, Osawa, Kazuki, Smith, Celine, Boppana, Ramya Sree, Bilal, Taylan, Koizumi, Yuma, Xu, Ying, Altun, Yasemin, Shabat, Nir, Bariach, Ben, Korchemniy, Alex, Choo, Kiam, Ronneberger, Olaf, Iwuanyanwu, Chimezie, Zhao, Shubin, Soergel, David, Hsieh, Cho-Jui, Cai, Irene, Iqbal, Shariq, Sundermeyer, Martin, Chen, Zhe, Bursztein, Elie, Malaviya, Chaitanya, Biadsy, Fadi, Shroff, Prakash, Dhillon, Inderjit, Latkar, Tejasi, Dyer, Chris, Forbes, Hannah, Nicosia, Massimo, Nikolaev, Vitaly, Greene, Somer, Georgiev, Marin, Wang, Pidong, Martin, Nina, Sedghi, Hanie, Zhang, John, Banzal, Praseem, Fritz, Doug, Rao, Vikram, Wang, Xuezhi, Zhang, Jiageng, Patraucean, Viorica, Du, Dayou, Mordatch, Igor, Jurin, Ivan, Liu, Lewis, Dubey, Ayush, Mohan, Abhi, Nowakowski, Janek, Ion, Vlad-Doru, Wei, Nan, Tojo, Reiko, Raad, Maria Abi, Hudson, Drew A., Keshava, Vaishakh, Agrawal, Shubham, Ramirez, Kevin, Wu, Zhichun, Nguyen, Hoang, Liu, Ji, Sewak, Madhavi, Petrini, Bryce, Choi, DongHyun, Philips, Ivan, Wang, Ziyue, Bica, Ioana, Garg, Ankush, Wilkiewicz, Jarek, Agrawal, Priyanka, Guo, Danhao, Xue, Emily, Shaik, Naseer, Leach, Andrew, Khan, Sadh MNM, Wiesinger, Julia, Jerome, Sammy, Chakladar, Abhishek, Wang, Alek Wenjiao, Ornduff, Tina, Abu, Folake, Ghaffarkhah, Alireza, Wainwright, Marcus, Cortes, Mario, Liu, Frederick, Maynez, Joshua, Terzis, Andreas, Samangouei, Pouya, Mansour, Riham, Kępa, Tomasz, Aubet, François-Xavier, Algymr, Anton, Banica, Dan, Weisz, Agoston, Orban, Andras, Senges, Alexandre, Andrejczuk, Ewa, Geller, Mark, Santo, Niccolo Dal, Anklin, Valentin, Merey, Majd Al, Baeuml, Martin, Strohman, Trevor, Bai, Junwen, Petrov, Slav, Wu, Yonghui, Hassabis, Demis, Kavukcuoglu, Koray, Dean, Jeff, and Vinyals, Oriol
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence - Abstract
In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February version on the great majority of capabilities and benchmarks; (2) Gemini 1.5 Flash, a more lightweight variant designed for efficiency with minimal regression in quality. Gemini 1.5 models achieve near-perfect recall on long-context retrieval tasks across modalities, improve the state-of-the-art in long-document QA, long-video QA and long-context ASR, and match or surpass Gemini 1.0 Ultra's state-of-the-art performance across a broad set of benchmarks. Studying the limits of Gemini 1.5's long-context ability, we find continued improvement in next-token prediction and near-perfect retrieval (>99%) up to at least 10M tokens, a generational leap over existing models such as Claude 3.0 (200k) and GPT-4 Turbo (128k). Finally, we highlight real-world use cases, such as Gemini 1.5 collaborating with professionals on completing their tasks achieving 26 to 75% time savings across 10 different job categories, as well as surprising new capabilities of large language models at the frontier; when given a grammar manual for Kalamang, a language with fewer than 200 speakers worldwide, the model learns to translate English to Kalamang at a similar level to a person who learned from the same content.
- Published
- 2024
237. Emission lines due to ionizing radiation from a compact object in the remnant of Supernova 1987A
- Author
-
Fransson, C., Barlow, M. J., Kavanagh, P. J., Larsson, J., Jones, O. C., Sargent, B., Meixner, M., Bouchet, P., Temim, T., Wright, G. S., Blommaert, J. A. D. L., Habel, N., Hirschauer, A. S., Hjorth, J., Lenkić, L., Tikkanen, T., Wesson, R., Coulais, A., Fox, O. D., Gastaud, R., Glasse, A., Jaspers, J., Krause, O., Lau, R. M., Nayak, O., Rest, A., Colina, L., van Dishoeck, E. F., Gudel, M., Henning, Th., Lagage, P. -O., Őstlin, G., Ray, T. P., and Vandenbussche, B.
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,Astrophysics - Solar and Stellar Astrophysics - Abstract
The nearby Supernova 1987A was accompanied by a burst of neutrino emission, which indicates that a compact object (a neutron star or black hole) was formed in the explosion. There has been no direct observation of this compact object. In this work, we observe the supernova remnant with JWST spectroscopy finding narrow infrared emission lines of argon and sulphur. The line emission is spatially unresolved and blueshifted in velocity relative to the supernova rest frame. We interpret the lines as gas illuminated by a source of ionizing photons located close to the center of the expanding ejecta. Photoionization models show that the line ratios are consistent with ionization by a cooling neutron star or pulsar wind nebula. The velocity shift could be evidence for a neutron star natal kick., Comment: Authors version of manuscript published in Science on 22 Feb 2024
- Published
- 2024
- Full Text
- View/download PDF
238. Quasiparticle effects in magnetic-field-resilient 3D transmons
- Author
-
Krause, J., Marchegiani, G., Janssen, L. M., Catelani, G., Ando, Yoichi, and Dickel, C.
- Subjects
Quantum Physics ,Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Superconductivity - Abstract
Recent research shows that quasiparticle-induced decoherence of superconducting qubits depends on the superconducting-gap asymmetry originating from the different thicknesses of the top and bottom films in Al/AlO$_x$/Al junctions. Magnetic field is a key tuning knob to investigate this dependence as it can change the superconducting gaps in situ. We present measurements of the parity-switching time of a field-resilient 3D transmon with in-plane field up to 0.41T. At low fields, small parity splitting requires qutrit pulse sequences for parity measurements. We measure a non-monotonic evolution of the parity lifetime with in-plane magnetic field, increasing up to 0.2T, followed by a decrease at higher fields. We demonstrate that the superconducting-gap asymmetry plays a crucial role in the observed behavior. At zero field, the qubit frequency is nearly resonant with the superconducting-gap difference, favoring the energy exchange with the quasiparticles and so enhancing the parity-switching rate. With a higher magnetic field, the qubit frequency decreases and gets detuned from the gap difference, causing the initial increase of the parity lifetime, while photon-assisted qubit transitions increase, producing the subsequent decrease at higher fields. Besides giving a deeper insight into the parity-switching mechanism in conventional transmon qubits, we establish that Al-AlO$_x$-Al JJs could be used in architectures for the parity-readout and manipulation of topological qubits based on Majorana zero modes.
- Published
- 2024
- Full Text
- View/download PDF
239. Probing AGN jet precession with LISA
- Author
-
Steinle, Nathan, Gerosa, Davide, and Krause, Martin G. H.
- Subjects
Astrophysics - High Energy Astrophysical Phenomena ,General Relativity and Quantum Cosmology - Abstract
The precession of astrophysical jets produced by active-galactic nuclei is likely related to the dynamics of the accretion disks surrounding the central supermassive black holes (BHs) from which jets are launched. The two main mechanisms that can drive jet precession arise from Lense-Thirring precession and tidal torquing. These can explain direct and indirect observations of precessing jets; however, such explanations often utilize crude approximations of the disk evolution and observing jet precession can be challenging with electromagnetic facilities. Simultaneously, the Laser Interferometer Space Antenna (LISA) is expected to measure gravitational waves from the mergers of massive binary BHs with high accuracy and probe their progenitor evolution. In this paper, we connect the LISA detectability of binary BH mergers to the possible jet precession during their progenitor evolution. We make use of a semi-analytic model that self-consistently treats disk-driven BH alignment and binary inspiral and includes the possibility of disk breaking. We find that tidal torquing of the accretion disk provides a wide range of jet precession timescales depending on the binary separation and the spin direction of the BH from which the jet is launched. Efficient disk-driven BH alignment results in shorter timescales of $\sim 1$ yr which are correlated with higher LISA signal-to-noise ratios. Disk breaking results in the longest possible times of $\sim 10^7$ yrs, suggesting a deep interplay between the disk critical obliquity (i.e. where the disk breaks) and jet precession. Studies such as ours will help to reveal the cosmic population of precessing jets that are detectable with gravitational waves.
- Published
- 2024
240. Magnetic-field dependence of a Josephson traveling-wave parametric amplifier and integration into a high-field setup
- Author
-
Janssen, L. M., Butseraen, G., Krause, J., Coissard, A., Planat, L., Roch, N., Catelani, G., Ando, Yoichi, and Dickel, C.
- Subjects
Quantum Physics ,Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Superconductivity - Abstract
We investigate the effect of magnetic field on a photonic-crystal Josephson traveling-wave parametric amplifier (TWPA). We show that the observed change in photonic bandgap and plasma frequency of the TWPA can be modeled by considering the suppression of the critical current in the Josephson junctions (JJs) of the TWPA due to the Fraunhofer effect and closing of the superconducting gap. Accounting for the JJ geometry is crucial for understanding the field dependence. In one in-plane direction, the TWPA bandgap can be shifted by 2 GHz using up to 60 mT of field, without losing gain or bandwidth, showing that TWPAs without SQUIDs can be field tunable. In the other in-plane direction, the magnetic field is perpendicular to the larger side of the Josephson junctions, so the Fraunhofer effect has a smaller period. This larger side of the JJs is modulated to create the bandgap. The field interacts more strongly with the larger junctions, and as a result, the TWPA bandgap closes and reopens as the field increases, causing the TWPA to become severely compromised already at 2 mT. A slightly higher operating limit of 5 mT is found in out-of-plane field, for which the TWPA's response is hysteretic. These measurements reveal the requirements for magnetic shielding needed to use TWPAs in experiments where high fields at the sample are required; we show that with magnetic shields we can operate the TWPA while applying over 2 T to the sample.
- Published
- 2024
- Full Text
- View/download PDF
241. Attention-based Neural Network Emulators for Multi-Probe Data Vectors Part I: Forecasting the Growth-Geometry split
- Author
-
Zhong, Kunhao, Saraivanov, Evan, Caputi, James, Miranda, Vivian, Boruah, Supranta S., Eifler, Tim, and Krause, Elisabeth
- Subjects
Astrophysics - Cosmology and Nongalactic Astrophysics - Abstract
We present a new class of machine-learning emulators that accurately model the cosmic shear, galaxy-galaxy lensing, and galaxy clustering real space correlation functions in the context of Rubin Observatory year one simulated data. To illustrate its capabilities in forecasting models beyond the standard $\Lambda$CDM, we forecast how well LSST Year 1 data will be able to probe the consistency between geometry $\Omega^{\rm geo}_\mathrm{m}$ and growth $\Omega^{\rm growth}_\mathrm{m}$ dark matter densities in the so-called split $\Lambda$CDM parameterization. When trained with a few million samples, our emulator shows uniform accuracy across a wide range in an 18-dimensional parameter space. We provide a detailed comparison of three neural network designs, illustrating the importance of adopting state-of-the-art Transformer blocks. Our study also details their performance when computing Bayesian evidence for cosmic shear on three fiducial cosmologies. The transformers-based emulator is always accurate within PolyChord's precision. As an application, we use our emulator to study the degeneracies between dark energy models and growth geometry split parameterizations. We find that the growth-geometry split remains to be a meaningful test of the smooth dark energy assumption., Comment: 16 pages, 6 figures
- Published
- 2024
242. Two-dimensional photonic crystal cavities in ZnSe quantum well structures
- Author
-
Qiao, Siqi, Driesch, Nils von den, Chen, Xi, Trellenkamp, Stefan, Lentz, Florian, Krause, Christoph, Bennemann, Benjamin, Brazda, Thorsten, LeBeau, James M., and Pawlis, Alexander
- Subjects
Condensed Matter - Materials Science - Abstract
ZnSe and related materials like ZnMgSe and ZnCdSe are promising II-VI host materials for optically mediated quantum information technology such as single photon sources or spin qubits. Integrating these heterostructures into photonic crystal (PC) cavities enables further improvements, for example realizing Purcell-enhanced single photon sources with increased quantum efficiency. Here we report on the successful implementation of two-dimensional (2D) PC cavities in strained ZnSe quantum wells (QW) on top of a novel AlAs supporting layer. This approach overcomes typical obstacles associated with PC membrane fabrication in strained materials, such as cracks and strain relaxation in the corresponding devices. We demonstrate the attainment of the required mechanical stability in our PC devices, complete strain retainment and effective vertical optical confinement. Structural analysis of our PC cavities reveals excellent etching anisotropy. Additionally, elemental mapping in a scanning transmission electron microscope confirms the transformation of AlAs into AlOx by post-growth wet oxidation and reveals partial oxidation of ZnMgSe at the etched sidewalls in the PC. This knowledge is utilized to tailor FDTD simulations and to extract the ZnMgSe dispersion relation with small oxygen content. Optical characterization of the PC cavities with cross-polarized resonance scattering spectroscopy verifies the presence of cavity modes. The excellent agreement between simulation and measured cavity mode energies demonstrates wide tunability of the PC cavity and proves the pertinence of our model. This implementation of 2D PC cavities in the ZnSe material system establishes a solid foundation for future developments of ZnSe quantum devices.
- Published
- 2024
243. Flux-periodic supercurrent oscillations in an Aharonov-Bohm-type nanowire Josephson junction
- Author
-
Zellekens, Patrick, Deacon, Russell S., Basaric, Farah, Juluri, Raghavendra, Randle, Michael D., Bennemann, Benjamin, Krause, Christoph, Zimmermann, Erik, Sanchez, Ana M., Grützmacher, Detlev, Pawlis, Alexander, Ishibashi, Koji, and Schäpers, Thomas
- Subjects
Condensed Matter - Mesoscale and Nanoscale Physics ,Condensed Matter - Superconductivity - Abstract
Phase winding effects in hollow semiconductor nanowires with superconducting shells have been proposed as a route to engineer topological superconducting states. We investigate GaAs/InAs core/shell nanowires with half-shells of epitaxial aluminium as a potential platform for such devices, where the thin InAs shell confines the electron wave function around the GaAs core. With normal contacts we observed pronounced $h/e$ flux periodic oscillations in the magnetoconductance, indicating the presence of a tubular conductive channel in the InAs shell. Conversely, the switching current in Josephson junctions oscillates with approximately half that period, i.e. $h/2e$, indicating transport via Andreev transport processes in the junction enclosing threading magnetic flux. On these structures, we systematically studied the gate-, field-, and temperature-dependent evolution of the supercurrent. Results indicate that Andreev transport processes can occur about the wire circumference indicating full proximitization of the InAs shell from the half-shell superconducting contacts., Comment: 13 pages, 6 figures
- Published
- 2024
244. JWST MIRI Imager Observations of Supernova SN 1987A
- Author
-
Bouchet, P., Gastaud, R., Coulais, A., Barlow, M. J., Fransson, C., Kavanagh, P. J., Larsson, J., Temim, T., Jones, O. C., Hirschauer, A. S., Tikkanen, T., Blommaert, J. A. D. L., Fox, O. D., Glasse, A., Habel, N., Hjorth, J., Jaspers, J., Krause, O., Lau, R. M., Lenkić, L., Meixner, M., Nayak, O., Rest, A., Sargent, B., Wesson, R., Wright, G. S., Colina, L., Van Dishoeck, E. F., Güdel, M., Henning, Th., Lagage, P. -O., Östlin, G., Ray, T. P., and Vandenbussche, B.
- Subjects
Astrophysics - Solar and Stellar Astrophysics ,Astrophysics - High Energy Astrophysical Phenomena - Abstract
There exist very few mid-infrared (IR) observations of supernovae (SNe) in general. Therefore, SN 1987A, the closest visible SN in 400 years, gives us the opportunity to explore the mid-IR properties of SNe, the dust in their ejecta and surrounding medium, and to witness the birth of a SN remnant (SNR). The James Webb Space Telescope (JWST), with its high spatial resolution and extreme sensitivity, gives a new view on these issues. We report on the first imaging observations obtained with the Mid-InfraRed Instrument (MIRI). We build temperature maps and discuss the morphology of the nascent SNR. Our results show that the temperatures in the equatorial ring (ER) are quite non-uniform. This could be due to dust destruction in some parts of the ring, as had been assumed in some previous works. We show that the IR emission extends beyond the ER, illustrating the fact that the shock wave has now passed through this ring to affect the circumstellar medium on a larger scale. Finally, while sub-mm Atacama Large Millimeter Array (ALMA) observations have hinted at the location of the compact remnant of SN 1987A, we note that our MIRI data have found no such evidence., Comment: 19 pages, 19 figures, 2 tables; Accepted for publication in the Astrophysical Journal (February 2, 2024)
- Published
- 2024
245. Integrating Additive Multigrid with Multipreconditioned Conjugate Gradient Method
- Author
-
Kothari, Hardik, Nestola, Maria Giuseppina Chiara, Favino, Marco, and Krause, Rolf
- Subjects
Mathematics - Numerical Analysis - Abstract
Due to its optimal complexity, the multigrid (MG) method is one of the most popular approaches for solving large-scale linear systems arising from the discretization of partial differential equations. However, the parallel implementation of standard MG methods, which are inherently multiplicative, suffers from increasing communication complexity. In such cases, the additive variants of MG methods provide a good alternative due to their inherently parallel nature, although they exhibit slower convergence. This work combines the additive multigrid method with the multipreconditioned conjugate gradient (MPCG) method. In the proposed approach, the MPCG method employs the corrections from the different levels of the MG hierarchy as separate preconditioned search directions. In this approach, the MPCG method updates the current iterate by using the linear combination of the preconditioned search directions, where the optimal coefficients for the linear combination are computed by exploiting the energy norm minimization of the CG method. The idea behind our approach is to combine the $A$-conjugacy of the search directions of the MPCG method and the quasi $H_1$-orthogonality of the corrections from the MG hierarchy. In the numerical section, we study the performance of the proposed method compared to the standard additive and multiplicative MG methods used as preconditioners for the CG method.
- Published
- 2024
246. Language-Driven Engineering An Interdisciplinary Software Development Paradigm
- Author
-
Steffen, Bernhard, Margaria, Tiziana, Bainczyk, Alexander, Boßelmann, Steve, Busch, Daniel, Driessen, Marc, Frohme, Markus, Howar, Falk, Jörges, Sven, Krause, Marvin, Krumrey, Marco, Lamprecht, Anna-Lena, Lybecait, Michael, Murtovi, Alnis, Naujokat, Stefan, Neubauer, Johannes, Schieweck, Alexander, Schürmann, Jonas, Smyth, Steven, Steffen, Barbara, Storek, Fabian, Tegeler, Tim, Teumert, Sebastian, Wirkner, Dominic, and Zweihoff, Philip
- Subjects
Computer Science - Software Engineering ,Computer Science - Programming Languages - Abstract
We illustrate how purpose-specific, graphical modeling enables application experts with different levels of expertise to collaboratively design and then produce complex applications using their individual, purpose-specific modeling language. Our illustration includes seven graphical Integrated Modeling Environments (IMEs) that support full code generation, as well as four browser-based applications that were modeled and then fully automatically generated and produced using DIME, our most complex graphical IME. While the seven IMEs were chosen to illustrate the types of languages we support with our Language-Driven Engineering (LDE) approach, the four DIME products were chosen to give an impression of the power of our LDE-generated IMEs. In fact, Equinocs, Springer Nature's future editorial system for proceedings, is also being fully automatically generated and then deployed at their Dordrecht site using a deployment pipeline generated with Rig, one of the IMEs presented. Our technology is open source and the products presented are currently in use., Comment: 43 pages, 30 figures
- Published
- 2024
247. Asymptotic spectral properties and preconditioning of an approximated nonlocal Helmholtz equation with Caputo fractional Laplacian and variable coefficient wave number $\mu$
- Author
-
Adriani, Andrea, Sormani, Rosita Luisa, Tablino-Possio, Cristina, Krause, Rolf, and Serra-Capizzano, Stefano
- Subjects
Mathematics - Numerical Analysis ,65F08, 35R11, 65N22, 15A18, 47B35 - Abstract
The current study investigates the asymptotic spectral properties of a finite difference approximation of nonlocal Helmholtz equations with a Caputo fractional Laplacian and a variable coefficient wave number $\mu$, as it occurs when considering a wave propagation in complex media, characterized by nonlocal interactions and spatially varying wave speeds. More specifically, by using tools from Toeplitz and generalized locally Toeplitz theory, the present research delves into the spectral analysis of nonpreconditioned and preconditioned matrix-sequences. We report numerical evidences supporting the theoretical findings. Finally, open problems and potential extensions in various directions are presented and briefly discussed., Comment: 28 pages, 10 figures. arXiv admin note: text overlap with arXiv:2206.05171 by other authors
- Published
- 2024
248. Are Odd Radio Circles phoenixes of powerful radio galaxies?
- Author
-
Shabala, Stanislav, Yates-Jones, Patrick, Jerrim, Larissa, Turner, Ross, Krause, Martin, Norris, Ray, Koribalski, Baerbel, Filipovic, Miroslav, Rudnick, Larry, Power, Chris, and Crocker, Roland
- Subjects
Astrophysics - Astrophysics of Galaxies ,Astrophysics - Cosmology and Nongalactic Astrophysics ,Astrophysics - High Energy Astrophysical Phenomena - Abstract
Odd Radio Circles (ORCs) are a class of low surface brightness, circular objects approximately one arcminute in diameter. ORCs were recently discovered in the Australian Square Kilometre Array Pathfinder (ASKAP) data, and subsequently confirmed with follow-up observations on other instruments, yet their origins remain uncertain. In this paper, we suggest that ORCs could be remnant lobes of powerful radio galaxies, re-energised by the passage of a shock. Using relativistic hydrodynamic simulations with synchrotron emission calculated in post-processing, we show that buoyant evolution of remnant radio lobes is alone too slow to produce the observed ORC morphology. However, the passage of a shock can produce both filled and edge-brightnened ORC-like morphologies for a wide variety of shock and observing orientations. Circular ORCs are predicted to have host galaxies near the geometric centre of the radio emission, consistent with observations of these objects. Significantly offset hosts are possible for elliptical ORCs, potentially causing challenges for accurate host galaxy identification. Observed ORC number counts are broadly consistent with a paradigm in which moderately powerful radio galaxies are their progenitors., Comment: 25 pages, 15 figures. Accepted for publication in PASA
- Published
- 2024
249. Transductive Active Learning: Theory and Applications
- Author
-
Hübotter, Jonas, Sukhija, Bhavya, Treven, Lenart, As, Yarden, and Krause, Andreas
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
We study a generalization of classical active learning to real-world settings with concrete prediction targets where sampling is restricted to an accessible region of the domain, while prediction targets may lie outside this region. We analyze a family of decision rules that sample adaptively to minimize uncertainty about prediction targets. We are the first to show, under general regularity assumptions, that such decision rules converge uniformly to the smallest possible uncertainty obtainable from the accessible data. We demonstrate their strong sample efficiency in two key applications: active fine-tuning of large neural networks and safe Bayesian optimization, where they achieve state-of-the-art performance., Comment: accepted in NeurIPS 2024. arXiv admin note: text overlap with arXiv:2402.15441
- Published
- 2024
250. Active Few-Shot Fine-Tuning
- Author
-
Hübotter, Jonas, Sukhija, Bhavya, Treven, Lenart, As, Yarden, and Krause, Andreas
- Subjects
Computer Science - Machine Learning ,Computer Science - Artificial Intelligence - Abstract
We study the question: How can we select the right data for fine-tuning to a specific task? We call this data selection problem active fine-tuning and show that it is an instance of transductive active learning, a novel generalization of classical active learning. We propose ITL, short for information-based transductive learning, an approach which samples adaptively to maximize information gained about the specified task. We are the first to show, under general regularity assumptions, that such decision rules converge uniformly to the smallest possible uncertainty obtainable from the accessible data. We apply ITL to the few-shot fine-tuning of large neural networks and show that fine-tuning with ITL learns the task with significantly fewer examples than the state-of-the-art.
- Published
- 2024
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.