Author: "Kohler, P." - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Kohler, P."' showing total 7,028 results

Start Over Author "Kohler, P."

7,028 results on '"Kohler, P."'

1. FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

Author: Anagnostidis, Sotiris, Bachmann, Gregor, Kim, Yeongmin, Kohler, Jonas, Georgopoulos, Markos, Sanakoyeu, Artsiom, Du, Yuming, Pumarola, Albert, Thabet, Ali, and Schönfeld, Edgar
Subjects: Computer Science - Machine Learning, Computer Science - Computer Vision and Pattern Recognition
Abstract: Despite their remarkable performance, modern Diffusion Transformers are hindered by substantial resource requirements during inference, stemming from the fixed and large amount of compute needed for each denoising step. In this work, we revisit the conventional static paradigm that allocates a fixed compute budget per denoising iteration and propose a dynamic strategy instead. Our simple and sample-efficient framework enables pre-trained DiT models to be converted into \emph{flexible} ones -- dubbed FlexiDiT -- allowing them to process inputs at varying compute budgets. We demonstrate how a single \emph{flexible} model can generate images without any drop in quality, while reducing the required FLOPs by more than $40$\% compared to their static counterparts, for both class-conditioned and text-conditioned image generation. Our method is general and agnostic to input and conditioning modalities. We show how our approach can be readily extended for video generation, where FlexiDiT models generate samples with up to $75$\% less compute without compromising performance.
Published: 2025

2. Judge Decoding: Faster Speculative Sampling Requires Going Beyond Model Alignment

Author: Bachmann, Gregor, Anagnostidis, Sotiris, Pumarola, Albert, Georgopoulos, Markos, Sanakoyeu, Artsiom, Du, Yuming, Schönfeld, Edgar, Thabet, Ali, and Kohler, Jonas
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language
Abstract: The performance of large language models (LLMs) is closely linked to their underlying size, leading to ever-growing networks and hence slower inference. Speculative decoding has been proposed as a technique to accelerate autoregressive generation, leveraging a fast draft model to propose candidate tokens, which are then verified in parallel based on their likelihood under the target model. While this approach guarantees to reproduce the target output, it incurs a substantial penalty: many high-quality draft tokens are rejected, even when they represent objectively valid continuations. Indeed, we show that even powerful draft models such as GPT-4o, as well as human text cannot achieve high acceptance rates under the standard verification scheme. This severely limits the speedup potential of current speculative decoding methods, as an early rejection becomes overwhelmingly likely when solely relying on alignment of draft and target. We thus ask the following question: Can we adapt verification to recognize correct, but non-aligned replies? To this end, we draw inspiration from the LLM-as-a-judge framework, which demonstrated that LLMs are able to rate answers in a versatile way. We carefully design a dataset to elicit the same capability in the target model by training a compact module on top of the embeddings to produce ``judgements" of the current continuation. We showcase our strategy on the Llama-3.1 family, where our 8b/405B-Judge achieves a speedup of 9x over Llama-405B, while maintaining its quality on a large range of benchmarks. These benefits remain present even in optimized inference frameworks, where our method reaches up to 141 tokens/s for 8B/70B-Judge and 129 tokens/s for 8B/405B on 2 and 8 H100s respectively.
Published: 2025

3. ColNet: Collaborative Optimization in Decentralized Federated Multi-task Learning Systems

Author: Feng, Chao, Kohler, Nicolas Fazli, Celdran, Alberto Huertas, Bovet, Gerome, and Stiller, Burkhard
Subjects: Computer Science - Machine Learning
Abstract: The integration of Federated Learning (FL) and Multi-Task Learning (MTL) has been explored to address client heterogeneity, with Federated Multi-Task Learning (FMTL) treating each client as a distinct task. However, most existing research focuses on data heterogeneity (e.g., addressing non-IID data) rather than task heterogeneity, where clients solve fundamentally different tasks. Additionally, much of the work relies on centralized settings with a server managing the federation, leaving the more challenging domain of decentralized FMTL largely unexplored. Thus, this work bridges this gap by proposing ColNet, a framework designed for heterogeneous tasks in decentralized federated environments. ColNet divides models into the backbone and task-specific layers, forming groups of similar clients, with group leaders performing conflict-averse cross-group aggregation. A pool of experiments with different federations demonstrated ColNet outperforms the compared aggregation schemes in decentralized settings with label and task heterogeneity scenarios.
Published: 2025

4. Reducing radiation exposure in pediatric cervical spine imaging for trauma: a multi-disciplinary quality improvement initiative.

Author: Yu, Nina, Kohler, Jonathan, Grether-Jones, Kendra, Murphy, Maureen, and Zwienenberg, Marike
Subjects: Imaging, Pediatric, Quality improvement, Trauma, Humans, Female, Male, Child, Child, Preschool, Tomography, X-Ray Computed, Quality Improvement, Cervical Vertebrae, Retrospective Studies, Infant, Adolescent, Radiation Exposure, Spinal Injuries, Guideline Adherence, Infant, Newborn, Trauma Centers
Abstract: PURPOSE: Pediatric cervical spine injury (PCSI) can result in devastating neurologic disability. While computed tomography (CT) imaging is both sensitive and specific in detecting clinically significant injuries, indiscriminate utilization can lead to excessive ionizing radiation exposure. A routine institutional audit revealed CTs were inappropriately obtained 54% of the time. This study evaluates the effects of an updated protocol to reduce radiation exposure in pediatric trauma patients. METHODS: Data were retrospectively analyzed from a pediatric level 1 trauma center from 2021 to 2022. The data were divided into two cohorts, pre-implementation (2021) and post-implementation (2022). Inclusion criteria were patients 0-14 years old with a Glasgow Coma Scale (GCS) ranging 9-15. Outside-hospital transfers were excluded. The primary study endpoints were guideline compliance and CT utilization. RESULTS: A total of 82 subjects were enrolled in this study. In 2021, there were 38 subjects (female/male 15/23, mean age 5.9 years old) with an average GCS of 13.6. In 2022, there were 44 subjects (female/male 19/25, mean age 5.2 years old) with an average GCS of 14.0. In 2021, the overall protocol adherence rate was 81.6%, and post-implementation in 2022, compliance was 93.2% (p = 0.109). Following implementation, the rate of inappropriate (protocol non-adherent CT) use decreased from 58.6 to 6.8% (p
Published: 2025

5. Holographic pseudoentanglement and the complexity of the AdS/CFT dictionary

Author: Akers, Chris, Bouland, Adam, Chen, Lijie, Kohler, Tamara, Metger, Tony, and Vazirani, Umesh
Subjects: High Energy Physics - Theory, Quantum Physics
Abstract: The `quantum gravity in the lab' paradigm suggests that quantum computers might shed light on quantum gravity by simulating the CFT side of the AdS/CFT correspondence and mapping the results to the AdS side. This relies on the assumption that the duality map (the `dictionary') is efficient to compute. In this work, we show that the complexity of the AdS/CFT dictionary is surprisingly subtle: there might be cases in which one can efficiently apply operators to the CFT state (a task we call 'operator reconstruction') without being able to extract basic properties of the dual bulk state such as its geometry (which we call 'geometry reconstruction'). Geometry reconstruction corresponds to the setting where we want to extract properties of a completely unknown bulk dual from a simulated CFT boundary state. We demonstrate that geometry reconstruction may be generically hard due to the connection between geometry and entanglement in holography. In particular we construct ensembles of states whose entanglement approximately obey the Ryu-Takayanagi formula for arbitrary geometries, but which are nevertheless computationally indistinguishable. This suggests that even for states with the special entanglement structure of holographic CFT states, geometry reconstruction might be hard. This result should be compared with existing evidence that operator reconstruction is generically easy in AdS/CFT. A useful analogy for the difference between these two tasks is quantum fully homomorphic encryption (FHE): this encrypts quantum states in such a way that no efficient adversary can learn properties of the state, but operators can be applied efficiently to the encrypted state. We show that quantum FHE can separate the complexity of geometry reconstruction vs operator reconstruction, which raises the question whether FHE could be a useful lens through which to view AdS/CFT., Comment: 45 pages, 9 figures
Published: 2024

6. Movie Gen: A Cast of Media Foundation Models

Author: Polyak, Adam, Zohar, Amit, Brown, Andrew, Tjandra, Andros, Sinha, Animesh, Lee, Ann, Vyas, Apoorv, Shi, Bowen, Ma, Chih-Yao, Chuang, Ching-Yao, Yan, David, Choudhary, Dhruv, Wang, Dingkang, Sethi, Geet, Pang, Guan, Ma, Haoyu, Misra, Ishan, Hou, Ji, Wang, Jialiang, Jagadeesh, Kiran, Li, Kunpeng, Zhang, Luxin, Singh, Mannat, Williamson, Mary, Le, Matt, Yu, Matthew, Singh, Mitesh Kumar, Zhang, Peizhao, Vajda, Peter, Duval, Quentin, Girdhar, Rohit, Sumbaly, Roshan, Rambhatla, Sai Saketh, Tsai, Sam, Azadi, Samaneh, Datta, Samyak, Chen, Sanyuan, Bell, Sean, Ramaswamy, Sharadh, Sheynin, Shelly, Bhattacharya, Siddharth, Motwani, Simran, Xu, Tao, Li, Tianhe, Hou, Tingbo, Hsu, Wei-Ning, Yin, Xi, Dai, Xiaoliang, Taigman, Yaniv, Luo, Yaqiao, Liu, Yen-Cheng, Wu, Yi-Chiao, Zhao, Yue, Kirstain, Yuval, He, Zecheng, He, Zijian, Pumarola, Albert, Thabet, Ali, Sanakoyeu, Artsiom, Mallya, Arun, Guo, Baishan, Araya, Boris, Kerr, Breena, Wood, Carleigh, Liu, Ce, Peng, Cen, Vengertsev, Dimitry, Schonfeld, Edgar, Blanchard, Elliot, Juefei-Xu, Felix, Nord, Fraylie, Liang, Jeff, Hoffman, John, Kohler, Jonas, Fire, Kaolin, Sivakumar, Karthik, Chen, Lawrence, Yu, Licheng, Gao, Luya, Georgopoulos, Markos, Moritz, Rashel, Sampson, Sara K., Li, Shikai, Parmeggiani, Simone, Fine, Steve, Fowler, Tara, Petrovic, Vladan, and Du, Yuming
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Electrical Engineering and Systems Science - Image and Video Processing
Abstract: We present Movie Gen, a cast of foundation models that generates high-quality, 1080p HD videos with different aspect ratios and synchronized audio. We also show additional capabilities such as precise instruction-based video editing and generation of personalized videos based on a user's image. Our models set a new state-of-the-art on multiple tasks: text-to-video synthesis, video personalization, video editing, video-to-audio generation, and text-to-audio generation. Our largest video generation model is a 30B parameter transformer trained with a maximum context length of 73K video tokens, corresponding to a generated video of 16 seconds at 16 frames-per-second. We show multiple technical innovations and simplifications on the architecture, latent spaces, training objectives and recipes, data curation, evaluation protocols, parallelization techniques, and inference optimizations that allow us to reap the benefits of scaling pre-training data, model size, and training compute for training large scale media generation models. We hope this paper helps the research community to accelerate progress and innovation in media generation models. All videos from this paper are available at https://go.fb.me/MovieGenResearchVideos.
Published: 2024

7. PORTAL: Scalable Tabular Foundation Models via Content-Specific Tokenization

Author: Spinaci, Marco, Polewczyk, Marek, Hoffart, Johannes, Kohler, Markus C., Thelin, Sam, and Klein, Tassilo
Subjects: Computer Science - Machine Learning
Abstract: Self-supervised learning on tabular data seeks to apply advances from natural language and image domains to the diverse domain of tables. However, current techniques often struggle with integrating multi-domain data and require data cleaning or specific structural requirements, limiting the scalability of pre-training datasets. We introduce PORTAL (Pretraining One-Row-at-a-Time for All tabLes), a framework that handles various data modalities without the need for cleaning or preprocessing. This simple yet powerful approach can be effectively pre-trained on online-collected datasets and fine-tuned to match state-of-the-art methods on complex classification and regression tasks. This work offers a practical advancement in self-supervised learning for large-scale tabular data., Comment: Accepted at Table Representation Learning Workshop at NeurIPS 2024
Published: 2024

8. 3D printing by two-photon polymerization of hollow microneedles for interstitial fluid extraction

Author: Silva, Tiago Elias Abi-Ramia, Kohler, Stephan, Bartzsch, Nicolas, Beuschlein, Felix, and Guentner, Andreas T.
Subjects: Physics - Medical Physics
Abstract: Dermal interstitial fluid (ISF) is a rich source of biomarkers (e.g., glucose) that can be used for continuous health monitoring with wearable sensors. Hollow microneedle devices are a promising solution to extract ISF on demand by penetrating the skin with minimal pain. However, they rely on inserting bio-incompatible materials (e.g., silicon) into individuals, limiting the application time. Here, the direct 3D printing of polymer hollow microneedles on silicon-based microfluidic devices and the successful in-vivo extraction of ISF are demonstrated. Our additive manufacturing approach enables the versatile combination of materials and rapid prototyping of microneedle geometry. After improving the design through finite element modeling, a hollow microneedle geometry was printed by two-photon polymerization and experimentally characterized with mechanical and fluidic tests. Microneedles were fabricated with high accuracy (i.e., 997 +/- 2 um) and reliably interfaced with the microfluidic chip (i.e., centerline alignment within 5% of diameter). The needles demonstrated sufficient mechanical strength (i.e., 411 +/- 3 mN per needle) to endure at least 10 consecutive insertions into simulated skin. Biocompatibility and ISF extraction were demonstrated in an in-vivo 72-hour test, showing the safety and reliability of our approach. Such a platform is promising for minimally invasive, continuous monitoring of biomarkers in ISF, aiding in medical diagnoses and personalized health treatments.
Published: 2024

9. Music-triggered fashion design: from songs to the metaverse

Author: Delgado, Martina, Llopart, Marta, Sarabia, Eva, Taboada, Sandra, Vierge, Pol, Vilariño, Fernando, Kohler, Joan Moya, Golijov, Julieta Grimberg, and Bilkis, Matías
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Computers and Society, Computer Science - Social and Information Networks
Abstract: The advent of increasingly-growing virtual realities poses unprecedented opportunities and challenges to different societies. Artistic collectives are not an exception, and we here aim to put special attention into musicians. Compositions, lyrics and even show-advertisements are constituents of a message that artists transmit about their reality. As such, artistic creations are ultimately linked to feelings and emotions, with aesthetics playing a crucial role when it comes to transmit artist's intentions. In this context, we here analyze how virtual realities can help to broaden the opportunities for musicians to bridge with their audiences, by devising a dynamical fashion-design recommendation system inspired by sound stimulus. We present our first steps towards re-defining musical experiences in the metaverse, opening up alternative opportunities for artists to connect both with real and virtual (\textit{e.g.} machine-learning agents operating in the metaverse) in potentially broader ways.
Published: 2024

10. You Only Need to Change Your Direction: A Look at the Potential Impact of ChatGPT on Education

Author: Kyle Kohler
Abstract: ChatGPT, a generative AI program developed by OpenAI, has raised serious questions about the future of education since its launch in November 2022. This paper argues that ChatGPT has the potential to redefine existing educational theories and the role of teachers in language education. Specifically, the paper examines ChatGPT's impact on language education and suggests ways to incorporate it into teaching practice. Since people have been proclaiming that AI would replace teachers since the 1970s, the history of AI in education is relevant to this paper. Therefore, the paper first discusses the history of AI in education to provide context and highlight how ChatGPT fits into this larger conversation. Afterwards, the paper then explores how ChatGPT might redefine existing learning theories such as Constructivism and Self-Determination Theory using the AIED framework. ChatGPT has the potential to provide learners with personalized and adaptive learning experiences that align with these learning theories. For this reason, the paper will investigate how ChatGPT may impact the role of learners through the lens of Learner Autonomy before looking at how it might also redefine the role of the teacher in the classrooms of the near future. While ChatGPT has the potential to automate certain aspects of language education, the role of the teacher as a mentor and guide in the learning process is still essential. Finally, the paper offers suggestions for how educators can effectively incorporate generative AI programs like ChatGPT into their teaching practice and curriculum. The paper concludes by emphasizing the importance of approaching the integration of AI in education with caution and a critical perspective.
Published: 2024

11. Dense and Disconnected: Analyzing the Sedimented Style of ChatGPT-Generated Text at Scale

Author: Ben Markey, David West Brown, Michael Laudenbach, and Alan Kohler
Abstract: ChatGPT and other LLMs are at the forefront of pedagogical considerations in classrooms across the academy. Many studies have spoken to the technology's capacity to generate one-off texts in a variety of genres. This study complements those by inquiring into its capacity to generate compelling texts at scale. In this study, we quantitatively and qualitatively analyze a small corpus of generated texts in two genres and gauge it against novice and published academic writers along known dimensions of linguistic variation. Theoretically, we position and historicize ChatGPT as a writing technology and consider the ways in which generated text may not be congruent with established trajectories of writing development in higher education. Our study found that generated texts are more informationally dense than authored texts and often read as dialogically closed, "empty," and "fluffy." We close with a discussion of potentially explanatory linguistic features, as well as relevant pedagogical implications.
Published: 2024
Full Text: View/download PDF

12. Advancing stem cell technologies for conservation of wildlife biodiversity.

Author: Hutchinson, Ashlee M, Appeltant, Ruth, Burdon, Tom, Bao, Qiuye, Bargaje, Rhishikesh, Bodnar, Andrea, Chambers, Stuart, Comizzoli, Pierre, Cook, Laura, Endo, Yoshinori, Harman, Bob, Hayashi, Katsuhiko, Hildebrandt, Thomas, Korody, Marisa L, Lakshmipathy, Uma, Loring, Jeanne F, Munger, Clara, Ng, Alex HM, Novak, Ben, Onuma, Manabu, Ord, Sara, Paris, Monique, Pask, Andrew J, Pelegri, Francisco, Pera, Martin, Phelan, Ryan, Rosental, Benyamin, Ryder, Oliver A, Sukparangsi, Woranop, Sullivan, Gareth, Tay, Nicole Liling, Traylor-Knowles, Nikki, Walker, Shawn, Weberling, Antonia, Whitworth, Deanne J, Williams, Suzannah A, Wojtusik, Jessye, Wu, Jun, Ying, Qi-Long, Zwaka, Thomas P, and Kohler, Timo N
Subjects: In vitro gametogenesis, Biodiversity, Conservation, Disease modelling, IPSC, Stem cells, Animals, Conservation of Natural Resources, Animals, Wild, Stem Cells, Humans, Biological Sciences, Medical and Health Sciences, Biological sciences, Biomedical and clinical sciences, Health sciences
Abstract: Wildlife biodiversity is essential for healthy, resilient and sustainable ecosystems. For biologists, this diversity also represents a treasure trove of genetic, molecular and developmental mechanisms that deepen our understanding of the origins and rules of life. However, the rapid decline in biodiversity reported recently foreshadows a potentially catastrophic collapse of many important ecosystems and the associated irreversible loss of many forms of life on our planet. Immediate action by conservationists of all stripes is required to avert this disaster. In this Spotlight, we draw together insights and proposals discussed at a recent workshop hosted by Revive & Restore, which gathered experts to discuss how stem cell technologies can support traditional conservation techniques and help protect animal biodiversity. We discuss reprogramming, in vitro gametogenesis, disease modelling and embryo modelling, and we highlight the prospects for leveraging stem cell technologies beyond mammalian species.
Published: 2024

13. Unveiling the Social Fabric: A Temporal, Nation-Scale Social Network and its Characteristics

Author: Cremers, Jolien, Kohler, Benjamin, Maier, Benjamin Frank, Eriksen, Stine Nymann, Einsiedler, Johanna, Christensen, Frederik Kølby, Lehmann, Sune, Lassen, David Dreyer, Mortensen, Laust Hvas, and Bjerre-Nielsen, Andreas
Subjects: Computer Science - Social and Information Networks, Physics - Physics and Society
Abstract: Social networks shape individuals' lives, influencing everything from career paths to health. This paper presents a registry-based, multi-layer and temporal network of the entire Danish population in the years 2008-2021 (roughly 7.2 mill. individuals). Our network maps the relationships formed through family, households, neighborhoods, colleagues and classmates. We outline key properties of this multiplex network, introducing both an individual-focused perspective as well as a bipartite representation. We show how to aggregate and combine the layers, and how to efficiently compute network measures such as shortest paths in large administrative networks. Our analysis reveals how past connections reappear later in other layers, that the number of relationships aggregated over time reflects the position in the income distribution, and that we can recover canonical shortest path length distributions when appropriately weighting connections. Along with the network data, we release a Python package that uses the bipartite network representation for efficient analysis.
Published: 2024

14. Using maximum entropy modeling to predict the potential distributions of genus Copaifera L. in a conservation unit in the Brazilian Amazon

Author: de Oliveira Sousa, Jonathan Benathar, de Sousa Conceição Benathar, Isamara, Ebling, Angelo Augusto, Kohler, Sintia Valerio, de Oliveira, Ximena Mendes, de Paula Protásio, Thiago, Rodrigues, Marcos, and Goulart, Selma Lopes
Published: 2025
Full Text: View/download PDF

15. COVID-19 Vaccination Messengers, Communication Channels, and Messages Trusted Among Black Communities in the USA: a Review

Author: Rabin, Yael and Kohler, Racquel E.
Published: 2025
Full Text: View/download PDF

16. Digoxin for reduction of circulating tumor cell cluster size in metastatic breast cancer: a proof-of-concept trial

Author: Kurzeder, Christian, Nguyen-Sträuli, Bich Doan, Krol, Ilona, Ring, Alexander, Castro-Giner, Francesc, Nüesch, Manuel, Asawa, Simran, Zhang, Yu Wei, Budinjas, Selina, Gvozdenovic, Ana, Vogel, Maren, Kohler, Angela, Grašič Kuhar, Cvetka, Schwab, Fabienne D., Heinzelmann-Schwarz, Viola, Weber, Walter Paul, Rochlitz, Christoph, Vorburger, Denise, Frauchiger-Heuer, Heike, Witzel, Isabell, Wicki, Andreas, Kuster, Gabriela M., Vetter, Marcus, and Aceto, Nicola
Published: 2025
Full Text: View/download PDF

17. Diversity and biogeography of the bacterial microbiome in glacier-fed streams

Author: Ezzat, Leïla, Peter, Hannes, Bourquin, Massimo, Busi, Susheel Bhanu, Michoud, Grégoire, Fodelianakis, Stilianos, Kohler, Tyler J., Lamy, Thomas, Geers, Aileen, Pramateftaki, Paraskevi, Baier, Florian, Marasco, Ramona, Daffonchio, Daniele, Deluigi, Nicola, Wilmes, Paul, Styllas, Michail, Schön, Martina, Tolosano, Matteo, De Staercke, Vincent, and Battin, Tom J.
Published: 2025
Full Text: View/download PDF

18. Personalized, autologous neoantigen-specific T cell therapy in metastatic melanoma: a phase 1 trial

Author: Borgers, Jessica S. W., Lenkala, Divya, Kohler, Victoria, Jackson, Emily K., Linssen, Matthijs D., Hymson, Sebastian, McCarthy, Brian, O’Reilly Cosgrove, Elizabeth, Balogh, Kristen N., Esaulova, Ekaterina, Starr, Kimberly, Ware, Yvonne, Klobuch, Sebastian, Sciuto, Tracey, Chen, Xi, Mahimkar, Gauri, Sheen, Joong Hyuk F., Ramesh, Suchitra, Wilgenhof, Sofie, van Thienen, Johannes V., Scheiner, Karina C., Jedema, Inge, Rooney, Michael, Dong, Jesse Z., Srouji, John R., Juneja, Vikram R., Arieta, Christina M., Nuijen, Bastiaan, Gottstein, Claudia, Finney, Olivia C., Manson, Kelledy, Nijenhuis, Cynthia M., Gaynor, Richard B., DeMario, Mark, Haanen, John B., and van Buuren, Marit M.
Published: 2025
Full Text: View/download PDF

19. Antigen experience history directs distinct functional states of CD8+ CAR T cells during the antileukemia response

Author: DeGolier, Kole R., Danis, Etienne, D’Antonio, Marc, Cimons, Jennifer, Yarnell, Michael, Kedl, Ross M., Kohler, M. Eric, Scott-Browne, James P., and Fry, Terry J.
Published: 2025
Full Text: View/download PDF

20. Mapping the metagenomic diversity of the multi-kingdom glacier-fed stream microbiome

Author: Michoud, Grégoire, Peter, Hannes, Busi, Susheel Bhanu, Bourquin, Massimo, Kohler, Tyler J., Geers, Aileen, Ezzat, Leila, and Battin, Tom J.
Published: 2025
Full Text: View/download PDF

21. Multiway Multislice PHATE: Visualizing Hidden Dynamics of RNNs through Training

Author: Xie, Jiancheng, Voinov, Lou C. Kohler, Mudrik, Noga, Mishne, Gal, and Charles, Adam
Subjects: Computer Science - Machine Learning
Abstract: Recurrent neural networks (RNNs) are a widely used tool for sequential data analysis, however, they are still often seen as black boxes of computation. Understanding the functional principles of these networks is critical to developing ideal model architectures and optimization strategies. Previous studies typically only emphasize the network representation post-training, overlooking their evolution process throughout training. Here, we present Multiway Multislice PHATE (MM-PHATE), a novel method for visualizing the evolution of RNNs' hidden states. MM-PHATE is a graph-based embedding using structured kernels across the multiple dimensions spanned by RNNs: time, training epoch, and units. We demonstrate on various datasets that MM-PHATE uniquely preserves hidden representation community structure among units and identifies information processing and compression phases during training. The embedding allows users to look under the hood of RNNs across training and provides an intuitive and comprehensive strategy to understanding the network's internal dynamics and draw conclusions, e.g., on why and how one model outperforms another or how a specific architecture might impact an RNN's learning ability.
Published: 2024

22. Challenge-Device-Synthesis: A multi-disciplinary approach for the development of social innovation competences for students of Artificial Intelligence

Author: Bilkis, Matías, Kohler, Joan Moya, and Vilariño, Fernando
Subjects: Computer Science - Artificial Intelligence, Physics - Physics Education
Abstract: The advent of Artificial Intelligence is expected to imply profound changes in the short-term. It is therefore imperative for Academia, and particularly for the Computer Science scope, to develop cross-disciplinary tools that bond AI developments to their social dimension. To this aim, we introduce the Challenge-Device-Synthesis methodology (CDS), in which a specific challenge is presented to the students of AI, who are required to develop a device as a solution for the challenge. The device becomes the object of study for the different dimensions of social transformation, and the conclusions addressed by the students during the discussion around the device are presented in a synthesis piece in the shape of a 10-page scientific paper. The latter is evaluated taking into account both the depth of analysis and the level to which it genuinely reflects the social transformations associated with the proposed AI-based device. We provide data obtained during the pilot for the implementation phase of CDS within the subject of Social Innovation, a 6-ECTS subject from the 6th semester of the Degree of Artificial Intelligence, UAB-Barcelona. We provide details on temporalisation, task distribution, methodological tools used and assessment delivery procedure, as well as qualitative analysis of the results obtained., Comment: accepted as contribution for EDULEARN24 - 16th annual International Conference on Education and New Learning Technologies
Published: 2024

23. Interpretable and Editable Programmatic Tree Policies for Reinforcement Learning

Author: Kohler, Hector, Delfosse, Quentin, Akrour, Riad, Kersting, Kristian, and Preux, Philippe
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Deep reinforcement learning agents are prone to goal misalignments. The black-box nature of their policies hinders the detection and correction of such misalignments, and the trust necessary for real-world deployment. So far, solutions learning interpretable policies are inefficient or require many human priors. We propose INTERPRETER, a fast distillation method producing INTerpretable Editable tRee Programs for ReinforcEmenT lEaRning. We empirically demonstrate that INTERPRETER compact tree programs match oracles across a diverse set of sequential decision tasks and evaluate the impact of our design choices on interpretability and performances. We show that our policies can be interpreted and edited to correct misalignments on Atari games and to explain real farming strategies.
Published: 2024

24. Quantum Dissipation at Conical Intersections of Quasienergies

Author: Kohler, Sigmund
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: We investigate the properties of Floquet states in the vicinity of a conical intersection of quasienergies and work out the consequences of the underlying spatio-temporal symmetries for a driven two-level system coupled to an ohmic heat bath. We find that on manifolds with constant quasienergy splitting, the mean energies of the Floquet states are continuously interchanged. In the presence of dissipation, the parameter dependence of the stationary populations generally resembles that of the mean energies. In turn, the mean energies are an indicator for the qualitative behavior of the density operator in the long-time limit. A further consequence of the symmetries is that for specific driving parameters, the stationary state may be fully mixed even at arbitrarily low temperatures. For large driving frequencies, such states with maximal entropy are found in the whole vicinity of the intersection, which can be explained by a chirality emerging in this limit. Analytical results beyond a high-frequency approximation are illustrated by numerical data., Comment: 7 pages, 6 figures
Published: 2024
Full Text: View/download PDF

25. Analysis of the rate of convergence of an over-parametrized convolutional neural network image classifier learned by gradient descent

Author: Kohler, Michael, Krzyzak, Adam, and Walter, Benjamin
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: Image classification based on over-parametrized convolutional neural networks with a global average-pooling layer is considered. The weights of the network are learned by gradient descent. A bound on the rate of convergence of the difference between the misclassification risk of the newly introduced convolutional neural network estimate and the minimal possible value is derived.
Published: 2024

26. Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation

Author: Kohler, Jonas, Pumarola, Albert, Schönfeld, Edgar, Sanakoyeu, Artsiom, Sumbaly, Roshan, Vajda, Peter, and Thabet, Ali
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Diffusion models are a powerful generative framework, but come with expensive inference. Existing acceleration methods often compromise image quality or fail under complex conditioning when operating in an extremely low-step regime. In this work, we propose a novel distillation framework tailored to enable high-fidelity, diverse sample generation using just one to three steps. Our approach comprises three key components: (i) Backward Distillation, which mitigates training-inference discrepancies by calibrating the student on its own backward trajectory; (ii) Shifted Reconstruction Loss that dynamically adapts knowledge transfer based on the current time step; and (iii) Noise Correction, an inference-time technique that enhances sample quality by addressing singularities in noise prediction. Through extensive experiments, we demonstrate that our method outperforms existing competitors in quantitative metrics and human evaluations. Remarkably, it achieves performance comparable to the teacher model using only three denoising steps, enabling efficient high-quality generation.
Published: 2024

27. PID Tuning using Cross-Entropy Deep Learning: a Lyapunov Stability Analysis

Author: Kohler, Hector, Clement, Benoit, Chaffre, Thomas, and Chenadec, Gilles Le
Subjects: Electrical Engineering and Systems Science - Systems and Control, Computer Science - Machine Learning, Computer Science - Robotics
Abstract: Underwater Unmanned Vehicles (UUVs) have to constantly compensate for the external disturbing forces acting on their body. Adaptive Control theory is commonly used there to grant the control law some flexibility in its response to process variation. Today, learning-based (LB) adaptive methods are leading the field where model-based control structures are combined with deep model-free learning algorithms. This work proposes experiments and metrics to empirically study the stability of such a controller. We perform this stability analysis on a LB adaptive control system whose adaptive parameters are determined using a Cross-Entropy Deep Learning method.
Published: 2024
Full Text: View/download PDF

28. Towards a Research Community in Interpretable Reinforcement Learning: the InterpPol Workshop

Author: Kohler, Hector, Delfosse, Quentin, Festor, Paul, and Preux, Philippe
Subjects: Computer Science - Artificial Intelligence, Computer Science - Human-Computer Interaction, Computer Science - Machine Learning, Computer Science - Symbolic Computation
Abstract: Embracing the pursuit of intrinsically explainable reinforcement learning raises crucial questions: what distinguishes explainability from interpretability? Should explainable and interpretable agents be developed outside of domains where transparency is imperative? What advantages do interpretable policies offer over neural networks? How can we rigorously define and measure interpretability in policies, without user studies? What reinforcement learning paradigms,are the most suited to develop interpretable agents? Can Markov Decision Processes integrate interpretable state representations? In addition to motivate an Interpretable RL community centered around the aforementioned questions, we propose the first venue dedicated to Interpretable RL: the InterpPol Workshop.
Published: 2024

29. Learning of deep convolutional network image classifiers via stochastic gradient descent and over-parametrization

Author: Kohler, Michael, Krzyzak, Adam, and Sänger, Alisha
Subjects: Mathematics - Statistics Theory
Abstract: Image classification from independent and identically distributed random variables is considered. Image classifiers are defined which are based on a linear combination of deep convolutional networks with max-pooling layer. Here all the weights are learned by stochastic gradient descent. A general result is presented which shows that the image classifiers are able to approximate the best possible deep convolutional network. In case that the a posteriori probability satisfies a suitable hierarchical composition model it is shown that the corresponding deep convolutional neural network image classifier achieves a rate of convergence which is independent of the dimension of the images., Comment: arXiv admin note: text overlap with arXiv:2312.17007
Published: 2024

30. Extreme overall mushroom genome expansion in Mycena s.s. irrespective of plant hosts or substrate specializations.

Author: Harder, Christoffer, Miyauchi, Shingo, Virágh, Máté, Kuo, Alan, Thoen, Ella, Andreopoulos, Bill, Lu, Dabao, Skrede, Inger, Drula, Elodie, Henrissat, Bernard, Morin, Emmanuelle, Kohler, Annegret, Barry, Kerrie, LaButti, Kurt, Salamov, Asaf, Lipzen, Anna, Merényi, Zsolt, Hegedüs, Botond, Baldrian, Petr, Stursova, Martina, Weitz, Hedda, Taylor, Andy, Koriabine, Maxim, Savage, Emily, Grigoriev, Igor, Nagy, László, Martin, Francis, and Kauserud, Håvard
Subjects: Arctic biology, TE proliferation, biotrophy–saprotrophy evolution, carbon degradation, fungal genomics, fungal guild, genome size diversity, plant-fungus interactions, root-associations, saprotrophs, Genome, Fungal, Agaricales, Phylogeny, DNA Transposable Elements, Evolution, Molecular, Gene Transfer, Horizontal, Plants
Abstract: Mycena s.s. is a ubiquitous mushroom genus whose members degrade multiple dead plant substrates and opportunistically invade living plant roots. Having sequenced the nuclear genomes of 24 Mycena species, we find them to defy the expected patterns for fungi based on both their traditionally perceived saprotrophic ecology and substrate specializations. Mycena displayed massive genome expansions overall affecting all gene families, driven by novel gene family emergence, gene duplications, enlarged secretomes encoding polysaccharide degradation enzymes, transposable element (TE) proliferation, and horizontal gene transfers. Mainly due to TE proliferation, Arctic Mycena species display genomes of up to 502 Mbp (2-8× the temperate Mycena), the largest among mushroom-forming Agaricomycetes, indicating a possible evolutionary convergence to genomic expansions sometimes seen in Arctic plants. Overall, Mycena show highly unusual, varied mosaic-like genomic structures adaptable to multiple lifestyles, providing genomic illustration for the growing realization that fungal niche adaptations can be far more fluid than traditionally believed.
Published: 2024

31. Higher plasma levels of endocannabinoids and analogues are correlated with a worse cardiometabolic profile in middle-aged adults

Author: Rodríguez-García, Carmen, Osuna-Prieto, Francisco J., Kohler, Isabelle, Sanchez-Gomez, Joaquin, Ruiz-Campos, Samuel, Castillo, Manuel J., Amaro-Gahete, Francisco J., Martínez-Tellez, Borja, and Jurado-Fasoli, Lucas
Published: 2024
Full Text: View/download PDF

32. The clinical significance of imperfection: is idiopathic corporal asymmetry related to curvature during penile prosthesis placement?

Author: Bole, Raevti, Alom, Manaf, Habashy, Engy, Ahmed, Mohammed, Ziegelmann, Matthew, Kohler, Tobias, and Helo, Sevann
Published: 2024
Full Text: View/download PDF

33. Simultaneous Winslow and Petersen’s Hernias after a Roux-en-Y Gastric Bypass Causing Bowel Obstruction: Laparoscopic Management and Review of the Literature

Author: Litchinko, A., Kohler, R., Jung, M. K., Toso, C., and Moenig, S.
Published: 2024
Full Text: View/download PDF

34. Burden of post–acute COVID-19 sequelae in healthcare workers and its course over a 30-month period–results from a prospective multicentre cohort

Author: Dörr, Tamara, Strahm, Carol, Güsewell, Sabine, Ballouz, Tala, Kocan, Emina, Cusini, Alexia, Goppel, Stephan, Grässli, Fabian, Möller, J. Carsten, Puhan, Milo A., Risch, Lorenz, Ruetti, Markus, Schlegel, Matthias, Stocker, Reto, von Kietzell, Matthias, Vuichard-Gysin, Danielle, Kuster, Stefan P., Kahlert, Christian R., and Kohler, Philipp
Published: 2024
Full Text: View/download PDF

35. Bernhard Blechmann’s Body: The “Jewish Race” in Nineteenth-Century Physical Anthropology in Transnational Perspective

Author: Kohler, Noa Sophie
Published: 2024
Full Text: View/download PDF

36. Pandemics, public policy, and Peltzman effects

Author: Dietrich, Alexander, Kohler, Wilhelm, and Müller, Gernot
Published: 2024
Full Text: View/download PDF

37. An MSstats workflow for detecting differentially abundant proteins in large-scale data-independent acquisition mass spectrometry experiments with FragPipe processing

Author: Kohler, Devon, Staniak, Mateusz, Yu, Fengchao, Nesvizhskii, Alexey I., and Vitek, Olga
Published: 2024
Full Text: View/download PDF

38. Security of quantum position-verification limits Hamiltonian simulation via holography

Author: Apel, Harriet, Cubitt, Toby, Hayden, Patrick, Kohler, Tamara, and Pérez-García, David
Subjects: Quantum Physics, High Energy Physics - Theory
Abstract: We investigate the link between quantum position-verification (QPV) and holography established in [MPS19] using holographic quantum error correcting codes as toy models. By inserting the "temporal" scaling of the AdS metric by hand via the bulk Hamiltonian interaction strength, we recover a toy model with consistent causality structure. This leads to an interesting implication between two topics in quantum information: if position-based verification is secure against attacks with small entanglement then there are new fundamental lower bounds for resources required for one Hamiltonian to simulate another., Comment: (v2: updated presentation)
Published: 2024
Full Text: View/download PDF

39. Metatranscriptomics sheds light on the links between the functional traits of fungal guilds and ecological processes in forest soil ecosystems

Author: Auer, Lucas, Buée, Marc, Fauchery, Laure, Lombard, Vincent, Barry, Kerry W, Clum, Alicia, Copeland, Alex, Daum, Chris, Foster, Brian, LaButti, Kurt, Singan, Vasanth, Yoshinaga, Yuko, Martineau, Christine, Alfaro, Manuel, Castillo, Federico J, Imbert, J Bosco, Ramírez, Lucia, Castanera, Raúl, Pisabarro, Antonio G, Finlay, Roger, Lindahl, Björn, Olson, Ake, Séguin, Armand, Kohler, Annegret, Henrissat, Bernard, Grigoriev, Igor V, and Martin, Francis M
Subjects: Microbiology, Biological Sciences, Ecology, Genetics, Life on Land, Forests, Fungi, Soil Microbiology, Transcriptome, Mycorrhizae, Gene Expression Profiling, Gene Expression Regulation, Fungal, Nitrogen, Soil, Ecosystem, RNA, Messenger, forest soil, functional traits, fungal guilds, metatranscriptomics, organic matter degradation, Agricultural and Veterinary Sciences, Plant Biology & Botany, Plant biology, Climate change impacts and adaptation, Ecological applications
Abstract: Soil fungi belonging to different functional guilds, such as saprotrophs, pathogens, and mycorrhizal symbionts, play key roles in forest ecosystems. To date, no study has compared the actual gene expression of these guilds in different forest soils. We used metatranscriptomics to study the competition for organic resources by these fungal groups in boreal, temperate, and Mediterranean forest soils. Using a dedicated mRNA annotation pipeline combined with the JGI MycoCosm database, we compared the transcripts of these three fungal guilds, targeting enzymes involved in C- and N mobilization from plant and microbial cell walls. Genes encoding enzymes involved in the degradation of plant cell walls were expressed at a higher level in saprotrophic fungi than in ectomycorrhizal and pathogenic fungi. However, ectomycorrhizal and saprotrophic fungi showed similarly high expression levels of genes encoding enzymes involved in fungal cell wall degradation. Transcripts for N-related transporters were more highly expressed in ectomycorrhizal fungi than in other groups. We showed that ectomycorrhizal and saprotrophic fungi compete for N in soil organic matter, suggesting that their interactions could decelerate C cycling. Metatranscriptomics provides a unique tool to test controversial ecological hypotheses and to better understand the underlying ecological processes involved in soil functioning and carbon stabilization.
Published: 2024

40. Current Postlaunch Implementation of State Mandates of Newborn Screening for Critical Congenital Heart Disease by Pulse Oximetry in U.S. States and Hospitals

Author: Sakai-Bizmark, Rie, Chang, Ruey-Kang R, Martin, Gerard R, Hom, Lisa A, Marr, Emily H, Ko, Jamie, Goff, Donna A, Mena, Laurie A, von Kohler, Connie, Bedel, Lauren EM, Murillo, Mary, Estevez, Dennys, and Hays, Ron D
Subjects: Health Services and Systems, Biomedical and Clinical Sciences, Health Sciences, Cardiovascular, Pediatric, Heart Disease, Health Services, Clinical Research, Congenital Structural Anomalies, Rare Diseases, Humans, Oximetry, Heart Defects, Congenital, Neonatal Screening, Infant, Newborn, United States, Guideline Adherence, Surveys and Questionnaires, Hospitals, Mandatory Programs, RUSP, pulse oximetry screening, CFIR, implementation, newborn screening, CCHD, congenital heart disease, nurses, Clinical Sciences, Paediatrics and Reproductive Medicine, Obstetrics & Reproductive Medicine, Paediatrics, Reproductive medicine, Midwifery
Abstract: ObjectiveOur objective was to gauge adherence to nationally endorsed protocols in implementation of pulse oximetry (POx) screening for critical congenital heart disease (CCHD) in infants after mandate by all states and to assess associated characteristics.Study designBetween March and October 2019, an online questionnaire was administered to nurse supervisors who oversee personnel conducting POx screening. The questionnaire used eight questions regarding performance and interpretation of screening protocols to measure policy consistency, which is adherence to nationally endorsed protocols for POx screening developed by professional medical societies. Multilevel linear regression models evaluated associations between policy consistency and characteristics of hospitals and individuals, state of hospital location, early versus late mandate adopters, and state reporting requirements.ResultsResponses from 189 nurse supervisors spanning 38 states were analyzed. Only 17% received maximum points indicating full policy consistency, and 24% selected all four options for potential hypoxia that require a repeat screen. Notably, 33% did not recognize ≤90% SpO2 as an immediate failed screen and 31% responded that an infant with SpO2 of 89% in one extremity will be rescreened by nurses in an hour rather than receiving an immediate physician referral. Lower policy consistency was associated with lack of state reporting mandates (beta = -1.23 p = 0.01) and early adoption by states (beta = -1.01, p
Published: 2024

41. Accelerating Medicines Partnership® Schizophrenia (AMP® SCZ): Rationale and Study Design of the Largest Global Prospective Cohort Study of Clinical High Risk for Psychosis.

Author: Wannan, Cassandra, Nelson, Barnaby, Addington, Jean, Allott, Kelly, Anticevic, Alan, Arango, Celso, Baker, Justin, McGorry, Patrick, Mittal, Vijay, Nordentoft, Merete, Nunez, Angela, Pasternak, Ofer, Pearlson, Godfrey, Perez, Jesus, Perkins, Diana, Powers, Albert, Roalf, David, Sabb, Fred, Schiffman, Jason, Shah, Jai, Smesny, Stefan, Spark, Jessica, Stone, William, Strauss, Gregory, Tamayo, Zailyn, Torous, John, Upthegrove, Rachel, Vangel, Mark, Verma, Swapna, Wang, Jijun, Rossum, Inge, Wolf, Daniel, Wolff, Phillip, Wood, Stephen, Yung, Alison, Agurto, Carla, Alvarez-Jimenez, Mario, Amminger, Paul, Armando, Marco, Asgari-Targhi, Ameneh, Cahill, John, Carrión, Ricardo, Castro, Eduardo, Cetin-Karayumak, Suheyla, Mallar Chakravarty, M, Cho, Youngsun, Cotter, David, DAlfonso, Simon, Ennis, Michaela, Fadnavis, Shreyas, Fonteneau, Clara, Gao, Caroline, Gupta, Tina, Gur, Raquel, Gur, Ruben, Hamilton, Holly, Hoftman, Gil, Jacobs, Grace, Jarcho, Johanna, Ji, Jie, Kohler, Christian, Lalousis, Paris, Lavoie, Suzie, Lepage, Martin, Liebenthal, Einat, Mervis, Josh, Murty, Vishnu, Nicholas, Spero, Ning, Lipeng, Penzel, Nora, Poldrack, Russell, Polosecki, Pablo, Pratt, Danielle, Rabin, Rachel, Rahimi Eichi, Habiballah, Rathi, Yogesh, Reichenberg, Avraham, Reinen, Jenna, Rogers, Jack, Ruiz-Yu, Bernalyn, Scott, Isabelle, Seitz-Holland, Johanna, Srihari, Vinod, Srivastava, Agrima, Thompson, Andrew, Turetsky, Bruce, Walsh, Barbara, Whitford, Thomas, Wigman, Johanna, Yao, Beier, Yuen, Hok, Ahmed, Uzair, Byun, Andrew, Chung, Yoonho, Do, Kim, Hendricks, Larry, Huynh, Kevin, Jeffries, Clark, Lane, Erlend, and Langholm, Carsten
Subjects: clinical high risk, consortium, early detection, prediction, prevention, psychosis, Humans, Psychotic Disorders, Schizophrenia, Prospective Studies, Adult, Prodromal Symptoms, Young Adult, International Cooperation, Adolescent, Research Design, Male, Female
Abstract: This article describes the rationale, aims, and methodology of the Accelerating Medicines Partnership® Schizophrenia (AMP® SCZ). This is the largest international collaboration to date that will develop algorithms to predict trajectories and outcomes of individuals at clinical high risk (CHR) for psychosis and to advance the development and use of novel pharmacological interventions for CHR individuals. We present a description of the participating research networks and the data processing analysis and coordination center, their processes for data harmonization across 43 sites from 13 participating countries (recruitment across North America, Australia, Europe, Asia, and South America), data flow and quality assessment processes, data analyses, and the transfer of data to the National Institute of Mental Health (NIMH) Data Archive (NDA) for use by the research community. In an expected sample of approximately 2000 CHR individuals and 640 matched healthy controls, AMP SCZ will collect clinical, environmental, and cognitive data along with multimodal biomarkers, including neuroimaging, electrophysiology, fluid biospecimens, speech and facial expression samples, novel measures derived from digital health technologies including smartphone-based daily surveys, and passive sensing as well as actigraphy. The study will investigate a range of clinical outcomes over a 2-year period, including transition to psychosis, remission or persistence of CHR status, attenuated positive symptoms, persistent negative symptoms, mood and anxiety symptoms, and psychosocial functioning. The global reach of AMP SCZ and its harmonized innovative methods promise to catalyze the development of new treatments to address critical unmet clinical and public health needs in CHR individuals.
Published: 2024

42. Nuclear Hsp104 safeguards the dormant translation machinery during quiescence

Author: Kohler, Verena, Kohler, Andreas, Berglund, Lisa Larsson, Hao, Xinxin, Gersing, Sarah, Imhof, Axel, Nyström, Thomas, Höög, Johanna L., Ott, Martin, Andréasson, Claes, and Büttner, Sabrina
Published: 2024
Full Text: View/download PDF

43. On the rate of convergence of an over-parametrized Transformer classifier learned by gradient descent

Author: Kohler, Michael and Krzyzak, Adam
Subjects: Computer Science - Machine Learning, Mathematics - Statistics Theory, Statistics - Machine Learning
Abstract: One of the most recent and fascinating breakthroughs in artificial intelligence is ChatGPT, a chatbot which can simulate human conversation. ChatGPT is an instance of GPT4, which is a language model based on generative gredictive gransformers. So if one wants to study from a theoretical point of view, how powerful such artificial intelligence can be, one approach is to consider transformer networks and to study which problems one can solve with these networks theoretically. Here it is not only important what kind of models these network can approximate, or how they can generalize their knowledge learned by choosing the best possible approximation to a concrete data set, but also how well optimization of such transformer network based on concrete data set works. In this article we consider all these three different aspects simultaneously and show a theoretical upper bound on the missclassification probability of a transformer network fitted to the observed data. For simplicity we focus in this context on transformer encoder networks which can be applied to define an estimate in the context of a classification problem involving natural language.
Published: 2023

44. fMPI: Fast Novel View Synthesis in the Wild with Layered Scene Representations

Author: Kohler, Jonas, Sanchez, Nicolas Griffiths, Cavalli, Luca, Herold, Catherine, Pumarola, Albert, Garcia, Alberto Garcia, and Thabet, Ali
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: In this study, we propose two novel input processing paradigms for novel view synthesis (NVS) methods based on layered scene representations that significantly improve their runtime without compromising quality. Our approach identifies and mitigates the two most time-consuming aspects of traditional pipelines: building and processing the so-called plane sweep volume (PSV), which is a high-dimensional tensor of planar re-projections of the input camera views. In particular, we propose processing this tensor in parallel groups for improved compute efficiency as well as super-sampling adjacent input planes to generate denser, and hence more accurate scene representation. The proposed enhancements offer significant flexibility, allowing for a balance between performance and speed, thus making substantial steps toward real-time applications. Furthermore, they are very general in the sense that any PSV-based method can make use of them, including methods that employ multiplane images, multisphere images, and layered depth images. In a comprehensive set of experiments, we demonstrate that our proposed paradigms enable the design of an NVS method that achieves state-of-the-art on public benchmarks while being up to $50x$ faster than existing state-of-the-art methods. It also beats the current forerunner in terms of speed by over $3x$, while achieving significantly better rendering quality.
Published: 2023

45. Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models

Author: Castillo, Angela, Kohler, Jonas, Pérez, Juan C., Pérez, Juan Pablo, Pumarola, Albert, Ghanem, Bernard, Arbeláez, Pablo, and Thabet, Ali
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: This paper presents a comprehensive study on the role of Classifier-Free Guidance (CFG) in text-conditioned diffusion models from the perspective of inference efficiency. In particular, we relax the default choice of applying CFG in all diffusion steps and instead search for efficient guidance policies. We formulate the discovery of such policies in the differentiable Neural Architecture Search framework. Our findings suggest that the denoising steps proposed by CFG become increasingly aligned with simple conditional steps, which renders the extra neural network evaluation of CFG redundant, especially in the second half of the denoising process. Building upon this insight, we propose "Adaptive Guidance" (AG), an efficient variant of CFG, that adaptively omits network evaluations when the denoising process displays convergence. Our experiments demonstrate that AG preserves CFG's image quality while reducing computation by 25%. Thus, AG constitutes a plug-and-play alternative to Guidance Distillation, achieving 50% of the speed-ups of the latter while being training-free and retaining the capacity to handle negative prompts. Finally, we uncover further redundancies of CFG in the first half of the diffusion process, showing that entire neural function evaluations can be replaced by simple affine transformations of past score estimates. This method, termed LinearAG, offers even cheaper inference at the cost of deviating from the baseline model. Our findings provide insights into the efficiency of the conditional denoising process that contribute to more practical and swift deployment of text-conditioned diffusion models.
Published: 2023

46. Cache Me if You Can: Accelerating Diffusion Models through Block Caching

Author: Wimbauer, Felix, Wu, Bichen, Schoenfeld, Edgar, Dai, Xiaoliang, Hou, Ji, He, Zijian, Sanakoyeu, Artsiom, Zhang, Peizhao, Tsai, Sam, Kohler, Jonas, Rupprecht, Christian, Cremers, Daniel, Vajda, Peter, and Wang, Jialiang
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Diffusion models have recently revolutionized the field of image synthesis due to their ability to generate photorealistic images. However, one of the major drawbacks of diffusion models is that the image generation process is costly. A large image-to-image network has to be applied many times to iteratively refine an image from random noise. While many recent works propose techniques to reduce the number of required steps, they generally treat the underlying denoising network as a black box. In this work, we investigate the behavior of the layers within the network and find that 1) the layers' output changes smoothly over time, 2) the layers show distinct patterns of change, and 3) the change from step to step is often very small. We hypothesize that many layer computations in the denoising network are redundant. Leveraging this, we introduce block caching, in which we reuse outputs from layer blocks of previous steps to speed up inference. Furthermore, we propose a technique to automatically determine caching schedules based on each block's changes over timesteps. In our experiments, we show through FID, human evaluation and qualitative analysis that Block Caching allows to generate images with higher visual quality at the same computational cost. We demonstrate this for different state-of-the-art models (LDM and EMU) and solvers (DDIM and DPM)., Comment: Project page: https://fwmb.github.io/blockcaching/
Published: 2023

47. Gapped Clique Homology on weighted graphs is $\text{QMA}_1$-hard and contained in $\text{QMA}$

Author: King, Robbie and Kohler, Tamara
Subjects: Quantum Physics
Abstract: We study the complexity of a classic problem in computational topology, the homology problem: given a description of some space $X$ and an integer $k$, decide if $X$ contains a $k$-dimensional hole. The setting and statement of the homology problem are completely classical, yet we find that the complexity is characterized by quantum complexity classes. Our result can be seen as an aspect of a connection between homology and supersymmetric quantum mechanics. We consider clique complexes, motivated by the practical application of topological data analysis (TDA). The clique complex of a graph is the simplicial complex formed by declaring every $k+1$-clique in the graph to be a $k$-simplex. Our main result is that deciding whether the clique complex of a weighted graph has a hole or not, given a suitable promise on the gap, is $\text{QMA}_1$-hard and contained in $\text{QMA}$. Our main innovation is a technique to lower bound the eigenvalues of the combinatorial Laplacian operator. For this, we invoke a tool from algebraic topology known as \emph{spectral sequences}. In particular, we exploit a connection between spectral sequences and Hodge theory. Spectral sequences will play a role analogous to perturbation theory for combinatorial Laplacians. In addition, we develop the simplicial surgery technique used in prior work. Our result provides some suggestion that the quantum TDA algorithm \emph{cannot} be dequantized. More broadly, we hope that our results will open up new possibilities for quantum advantage in topological data analysis.
Published: 2023

48. Analysis of the expected $L_2$ error of an over-parametrized deep neural network estimate learned by gradient descent without regularization

Author: Drews, Selina and Kohler, Michael
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning, 62G08
Abstract: Recent results show that estimates defined by over-parametrized deep neural networks learned by applying gradient descent to a regularized empirical $L_2$ risk are universally consistent and achieve good rates of convergence. In this paper, we show that the regularization term is not necessary to obtain similar results. In the case of a suitably chosen initialization of the network, a suitable number of gradient descent steps, and a suitable step size we show that an estimate without a regularization term is universally consistent for bounded predictor variables. Additionally, we show that if the regression function is H\"older smooth with H\"older exponent $1/2 \leq p \leq 1$, the $L_2$ error converges to zero with a convergence rate of approximately $n^{-1/(1+d)}$. Furthermore, in case of an interaction model, where the regression function consists of a sum of H\"older smooth functions with $d^*$ components, a rate of convergence is derived which does not depend on the input dimension $d$.
Published: 2023

49. Direct measurement of spin-flip rates in single-electron tunneling

Author: Dani, Olfa, Hussein, Robert, Bayer, Johannes C., Pierz, Klaus, Kohler, Sigmund, and Haug, Rolf J.
Subjects: Condensed Matter - Mesoscale and Nanoscale Physics
Abstract: Spin-flips are one of the limiting factors for spin-based information processing. We demonstrate a transport approach for determining the spin-flip rates of a self-assembled InAs double quantum dot occupied by a single electron. In such devices, different Land\'e factors lead to an inhomogeneous Zeeman splitting, so that the two spin channels can never be at resonance simultaneously, leading to a spin blockade at low temperatures. This blockade is analyzed in terms of spin flips for different temperatures and magnetic fields. Our results are in good agreement with a quantum master equation that combines the dot-lead couplings with ohmic dissipation stemming from spin-flip cotunneling., Comment: 9 pages, 8 figures
Published: 2023
Full Text: View/download PDF

50. Team Cognition in Handoffs: Relating System Factors, Team Cognition Functions and Outcomes in Two Handoff Processes.

Author: Wooldridge, Abigail, Carayon, Pascale, Hoonakker, Peter, Hose, Bat-Zion, Shaffer, David, Brazelton, Tom, Eithun, Ben, Rusy, Deborah, Ross, Joshua, Kelly, Michelle, Springman, Scott, Gurses, Ayse, and Kohler, Jonathan
Subjects: care transitions and handoffs, communication and teamwork in health care, macroergonomics and the environment, system design and analysis, team cognition, Humans, Child, Patient Handoff, Intensive Care Units, Cognition, Communication, Operating Rooms
Abstract: OBJECTIVE: This study investigates how team cognition occurs in care transitions from operating room (OR) to intensive care unit (ICU). We then seek to understand how the sociotechnical system and team cognition are related. BACKGROUND: Effective handoffs are critical to ensuring patient safety and have been the subject of many improvement efforts. However, the types of team-level cognitive processing during handoffs have not been explored, nor is it clear how the sociotechnical system shapes team cognition. METHOD: We conducted this study in an academic, Level 1 trauma center in the Midwestern United States. Twenty-eight physicians (surgery, anesthesia, pediatric critical care) and nurses (OR, ICU) participated in semi-structured interviews. We performed qualitative content analysis and epistemic network analysis to understand the relationships between system factors, team cognition in handoffs and outcomes. RESULTS: Participants described three team cognition functions in handoffs-(1) information exchange, (2) assessment, and (3) planning and decision making; information exchange was mentioned most. Work system factors influenced team cognition. Inter-professional handoffs facilitated information exchange but included large teams with diverse backgrounds communicating, which can be inefficient. Intra-professional handoffs decreased team size and role diversity, which may simplify communication but increase information loss. Participants in inter-professional handoffs reflected on outcomes significantly more in relation to system factors and team cognition (p < 0.001), while participants in intra-professional handoffs discussed handoffs as a task. CONCLUSION: Handoffs include team cognition, which was influenced by work system design. Opportunities for handoff improvement include a flexibly standardized process and supportive tools/technologies. We recommend incorporating perspectives of the patient and family in future work.
Published: 2024

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

7,028 results on '"Kohler, P."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources