Author: "Magnusson P" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Magnusson P"' showing total 6,585 results

Start Over Author "Magnusson P"

6,585 results on '"Magnusson P"'

1. Scalable Data Ablation Approximations for Language Models through Modular Training and Merging

Author: Na, Clara, Magnusson, Ian, Jha, Ananya Harsh, Sherborne, Tom, Strubell, Emma, Dodge, Jesse, and Dasigi, Pradeep
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Training data compositions for Large Language Models (LLMs) can significantly affect their downstream performance. However, a thorough data ablation study exploring large sets of candidate data mixtures is typically prohibitively expensive since the full effect is seen only after training the models; this can lead practitioners to settle for sub-optimal data mixtures. We propose an efficient method for approximating data ablations which trains individual models on subsets of a training corpus and reuses them across evaluations of combinations of subsets. In continued pre-training experiments, we find that, given an arbitrary evaluation set, the perplexity score of a single model trained on a candidate set of data is strongly correlated with perplexity scores of parameter averages of models trained on distinct partitions of that data. From this finding, we posit that researchers and practitioners can conduct inexpensive simulations of data ablations by maintaining a pool of models that were each trained on partitions of a large training corpus, and assessing candidate data mixtures by evaluating parameter averages of combinations of these models. This approach allows for substantial improvements in amortized training efficiency -- scaling only linearly with respect to new data -- by enabling reuse of previous training computation, opening new avenues for improving model performance through rigorous, incremental data assessment and mixing., Comment: EMNLP 2024. 17 pages
Published: 2024

2. Fast Online Learning of CLiFF-maps in Changing Environments

Author: Zhu, Yufei, Rudenko, Andrey, Palmieri, Luigi, Heuer, Lukas, Lilienthal, Achim J., and Magnusson, Martin
Subjects: Computer Science - Robotics
Abstract: Maps of dynamics are effective representations of motion patterns learned from prior observations, with recent research demonstrating their ability to enhance performance in various downstream tasks such as human-aware robot navigation, long-term human motion prediction, and robot localization. Current advancements have primarily concentrated on methods for learning maps of human flow in environments where the flow is static, i.e., not assumed to change over time. In this paper we propose a method to update the CLiFF-map, one type of map of dynamics, for achieving efficient life-long robot operation. As new observations are collected, our goal is to update a CLiFF-map to effectively and accurately integrate new observations, while retaining relevant historic motion patterns. The proposed online update method maintains a probabilistic representation in each observed location, updating parameters by continuously tracking sufficient statistics. In experiments using both synthetic and real-world datasets, we show that our method is able to maintain accurate representations of human motion dynamics, contributing to high performance flow-compliant planning downstream tasks, while being orders of magnitude faster than the comparable baselines.
Published: 2024

3. Transdermal Buprenorphine for Acute Pain in the Clinical Setting: A Narrative Review

Author: Pergolizzi JV Jr, Magnusson P, LeQuang JA, Breve F, Mitchell K, Chopra M, and Varrassi G
Subjects: acute pain control, ceiling effect of buprenorphine, geriatric pain patients, gynecological surgery, hip arthroscopy, knee arthroscopy, opioid-associated side effects, orthopedic rehabilitation, orthopedic surgery, postoperative pain, spinal surgery, transdermal buprenorphine dosing., Medicine (General), R5-920
Abstract: Joseph V Pergolizzi Jr,1 Peter Magnusson,2,3 Jo Ann LeQuang,1 Frank Breve,4 Kailyn Mitchell,1 Maninder Chopra,5 Giustino Varrassi6 1NEMA Research, Inc., Naples, FL, USA; 2Centre for Research and Development, Region Gävleborg/Uppsala University, Gävle, Sweden; 3Department of Medicine, Cardiology Research Unit, Karolinska Institutet, Stockholm, Sweden; 4Department of Pharmacy Practice, Temple University School of Pharmacy, Philadelphia, PA, USA; 5Decision Alternatives, LLC, Frederick, MD, USA; 6Paolo Procacci Foundation, Rome, ItalyCorrespondence: Jo Ann LeQuangNEMA Research, Inc., 4005 Technology Drive, Suite 1008-V, Angleton, TX, 77515, USATel +1-979-824-0251Email joannlequang@gmail.comAbstract: Transdermal buprenorphine is indicated for chronic pain management, but as its role in the clinical management of acute pain is less clear, this narrative review examines studies of the patch for acute pain, mainly in the postoperative setting. Although perhaps better known for its role in opioid rehabilitation programs, buprenorphine is also an effective analgesic that is a Schedule III controlled substance. Although buprenorphine is a partial agonist at the μ-opioid receptor, it is erroneous to think of the agent as a partial analgesic; it has full analgesic efficacy and unique attributes among opioids, such as a ceiling for respiratory depression and low “drug likeability” among those who take opioids for recreational purposes. Transdermal buprenorphine has been most thoroughly studied for acute pain control in postoperative patients. Postoperative pain follows a distinct and predictable trajectory depending on the type of surgery and patient characteristics. Overall, when the patch is applied prior to surgery and left in place for the prescribed seven days, it was associated with reduced postoperative pain, lower consumption of other analgesics, and patient satisfaction. Transdermal buprenorphine has been evaluated in clinical studies of patients undergoing gynecological surgery, hip fracture surgery, knee or hip arthroscopy/arthroplasty, shoulder surgery, and spinal surgery. Transdermal buprenorphine may also be appropriate pain medication for controlling pain during postsurgical orthopedic rehabilitation programs. Transdermal buprenorphine may result in typical opioid-associated side effects but with less frequency than other opioids. Despite clinical reservations about transdermal buprenorphine and its potential role in acute pain management in the clinical setting, clinical acceptance may be hampered by the fact that it is off-label and buprenorphine is better known as an opioid maintenance agent rather than an analgesic.Keywords: acute pain control, ceiling effect of buprenorphine, geriatric pain patients, gynecological surgery, hip arthroscopy, knee arthroscopy, opioid-associated side effects, orthopedic rehabilitation, orthopedic surgery, postoperative pain, spinal surgery, transdermal buprenorphine dosing
Published: 2021

4. The pace of life for forest trees.

Author: Bialic-Murphy, Lalasia, McElderry, Robert M, Esquivel-Muelbert, Adriane, van den Hoogen, Johan, Zuidema, Pieter A, Phillips, Oliver L, de Oliveira, Edmar Almeida, Loayza, Patricia Alvarez, Alvarez-Davila, Esteban, Alves, Luciana F, Maia, Vinícius Andrade, Vieira, Simone Aparecida, Arantes da Silva, Lidiany Carolina, Araujo-Murakami, Alejandro, Arets, Eric, Astigarraga, Julen, Baccaro, Fabrício, Baker, Timothy, Banki, Olaf, Barroso, Jorcely, Blanc, Lilian, Bonal, Damien, Bongers, Frans, Bordin, Kauane Maiara, Brienen, Roel, de Medeiros, Marcelo Brilhante, Camargo, José Luís, Araújo, Felipe Carvalho, Castilho, Carolina V, Castro, Wendeson, Moscoso, Victor Chama, Comiskey, James, Costa, Flávia, Müller, Sandra Cristina, de Almeida, Everton Cristo, Lôla da Costa, Antonio Carlos, de Andrade Kamimura, Vitor, de Oliveira, Fernanda, Del Aguila Pasquel, Jhon, Derroire, Géraldine, Dexter, Kyle, Di Fiore, Anthony, Duchesne, Louis, Emílio, Thaise, Farrapo, Camila Laís, Fauset, Sophie, Draper, Federick C, Feldpausch, Ted R, Ramos, Rafael Flora, Martins, Valeria Forni, Simon, Marcelo Fragomeni, Reis, Miguel Gama, Manzatto, Angelo Gilberto, Herault, Bruno, Herrera, Rafael, Coronado, Eurídice Honorio, Howe, Robert, Huamantupa-Chuquimaco, Isau, Huasco, Walter Huaraca, Zanini, Katia Janaina, Joly, Carlos, Killeen, Timothy, Klipel, Joice, Laurance, Susan G, Laurance, William F, Fontes, Marco Aurélio Leite, Oviedo, Wilmar Lopez, Magnusson, William E, Dos Santos, Rubens Manoel, Peña, Jose Luis Marcelo, de Abreu, Karla Maria Pedra, Marimon, Beatriz, Junior, Ben Hur Marimon, Melgaço, Karina, Melo Cruz, Omar Aurelio, Mendoza, Casimiro, Monteagudo-Mendoza, Abel, Morandi, Paulo S, Gianasi, Fernanda Moreira, Nascimento, Henrique, Nascimento, Marcelo, Neill, David, Palacios, Walter, Camacho, Nadir C Pallqui, Pardo, Guido, Pennington, R Toby, Peñuela-Mora, Maria Cristina, Pitman, Nigel CA, Poorter, Lourens, Cruz, Adriana Prieto, Ramírez-Angulo, Hirma, Reis, Simone Matias, Correa, Zorayda Restrepo, Rodriguez, Carlos Reynel, Lleras, Agustín Rudas, Santos, Flavio AM, Bergamin, Rodrigo Scarton, Schietti, Juliana, Schwartz, Gustavo, and Serrano, Julio
Subjects: Trees, Carbon, Temperature, Longevity, Carbon Cycle, Forests, Life History Traits, General Science & Technology
Abstract: Tree growth and longevity trade-offs fundamentally shape the terrestrial carbon balance. Yet, we lack a unified understanding of how such trade-offs vary across the world's forests. By mapping life history traits for a wide range of species across the Americas, we reveal considerable variation in life expectancies from 10 centimeters in diameter (ranging from 1.3 to 3195 years) and show that the pace of life for trees can be accurately classified into four demographic functional types. We found emergent patterns in the strength of trade-offs between growth and longevity across a temperature gradient. Furthermore, we show that the diversity of life history traits varies predictably across forest biomes, giving rise to a positive relationship between trait diversity and productivity. Our pan-latitudinal assessment provides new insights into the demographic mechanisms that govern the carbon turnover rate across forest biomes.
Published: 2024

5. posteriordb: Testing, Benchmarking and Developing Bayesian Inference Algorithms

Author: Magnusson, Måns, Torgander, Jakob, Bürkner, Paul-Christian, Zhang, Lu, Carpenter, Bob, and Vehtari, Aki
Subjects: Statistics - Computation
Abstract: The generality and robustness of inference algorithms is critical to the success of widely used probabilistic programming languages such as Stan, PyMC, Pyro, and Turing.jl. When designing a new general-purpose inference algorithm, whether it involves Monte Carlo sampling or variational approximation, the fundamental problem arises in evaluating its accuracy and efficiency across a range of representative target models. To solve this problem, we propose posteriordb, a database of models and data sets defining target densities along with reference Monte Carlo draws. We further provide a guide to the best practices in using posteriordb for model evaluation and comparison. To provide a wide range of realistic target densities, posteriordb currently comprises 120 representative models and has been instrumental in developing several general inference algorithms.
Published: 2024

6. Formalising Anti-Discrimination Law in Automated Decision Systems

Author: Sargeant, Holli and Magnusson, Måns
Subjects: Computer Science - Computers and Society
Abstract: We study the legal challenges in automated decision-making by analysing conventional algorithmic fairness approaches and their alignment with antidiscrimination law in the United Kingdom and other jurisdictions based on English common law. By translating principles of anti-discrimination law into a decision-theoretic framework, we formalise discrimination and propose a new, legally informed approach to developing systems for automated decision-making. Our investigation reveals that while algorithmic fairness approaches have adapted concepts from legal theory, they can conflict with legal standards, highlighting the importance of bridging the gap between automated decisions, fairness, and anti-discrimination doctrine.
Published: 2024

7. Human Gaze and Head Rotation during Navigation, Exploration and Object Manipulation in Shared Environments with Robots

Author: Schreiter, Tim, Rudenko, Andrey, Magnusson, Martin, and Lilienthal, Achim J.
Subjects: Computer Science - Robotics
Abstract: The human gaze is an important cue to signal intention, attention, distraction, and the regions of interest in the immediate surroundings. Gaze tracking can transform how robots perceive, understand, and react to people, enabling new modes of robot control, interaction, and collaboration. In this paper, we use gaze tracking data from a rich dataset of human motion (TH\"OR-MAGNI) to investigate the coordination between gaze direction and head rotation of humans engaged in various indoor activities involving navigation, interaction with objects, and collaboration with a mobile robot. In particular, we study the spread and central bias of fixations in diverse activities and examine the correlation between gaze direction and head rotation. We introduce various human motion metrics to enhance the understanding of gaze behavior in dynamic interactions. Finally, we apply semantic object labeling to decompose the gaze distribution into activity-relevant regions., Comment: This is the final version of the accepted version of the manuscript that will be published in the 2024 33rd IEEE International Conference on Robot and Human Interactive Communication (ROMAN). Copyright 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses
Published: 2024

8. AIFS -- ECMWF's data-driven forecasting system

Author: Lang, Simon, Alexe, Mihai, Chantry, Matthew, Dramsch, Jesper, Pinault, Florian, Raoult, Baudouin, Clare, Mariana C. A., Lessig, Christian, Maier-Gerber, Michael, Magnusson, Linus, Bouallègue, Zied Ben, Nemesio, Ana Prieto, Dueben, Peter D., Brown, Andrew, Pappenberger, Florian, and Rabier, Florence
Subjects: Physics - Atmospheric and Oceanic Physics
Abstract: Machine learning-based weather forecasting models have quickly emerged as a promising methodology for accurate medium-range global weather forecasting. Here, we introduce the Artificial Intelligence Forecasting System (AIFS), a data driven forecast model developed by the European Centre for Medium-Range Weather Forecasts (ECMWF). AIFS is based on a graph neural network (GNN) encoder and decoder, and a sliding window transformer processor, and is trained on ECMWF's ERA5 re-analysis and ECMWF's operational numerical weather prediction (NWP) analyses. It has a flexible and modular design and supports several levels of parallelism to enable training on high-resolution input data. AIFS forecast skill is assessed by comparing its forecasts to NWP analyses and direct observational data. We show that AIFS produces highly skilled forecasts for upper-air variables, surface weather parameters and tropical cyclone tracks. AIFS is run four times daily alongside ECMWF's physics-based NWP model and forecasts are available to the public under ECMWF's open data policy.
Published: 2024

9. The Use of Peripheral μ-Opioid Receptor Antagonists (PAMORA) in the Management of Opioid-Induced Constipation: An Update on Their Efficacy and Safety

Author: Pergolizzi JV Jr, Christo PJ, LeQuang JA, and Magnusson P
Subjects: constipation, methylnaltrexone, naldemedine, naloxegol, opioid antagonism, opioid-associated bowel disorder, opioid-associated side effects, pain, Therapeutics. Pharmacology, RM1-950
Abstract: Joseph V Pergolizzi Jr,1 Paul J Christo,2 Jo Ann LeQuang,1 Peter Magnusson3,4 1NEMA Research, Inc., Naples, FL, USA; 2Division of Pain Medicine, Department of Anesthesiology and Critical Care Medicine, Johns Hopkins Medicine, Baltimore, ML, USA; 3Cardiology Research Unit, Department of Medicine, Karolinska Institute, Stockholm, Sweden; 4Centre for Research and Development, Uppsala University/Region, Gävleborg, SwedenCorrespondence: Joseph V Pergolizzi JrNEMA Research, Inc., Naples, FL, USAEmail jpjmd@msn.comAbstract: Peripherally acting μ-opioid receptor antagonists (PAMORAs) constitute a class of drugs which reverse opioid-induced constipation (OIC) with similar opioid analgesic effects. OIC differs from other forms of constipation in that it is an iatrogenic condition that occurs when an opioid acts on the dense network of μ-opioid receptors in the enteric system, which affect a variety of functions including gastrointestinal motility, secretion, and other factors that can cause bowel dysfunction. Unfortunately, laxative products, bowel regimens, dietary changes, and lifestyle modifications have limited effectiveness in preventing OIC, Opioid-associated adverse effect which occurs in 40% to 80% of opioid patients and may led to cessation of the treatment. PAMORAs are μ-receptor opioid antagonists specifically developed so that they have very limited ability to cross the blood-brain barrier and thus they are able to antagonize peripheral but not central μ-opioid receptors. PAMORAs are designed to have no effect on the analgesic benefits of opioid pain relievers but to relieve but antagonizing the effects of the opioid in the gastrointestinal system. The three main PAMORAS are methyltrexone (oral or parenteral), naldemedine (oral only), and naloxegol (oral only). Clinical studies demonstrate the safety and efficacy of these agents for alleviating constipation without diminishing the analgesic effect of opioid therapy. The aim of this narrative review to update the current status of PAMORAs for treating OIC in terms of safety and efficacy.Keywords: constipation, methylnaltrexone, naldemedine, naloxegol, opioid antagonism, opioid-associated bowel disorder, opioid-associated side effects, pain
Published: 2020

10. Cardiovascular Diseases And Psychiatric Disorders During The Diagnostic Workup Of Suspected Hematological Malignancy

Author: Liu Q, Andersson TML, Jöud A, Shen Q, Schelin MEC, Magnusson PKE, Smedby KE, and Fang F
Subjects: hematological malignancy, cardiovascular diseases, psychiatric disorders, diagnostic workup, Infectious and parasitic diseases, RC109-216
Abstract: Qianwei Liu,1 Therese ML Andersson,2 Anna Jöud,3,4 Qing Shen,2 Maria EC Schelin,3,5 Patrik KE Magnusson,2 Karin E Smedby,6 Fang Fang1 1Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden; 2Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden; 3Lund University, Department of Clinical Sciences Lund, Orthopaedics, Lund, Sweden; 4Lund University, Department of Laboratory Medicine, Occupational and Environmental Medicine, Lund, Sweden; 5Institute for Palliative Care, IKVL, Lund University and Region Skåne, Lund, Sweden; 6Clinical Epidemiology Division, Department of Medicine, Karolinska Institutet, Stockholm, SwedenCorrespondence: Fang FangInstitute of Environmental Medicine, Karolinska Institutet, Box 23109, 104 35, Stockholm, SwedenTel +46 8 52486131Email fang.fang@ki.seBackground: Little attention has been given to the risk of cardiovascular and psychiatric comorbidities during the clinical evaluation of a suspected hematological malignancy.Methods: Based on Skåne Healthcare Register, we performed a population-based cohort study of 1,527,449 individuals residing during 2005–2014 in Skåne, Sweden. We calculated the incidence rate ratios (IRRs) of cardiovascular diseases or psychiatric disorders during the diagnostic workup of 5495 patients with hematological malignancy and 18,906 individuals that underwent a bone marrow aspiration or biopsy or lymph node biopsy without receiving a diagnosis of any malignancy (“biopsied individuals”), compared to individuals without such experience (i.e., reference).Results: There was a higher rate of cardiovascular diseases during the diagnostic workup of patients with hematological malignancy (overall IRR, 3.3; 95% CI, 2.9 to 3.8; greatest IRR for embolism and thrombosis, 8.1; 95% CI, 5.2 to 12.8) and biopsied individuals (overall IRR, 4.9; 95% CI, 4.6 to 5.3; greatest IRR for stroke, 37.5; 95% CI, 34.1 to 41.2), compared to reference. Similarly, there was a higher rate of psychiatric disorders during the diagnostic workup of patients with hematological malignancy (IRR, 2.1; 95% CI, 1.5 to 2.8) and biopsied individuals (IRR, 3.1; 95% CI, 2.9 to 3.4). The rate increases were greater around the time of diagnosis or biopsy, compared to thereafter, for both outcomes.Conclusion: There were higher rates of cardiovascular diseases and psychiatric disorders during the diagnostic workup of a suspected hematological malignancy, regardless of the final diagnosis.Keywords: hematological malignancy, cardiovascular diseases, psychiatric disorders, diagnostic workup
Published: 2019

11. 'Look, My Name! I Can Write' -- Literacy Events and Digital Technology in the Preschool Atelier

Author: Lena O. Magnusson
Abstract: This article explores and displays some of the literacy events taking place in the context of early childhood education in Sweden. More specifically, the literacy events are part of the educational practices in the atelier of Reggio Emilia inspired preschools in Sweden. As parts of an ethnographic study of aesthetic activities, including digital technology, these literacy events awoke the researcher's interest. The literacy events are analysed from a sociocultural perspective reinforced by the use of multimodal theory. The results show how the literacy events in the ateliers become playful explorations. The children use the atelier's specific cultural and social potentiality to explore and develop written and oral language as part of the visual and aesthetic literacy practices taking place there.
Published: 2024
Full Text: View/download PDF

12. Stable properties under weakly geometrically flat maps

Author: Barlet, Daniel and Magnusson, Jon Ingolfur
Subjects: Mathematics - Complex Variables
Abstract: In this note we show that a weakly geometrically flat map $\pi$ : M $\rightarrow$ N between pure dimensional complex spaces has the local lifting property for cycles. From this result we also deduce that, under these hypotheses, several properties of M are transferred to N.
Published: 2024

13. Infrared resonance-lattice device technology

Author: Magnusson, Robert, Ko, Yeong H., Lee, Kyu J., Simlan, Fairooz A., Bootpakdeetam, Pawarat, Chen, Renjie, Weidanz, Debra Wawro, Gimlin, Susanne, and Ghaffari, Soroush
Subjects: Physics - Optics
Abstract: We present subwavelength resonant lattices fashioned as nano- and microstructured films as a basis for a host of device concepts. Whereas the canonical physical properties are fully embodied in a one-dimensional periodic lattice, the final device constructs are often patterned in two-dimensionally-modulated films in which case we may refer to them as photonic crystal slabs, metamaterials, or metasurfaces. These surfaces can support lateral modes and localized field signatures with propagative and evanescent diffraction channels critically controlling the response. The governing principle of guided-mode, or lattice, resonance enables diverse spectral expressions such that a single-layer component can behave as a sensor, reflector, filter, or polarizer. This structural sparsity contrasts strongly with the venerable field of multi-layer thin-film optics that is basis for most optical components on the market today. The lattice resonance effect can be exploited in all major spectral regions with appropriate low-loss materials and fabrication resources. In this paper, we highlight resonant device technology and present our work on design, fabrication, and characterization of optical elements operating in the near-IR, mid-IR, and long-wave IR spectral regions. Examples of fabricated and tested devices include biological sensors, high-contrast-ratio polarizers, narrow-band notch filters, and wideband high reflectors., Comment: 11 pages, 10 figures
Published: 2024

14. Human Detection from 4D Radar Data in Low-Visibility Field Conditions

Author: Skog, Mikael, Kotlyar, Oleksandr, Kubelka, Vladimír, and Magnusson, Martin
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Robotics
Abstract: Autonomous driving technology is increasingly being used on public roads and in industrial settings such as mines. While it is essential to detect pedestrians, vehicles, or other obstacles, adverse field conditions negatively affect the performance of classical sensors such as cameras or lidars. Radar, on the other hand, is a promising modality that is less affected by, e.g., dust, smoke, water mist or fog. In particular, modern 4D imaging radars provide target responses across the range, vertical angle, horizontal angle and Doppler velocity dimensions. We propose TMVA4D, a CNN architecture that leverages this 4D radar modality for semantic segmentation. The CNN is trained to distinguish between the background and person classes based on a series of 2D projections of the 4D radar data that include the elevation, azimuth, range, and Doppler velocity dimensions. We also outline the process of compiling a novel dataset consisting of data collected in industrial settings with a car-mounted 4D radar and describe how the ground-truth labels were generated from reference thermal images. Using TMVA4D on this dataset, we achieve an mIoU score of 78.2% and an mDice score of 86.1%, evaluated on the two classes background and person, Comment: Submitted to Radar in Robotics workshop at ICRA 2024
Published: 2024

15. Spectral Tuning of Polarization Selective Reflections Bands in GLAD deposited HfAlN chiral sculptured thin films

Author: Bairagi, Samiran, Lorentzon, Marcus, Angay, Firat, Magnusson, Roger, Birch, Jens, Hsiao, Ching-Lien, Ghafoor, Naureen, and Järrendahl, Kenneth
Subjects: Physics - Optics
Abstract: We present the first report on fabrication of Hafnium aluminum nitride chiral sculptured thin films (CSTFs) using reactive magnetron sputtering in a glancing angle deposition configuration, and the analysis of its optical polarization properties. The resulting CSTFs were designed to give interference extrema or so-called circular Bragg (CB) resonances at desired wavelengths in the region from 370 to 690 nm. This was achieved by tailoring the growth of the chiral thin films to obtain a dielectric pitch between 87 and 260.9 nm. The spectral positions of the obtained CB resonances were compared to values from analytical expressions. Contrary to the common case where the dielectric pitch is half of the growth-related rotational pitch due to a 180{\deg} symmetry, this pitch was shown to be the same as the rotational pitch. It is concluded that this is due to the c-axis of the CSTF being tilted about 45{\deg} from the substrate normal. The morphology and crystallographic characterizations were done using scanning electron microscopy and X-ray diffraction, respectively, while the tilt of the crystal lattice was corroborated using X-ray diffraction pole figures. The optical response from the CSTFs was analyzed using Mueller matrix spectroscopic ellipsometry from which the degree of circular polarization at the CB resonances was obtained. In addition, a strong non-reciprocal reflection was observed which could be attributed to the helicoidal morphology and the intrinsic crystal tilt. An optical layered model of the chiral structure including azimuthal twist and using the Cauchy dispersion relations was used to simulate the Mueller matrix elements and compare with the ellipsometry measurements. The correlation between the simulated and experimental data gave information of the morphological parameters of the CSTF and its optical properties.
Published: 2024

16. Compressed Federated Reinforcement Learning with a Generative Model

Author: Beikmohammadi, Ali, Khirirat, Sarit, and Magnússon, Sindri
Subjects: Computer Science - Distributed, Parallel, and Cluster Computing, Computer Science - Machine Learning, Computer Science - Multiagent Systems
Abstract: Reinforcement learning has recently gained unprecedented popularity, yet it still grapples with sample inefficiency. Addressing this challenge, federated reinforcement learning (FedRL) has emerged, wherein agents collaboratively learn a single policy by aggregating local estimations. However, this aggregation step incurs significant communication costs. In this paper, we propose CompFedRL, a communication-efficient FedRL approach incorporating both \textit{periodic aggregation} and (direct/error-feedback) compression mechanisms. Specifically, we consider compressed federated $Q$-learning with a generative model setup, where a central server learns an optimal $Q$-function by periodically aggregating compressed $Q$-estimates from local agents. For the first time, we characterize the impact of these two mechanisms (which have remained elusive) by providing a finite-time analysis of our algorithm, demonstrating strong convergence behaviors when utilizing either direct or error-feedback compression. Our bounds indicate improved solution accuracy concerning the number of agents and other federated hyperparameters while simultaneously reducing communication costs. To corroborate our theory, we also conduct in-depth numerical experiments to verify our findings, considering Top-$K$ and Sparsified-$K$ sparsification operators., Comment: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD 2024)
Published: 2024
Full Text: View/download PDF

17. LaCE-LHMP: Airflow Modelling-Inspired Long-Term Human Motion Prediction By Enhancing Laminar Characteristics in Human Flow

Author: Zhu, Yufei, Fan, Han, Rudenko, Andrey, Magnusson, Martin, Schaffernicht, Erik, and Lilienthal, Achim J.
Subjects: Computer Science - Robotics
Abstract: Long-term human motion prediction (LHMP) is essential for safely operating autonomous robots and vehicles in populated environments. It is fundamental for various applications, including motion planning, tracking, human-robot interaction and safety monitoring. However, accurate prediction of human trajectories is challenging due to complex factors, including, for example, social norms and environmental conditions. The influence of such factors can be captured through Maps of Dynamics (MoDs), which encode spatial motion patterns learned from (possibly scattered and partial) past observations of motion in the environment and which can be used for data-efficient, interpretable motion prediction (MoD-LHMP). To address the limitations of prior work, especially regarding accuracy and sensitivity to anomalies in long-term prediction, we propose the Laminar Component Enhanced LHMP approach (LaCE-LHMP). Our approach is inspired by data-driven airflow modelling, which estimates laminar and turbulent flow components and uses predominantly the laminar components to make flow predictions. Based on the hypothesis that human trajectory patterns also manifest laminar flow (that represents predictable motion) and turbulent flow components (that reflect more unpredictable and arbitrary motion), LaCE-LHMP extracts the laminar patterns in human dynamics and uses them for human motion prediction. We demonstrate the superior prediction performance of LaCE-LHMP through benchmark comparisons with state-of-the-art LHMP methods, offering an unconventional perspective and a more intuitive understanding of human movement patterns., Comment: Accepted to the 2024 IEEE International Conference on Robotics and Automation (ICRA)
Published: 2024

18. High-Fidelity SLAM Using Gaussian Splatting with Rendering-Guided Densification and Regularized Optimization

Author: Sun, Shuo, Mielle, Malcolm, Lilienthal, Achim J., and Magnusson, Martin
Subjects: Computer Science - Robotics, Computer Science - Computer Vision and Pattern Recognition
Abstract: We propose a dense RGBD SLAM system based on 3D Gaussian Splatting that provides metrically accurate pose tracking and visually realistic reconstruction. To this end, we first propose a Gaussian densification strategy based on the rendering loss to map unobserved areas and refine reobserved areas. Second, we introduce extra regularization parameters to alleviate the forgetting problem in the continuous mapping problem, where parameters tend to overfit the latest frame and result in decreasing rendering quality for previous frames. Both mapping and tracking are performed with Gaussian parameters by minimizing re-rendering loss in a differentiable way. Compared to recent neural and concurrently developed gaussian splatting RGBD SLAM baselines, our method achieves state-of-the-art results on the synthetic dataset Replica and competitive results on the real-world dataset TUM., Comment: Accepted by IROS 2024
Published: 2024

19. Notochord: a Flexible Probabilistic Model for Real-Time MIDI Performance

Author: Shepardson, Victor, Armitage, Jack, and Magnusson, Thor
Subjects: Computer Science - Sound, Computer Science - Artificial Intelligence, Electrical Engineering and Systems Science - Audio and Speech Processing
Abstract: Deep learning-based probabilistic models of musical data are producing increasingly realistic results and promise to enter creative workflows of many kinds. Yet they have been little-studied in a performance setting, where the results of user actions typically ought to feel instantaneous. To enable such study, we designed Notochord, a deep probabilistic model for sequences of structured events, and trained an instance of it on the Lakh MIDI dataset. Our probabilistic formulation allows interpretable interventions at a sub-event level, which enables one model to act as a backbone for diverse interactive musical functions including steerable generation, harmonization, machine improvisation, and likelihood-based interfaces. Notochord can generate polyphonic and multi-track MIDI, and respond to inputs with latency below ten milliseconds. Training code, model checkpoints and interactive examples are provided as open source software., Comment: 12 pages, 6 figures. Proceedings of the 3rd Conference on AI Music Creativity (2022, September 17)
Published: 2024
Full Text: View/download PDF

20. Region-based U-net for accelerated training and enhanced precision in deep brain segmentation

Author: Li, Mengyu, Magnusson, Magnus, van Eimeren, Thilo, and Ellingsen, Lotta M.
Subjects: Electrical Engineering and Systems Science - Image and Video Processing, Computer Science - Computer Vision and Pattern Recognition
Abstract: Segmentation of brain structures on MRI is the primary step for further quantitative analysis of brain diseases. Manual segmentation is still considered the gold standard in terms of accuracy; however, such data is extremely time-consuming to generate. This paper presents a deep learning-based segmentation approach for 12 deep-brain structures, utilizing multiple region-based U-Nets. The brain is divided into three focal regions of interest that encompass the brainstem, the ventricular system, and the striatum. Next, three region-based U-nets are run in parallel to parcellate these larger structures into their respective four substructures. This approach not only greatly reduces the training and processing times but also significantly enhances the segmentation accuracy, compared to segmenting the entire MRI image at once. Our approach achieves remarkable accuracy with an average Dice Similarity Coefficient (DSC) of 0.901 and 95% Hausdorff Distance (HD95) of 1.155 mm. The method was compared with state-of-the-art segmentation approaches, demonstrating a high level of accuracy and robustness of the proposed method., Comment: 5 pages, 2 figures, 21st IEEE International Symposium on Biomedical Imaging
Published: 2024
Full Text: View/download PDF

21. TH\'OR-MAGNI: A Large-scale Indoor Motion Capture Recording of Human Movement and Robot Interaction

Author: Schreiter, Tim, de Almeida, Tiago Rodrigues, Zhu, Yufei, Maestro, Eduardo Gutierrez, Morillo-Mendez, Lucas, Rudenko, Andrey, Palmieri, Luigi, Kucner, Tomasz P., Magnusson, Martin, and Lilienthal, Achim J.
Subjects: Computer Science - Robotics
Abstract: We present a new large dataset of indoor human and robot navigation and interaction, called TH\"OR-MAGNI, that is designed to facilitate research on social navigation: e.g., modelling and predicting human motion, analyzing goal-oriented interactions between humans and robots, and investigating visual attention in a social interaction context. TH\"OR-MAGNI was created to fill a gap in available datasets for human motion analysis and HRI. This gap is characterized by a lack of comprehensive inclusion of exogenous factors and essential target agent cues, which hinders the development of robust models capable of capturing the relationship between contextual cues and human behavior in different scenarios. Unlike existing datasets, TH\"OR-MAGNI includes a broader set of contextual features and offers multiple scenario variations to facilitate factor isolation. The dataset includes many social human-human and human-robot interaction scenarios, rich context annotations, and multi-modal data, such as walking trajectories, gaze tracking data, and lidar and camera streams recorded from a mobile robot. We also provide a set of tools for visualization and processing of the recorded data. TH\"OR-MAGNI is, to the best of our knowledge, unique in the amount and diversity of sensor data collected in a contextualized and socially dynamic environment, capturing natural human-robot interactions., Comment: Submitted to The International Journal of Robotics Research (IJRR) on 28 of February 2024
Published: 2024

22. DiffSF: Diffusion Models for Scene Flow Estimation

Author: Zhang, Yushan, Wandt, Bastian, Magnusson, Maria, and Felsberg, Michael
Subjects: Computer Science - Computer Vision and Pattern Recognition
Abstract: Scene flow estimation is an essential ingredient for a variety of real-world applications, especially for autonomous agents, such as self-driving cars and robots. While recent scene flow estimation approaches achieve a reasonable accuracy, their applicability to real-world systems additionally benefits from a reliability measure. Aiming at improving accuracy while additionally providing an estimate for uncertainty, we propose DiffSF that combines transformer-based scene flow estimation with denoising diffusion models. In the diffusion process, the ground truth scene flow vector field is gradually perturbed by adding Gaussian noise. In the reverse process, starting from randomly sampled Gaussian noise, the scene flow vector field prediction is recovered by conditioning on a source and a target point cloud. We show that the diffusion process greatly increases the robustness of predictions compared to prior approaches resulting in state-of-the-art performance on standard scene flow estimation benchmarks. Moreover, by sampling multiple times with different initial states, the denoising process predicts multiple hypotheses, which enables measuring the output uncertainty, allowing our approach to detect a majority of the inaccurate predictions. The code is available at https://github.com/ZhangYushan3/DiffSF.
Published: 2024

23. On the Convergence of Federated Learning Algorithms without Data Similarity

Author: Beikmohammadi, Ali, Khirirat, Sarit, and Magnússon, Sindri
Subjects: Computer Science - Machine Learning, Computer Science - Computer Science and Game Theory
Abstract: Data similarity assumptions have traditionally been relied upon to understand the convergence behaviors of federated learning methods. Unfortunately, this approach often demands fine-tuning step sizes based on the level of data similarity. When data similarity is low, these small step sizes result in an unacceptably slow convergence speed for federated methods. In this paper, we present a novel and unified framework for analyzing the convergence of federated learning algorithms without the need for data similarity conditions. Our analysis centers on an inequality that captures the influence of step sizes on algorithmic convergence performance. By applying our theorems to well-known federated algorithms, we derive precise expressions for three widely used step size schedules: fixed, diminishing, and step-decay step sizes, which are independent of data similarity conditions. Finally, we conduct comprehensive evaluations of the performance of these federated learning algorithms, employing the proposed step size strategies to train deep neural network models on benchmark datasets under varying data similarity conditions. Our findings demonstrate significant improvements in convergence speed and overall performance, marking a substantial advancement in federated learning research., Comment: Accepted by the IEEE Transactions on Big Data Journal
Published: 2024

24. Parallel Momentum Methods Under Biased Gradient Estimations

Author: Beikmohammadi, Ali, Khirirat, Sarit, and Magnússon, Sindri
Subjects: Computer Science - Machine Learning
Abstract: Parallel stochastic gradient methods are gaining prominence in solving large-scale machine learning problems that involve data distributed across multiple nodes. However, obtaining unbiased stochastic gradients, which have been the focus of most theoretical research, is challenging in many distributed machine learning applications. The gradient estimations easily become biased, for example, when gradients are compressed or clipped, when data is shuffled, and in meta-learning and reinforcement learning. In this work, we establish worst-case bounds on parallel momentum methods under biased gradient estimation on both general non-convex and $\mu$-PL problems. Our analysis covers general distributed optimization problems, and we work out the implications for special cases where gradient estimates are biased, i.e. in meta-learning and when the gradients are compressed or clipped. Our numerical experiments verify our theoretical findings and show faster convergence performance of momentum methods than traditional biased gradient descent., Comment: 12 pages
Published: 2024

25. Towards Efficient Quantum Computing for Quantum Chemistry: Reducing Circuit Complexity with Transcorrelated and Adaptive Ansatz Techniques

Author: Magnusson, Erika, Fitzpatrick, Aaron, Knecht, Stefan, Rahm, Martin, and Dobrautz, Werner
Subjects: Quantum Physics, Condensed Matter - Strongly Correlated Electrons, Physics - Chemical Physics, Physics - Computational Physics
Abstract: The near-term utility of quantum computers is hindered by hardware constraints in the form of noise. One path to achieving noise resilience in hybrid quantum algorithms is to decrease the required circuit depth -- the number of applied gates -- to solve a given problem. This work demonstrates how to reduce circuit depth by combining the transcorrelated (TC) approach with adaptive quantum ans\"atze and their implementations in the context of variational quantum imaginary time evolution (AVQITE). The combined TC-AVQITE method is used to calculate ground state energies across the potential energy surfaces of H$_4$, LiH, and H$_2$O. In particular, H$_4$ is a notoriously difficult case where unitary coupled cluster theory, including singles and doubles excitations, fails to provide accurate results. Adding TC yields energies close to the complete basis set (CBS) limit while reducing the number of necessary operators -- and thus circuit depth -- in the adaptive ans\"atze. The reduced circuit depth furthermore makes our algorithm more noise-resilient and accelerates convergence. Our study demonstrates that combining the TC method with adaptive ans\"atze yields compact, noise-resilient, and easy-to-optimize quantum circuits that yield accurate quantum chemistry results close to the CBS limit.
Published: 2024
Full Text: View/download PDF

26. Improving forecasts of precipitation extremes over Northern and Central Italy using machine learning

Author: Grazzini, Federico, Dorrington, Joshua, Grams, Christian M., Craig, George C., Magnusson, Linus, and Vitart, Frederic
Subjects: Physics - Atmospheric and Oceanic Physics
Abstract: The accurate prediction of intense precipitation events is one of the main objectives of operational weather services. This task is even more relevant nowadays, with the rapid progression of global warming which intensifies these events. Numerical weather prediction models have improved continuously over time, providing uncertainty estimation with dynamical ensembles. However, direct precipitation forecasting is still challenging. Greater availability of machine learning tools paves the way to a hybrid forecasting approach, with the optimal combination of physical models, event statistics, and user-oriented post-processing. Here we describe a specific chain, based on a random forest pipeline, specialised in recognizing favourable synoptic conditions leading to precipitation extremes and subsequently classifying extremes into predefined types. The application focuses on Northern and Central Italy, taken as a testbed region, but is seamlessly extensible to other regions and timescales. The system is called MaLCoX (Machine Learning model predicting Conditions for eXtreme precipitation) and is running daily at the Italian regional weather service of ARPAE Emilia-Romagna. MalCoX has been trained with the ARCIS gridded high-resolution precipitation dataset as the target truth, using the last 20 years of the ECMWF re-forecast dataset as input predictors. We show that, with a long enough training period, the optimal blend of larger-scale information with direct model output improves the probabilistic forecast accuracy of extremes in the medium range. In addition, with specific methods, we provide a useful diagnostic to convey to forecasters the underlying physical storyline which makes a meteorological event extreme., Comment: 18 pages, 13 figures
Published: 2024

27. OLMo: Accelerating the Science of Language Models

Author: Groeneveld, Dirk, Beltagy, Iz, Walsh, Pete, Bhagia, Akshita, Kinney, Rodney, Tafjord, Oyvind, Jha, Ananya Harsh, Ivison, Hamish, Magnusson, Ian, Wang, Yizhong, Arora, Shane, Atkinson, David, Authur, Russell, Chandu, Khyathi Raghavi, Cohan, Arman, Dumas, Jennifer, Elazar, Yanai, Gu, Yuling, Hessel, Jack, Khot, Tushar, Merrill, William, Morrison, Jacob, Muennighoff, Niklas, Naik, Aakanksha, Nam, Crystal, Peters, Matthew E., Pyatkin, Valentina, Ravichander, Abhilasha, Schwenk, Dustin, Shah, Saurabh, Smith, Will, Strubell, Emma, Subramani, Nishant, Wortsman, Mitchell, Dasigi, Pradeep, Lambert, Nathan, Richardson, Kyle, Zettlemoyer, Luke, Dodge, Jesse, Lo, Kyle, Soldaini, Luca, Smith, Noah A., and Hajishirzi, Hannaneh
Subjects: Computer Science - Computation and Language
Abstract: Language models (LMs) have become ubiquitous in both NLP research and in commercial product offerings. As their commercial importance has surged, the most powerful models have become closed off, gated behind proprietary interfaces, with important details of their training data, architectures, and development undisclosed. Given the importance of these details in scientifically studying these models, including their biases and potential risks, we believe it is essential for the research community to have access to powerful, truly open LMs. To this end, we have built OLMo, a competitive, truly Open Language Model, to enable the scientific study of language models. Unlike most prior efforts that have only released model weights and inference code, we release OLMo alongside open training data and training and evaluation code. We hope this release will empower the open research community and inspire a new wave of innovation.
Published: 2024

28. Effects of an amazonian dam on taxonomic, functional and phylogenetic diversity of non-volant small mammals

Author: da Silva Araujo, Raylenne, Dineli Bobrowiec, Paulo E., Stevens, Richard D., de Moura, Raquel Teixeira, Aurélio L. Sábato, Marco, Sábato, Eduardo Lima, and Magnusson, William E.
Published: 2024
Full Text: View/download PDF

29. Epigenetic regulation by polycomb repressive complex 1 promotes cerebral cavernous malformations

Author: Pham, Van-Cuong, Rödel, Claudia Jasmin, Valentino, Mariaelena, Malinverno, Matteo, Paolini, Alessio, Münch, Juliane, Pasquier, Candice, Onyeogaziri, Favour C, Lazovic, Bojana, Girard, Romuald, Koskimäki, Janne, Hußmann, Melina, Keith, Benjamin, Jachimowicz, Daniel, Kohl, Franziska, Hagelkruys, Astrid, Penninger, Josef M, Schulte-Merker, Stefan, Awad, Issam A, Hicks, Ryan, Magnusson, Peetra U, Faurobert, Eva, Pagani, Massimiliano, and Abdelilah-Seyfried, Salim
Published: 2024
Full Text: View/download PDF

30. The effectiveness of stretching exercises in patients with fibromyalgia: A systematic review

Author: Støve, Morten Pallisgaard, Dissing, Anne Mette Lücke, Thomsen, Janus Laust, Magnusson, Stig Peter, and Riis, Allan
Published: 2024
Full Text: View/download PDF

31. Semorinemab Pharmacokinetics and The Effect on Plasma Total Tau Pharmacodynamics in Clinical Studies

Author: Ramakrishnan, Vidya, Bender, B., Langenhorst, J., Magnusson, M. O., Dolton, M., Shim, J., Fuji, R. N., Monteiro, C., Teng, E., Kassir, N., and Jin, J.
Published: 2024
Full Text: View/download PDF

32. Screening protocol for freshwater filamentous macroalgae bioremediation of primary municipal wastewater

Author: Novak, Indira N., Magnusson, Marie, Craggs, Rupert J., and Lawton, Rebecca J.
Published: 2024
Full Text: View/download PDF

33. Genome-Wide Association Study of Obsessive-Compulsive Symptoms including 33,943 individuals from the general population

Author: Strom, Nora I., Burton, Christie L., Iyegbe, Conrad, Silzer, Talisa, Antonyan, Lilit, Pool, René, Lemire, Mathieu, Crowley, James J., Hottenga, Jouke-Jan, Ivanov, Volen Z., Larsson, Henrik, Lichtenstein, Paul, Magnusson, Patrik, Rück, Christian, Schachar, Russell, Wu, Hei Man, Cath, Danielle, Crosbie, Jennifer, Mataix-Cols, David, Boomsma, Dorret I., Mattheisen, Manuel, Meier, Sandra M., Smit, Dirk J. A., and Arnold, Paul D.
Published: 2024
Full Text: View/download PDF

34. Homozygosity for a stop-gain variant in CCDC201 causes primary ovarian insufficiency

Author: Oddsson, Asmundur, Steinthorsdottir, Valgerdur, Oskarsson, Gudjon R., Styrkarsdottir, Unnur, Moore, Kristjan H. S., Isberg, Salvor, Halldorsson, Gisli H., Sveinbjornsson, Gardar, Westergaard, David, Nielsen, Henriette Svarre, Fridriksdottir, Run, Jensson, Brynjar O., Arnadottir, Gudny A., Jonsson, Hakon, Sturluson, Arni, Snaebjarnarson, Audunn S., Andreassen, Ole A., Walters, G. Bragi, Nyegaard, Mette, Erikstrup, Christian, Steingrimsdottir, Thora, Lie, Rolv T., Melsted, Pall, Jonsdottir, Ingileif, Halldorsson, Bjarni V., Thorleifsson, Gudmar, Saemundsdottir, Jona, Magnusson, Olafur Th., Banasik, Karina, Sorensen, Erik, Masson, Gisli, Pedersen, Ole Birger, Tryggvadottir, Laufey, Haavik, Jan, Ostrowski, Sisse Rye, Stefansson, Hreinn, Holm, Hilma, Rafnar, Thorunn, Gudbjartsson, Daniel F., Sulem, Patrick, and Stefansson, Kari
Published: 2024
Full Text: View/download PDF

35. Structure and function of Achilles and patellar tendons following moderate slow resistance training in young and old men

Author: Létocart, Adrien J., Svensson, René B., Mabesoone, Franck, Charleux, Fabrice, Marin, Frédéric, Dermigny, Quentin, Magnusson, S. Peter, Couppé, Christian, and Grosset, Jean-François
Published: 2024
Full Text: View/download PDF

36. The correlation between CpG methylation and gene expression is driven by sequence variants

Author: Stefansson, Olafur Andri, Sigurpalsdottir, Brynja Dogg, Rognvaldsson, Solvi, Halldorsson, Gisli Hreinn, Juliusson, Kristinn, Sveinbjornsson, Gardar, Gunnarsson, Bjarni, Beyter, Doruk, Jonsson, Hakon, Gudjonsson, Sigurjon Axel, Olafsdottir, Thorunn Asta, Saevarsdottir, Saedis, Magnusson, Magnus Karl, Lund, Sigrun Helga, Tragante, Vinicius, Oddsson, Asmundur, Hardarson, Marteinn Thor, Eggertsson, Hannes Petur, Gudmundsson, Reynir L., Sverrisson, Sverrir, Frigge, Michael L., Zink, Florian, Holm, Hilma, Stefansson, Hreinn, Rafnar, Thorunn, Jonsdottir, Ingileif, Sulem, Patrick, Helgason, Agnar, Gudbjartsson, Daniel F., Halldorsson, Bjarni V., Thorsteinsdottir, Unnur, and Stefansson, Kari
Published: 2024
Full Text: View/download PDF

37. Palms predict the distributions of birds in southwestern Amazonia and are potential surrogates for land-use planning by citizen scientists

Author: Menger, Juliana, Santorelli Junior, Sergio, Emilio, Thaise, Magnusson, William E., and Anciães, Marina
Published: 2024
Full Text: View/download PDF

38. Birth weight, sex, and celiac disease: a nationwide twin study

Author: Kuja-Halkola R, Lebwohl B, Halfvarson J, Emilsson L, Magnusson PK, and Ludvigsson JF
Subjects: AUTOIMMUNE, GESTATIONAL AGE, GLUTEN, REGISTRIES, RISK FACTORS, TWINS, Infectious and parasitic diseases, RC109-216
Abstract: Ralf Kuja-Halkola,1 Benjamin Lebwohl,1,2 Jonas Halfvarson,3 Louise Emilsson,4–6 Patrik K Magnusson,1 Jonas F Ludvigsson1,2,7,8 1Department Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden; 2Department of Medicine, Celiac Disease Center, Columbia University Medical Center, Columbia University, New York, NY, USA; 3Department of Gastroenterology, Faculty of Medicine and Health, Örebro University, Örebro, Sweden; 4Department of Health Management and Health Economy, Institute of Health and Society, University of Oslo, Oslo, Norway; 5Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA; 6Centre for Clinical Research, Vårdcentralen Värmlands Nysäter, County Council of Värmland, Värmland, 7Department of Pediatrics, Örebro University Hospital, Örebro, Sweden; 8Division of Epidemiology and Public Health, School of Medicine, City Hospital, University of Nottingham, Nottingham, UK Objective: Earlier research suggests that birth weight may be associated with celiac disease (CD), but the direction of association has been unclear potentially due to confounding effect from genetic and intrafamilial factors. Through within-twin analyses, we aimed to minimize confounding effects such as twins that share genetic and early environmental exposures.Materials and methods: Using the Swedish Twin Registry, we examined the birth weight of 146,830 twins according to the CD status. CD was defined as having villous atrophy according to a small intestinal biopsy reports.Results: The prevalence of diagnosed CD was 0.5% (n=669), and we included 407 discordant pairs of CD–non-CD twins. Comparing the 669 CD patients with non-CD twins, the association between birth weight and future CD was not statistically significant (odds ratio [OR] per 1000 g increase in birth weight: 1.16; 95% confidence interval [CI]=0.97–1.38). In males, the association was positive and statistically significant (OR=1.50; 95% CI=1.11–2.02). However, the association was not significant in within-pair analyses for both dizygotic and monozygotic twins and for both sexes.Conclusion: This population-based study found that in male twins, higher birth weight was associated with higher risk of CD. However, when comparing discordant twin pairs in within-twin pair analyses, there was no statistically significant association between birth weight, intrauterine growth, and future risk of CD. Keywords: autoimmune, gestational age, gluten, registries, risk factors, twins
Published: 2017

39. Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Author: Soldaini, Luca, Kinney, Rodney, Bhagia, Akshita, Schwenk, Dustin, Atkinson, David, Authur, Russell, Bogin, Ben, Chandu, Khyathi, Dumas, Jennifer, Elazar, Yanai, Hofmann, Valentin, Jha, Ananya Harsh, Kumar, Sachin, Lucy, Li, Lyu, Xinxi, Lambert, Nathan, Magnusson, Ian, Morrison, Jacob, Muennighoff, Niklas, Naik, Aakanksha, Nam, Crystal, Peters, Matthew E., Ravichander, Abhilasha, Richardson, Kyle, Shen, Zejiang, Strubell, Emma, Subramani, Nishant, Tafjord, Oyvind, Walsh, Pete, Zettlemoyer, Luke, Smith, Noah A., Hajishirzi, Hannaneh, Beltagy, Iz, Groeneveld, Dirk, Dodge, Jesse, and Lo, Kyle
Subjects: Computer Science - Computation and Language
Abstract: Information about pretraining corpora used to train the current best-performing language models is seldom discussed: commercial models rarely detail their data, and even open models are often released without accompanying training data or recipes to reproduce them. As a result, it is challenging to conduct and advance scientific research on language modeling, such as understanding how training data impacts model capabilities and limitations. To facilitate scientific research on language model pretraining, we curate and release Dolma, a three-trillion-token English corpus, built from a diverse mixture of web content, scientific papers, code, public-domain books, social media, and encyclopedic materials. We extensively document Dolma, including its design principles, details about its construction, and a summary of its contents. We present analyses and experimental results on intermediate states of Dolma to share what we have learned about important data curation practices. Finally, we open-source our data curation toolkit to enable reproduction of our work as well as support further research in large-scale data curation., Comment: Accepted at ACL 2024; Dataset: https://hf.co/datasets/allenai/dolma; Code: https://github.com/allenai/dolma
Published: 2024

40. SCANIA Component X Dataset: A Real-World Multivariate Time Series Dataset for Predictive Maintenance

Author: Kharazian, Zahra, Lindgren, Tony, Magnússon, Sindri, Steinert, Olof, and Reyna, Oskar Andersson
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: This paper presents a description of a real-world, multivariate time series dataset collected from an anonymized engine component (called Component X) of a fleet of trucks from SCANIA, Sweden. This dataset includes diverse variables capturing detailed operational data, repair records, and specifications of trucks while maintaining confidentiality by anonymization. It is well-suited for a range of machine learning applications, such as classification, regression, survival analysis, and anomaly detection, particularly when applied to predictive maintenance scenarios. The large population size and variety of features in the format of histograms and numerical counters, along with the inclusion of temporal information, make this real-world dataset unique in the field. The objective of releasing this dataset is to give a broad range of researchers the possibility of working with real-world data from an internationally well-known company and introduce a standard benchmark to the predictive maintenance field, fostering reproducible research., Comment: 10 pages, 8 figures
Published: 2024

41. A Cost-Sensitive Transformer Model for Prognostics Under Highly Imbalanced Industrial Data

Author: Beikmohammadi, Ali, Hamian, Mohammad Hosein, Khoeyniha, Neda, Lindgren, Tony, Steinert, Olof, and Magnússon, Sindri
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: The rapid influx of data-driven models into the industrial sector has been facilitated by the proliferation of sensor technology, enabling the collection of vast quantities of data. However, leveraging these models for failure detection and prognosis poses significant challenges, including issues like missing values and class imbalances. Moreover, the cost sensitivity associated with industrial operations further complicates the application of conventional models in this context. This paper introduces a novel cost-sensitive transformer model developed as part of a systematic workflow, which also integrates a hybrid resampler and a regression-based imputer. After subjecting our approach to rigorous testing using the APS failure dataset from Scania trucks and the SECOM dataset, we observed a substantial enhancement in performance compared to state-of-the-art methods. Moreover, we conduct an ablation study to analyze the contributions of different components in our proposed method. Our findings highlight the potential of our method in addressing the unique challenges of failure prediction in industrial settings, thereby contributing to enhanced reliability and efficiency in industrial operations.
Published: 2024

42. 3QFP: Efficient neural implicit surface reconstruction using Tri-Quadtrees and Fourier feature Positional encoding

Author: Sun, Shuo, Mielle, Malcolm, Lilienthal, Achim J., and Magnusson, Martin
Subjects: Computer Science - Robotics
Abstract: Neural implicit surface representations are currently receiving a lot of interest as a means to achieve high-fidelity surface reconstruction at a low memory cost, compared to traditional explicit representations.However, state-of-the-art methods still struggle with excessive memory usage and non-smooth surfaces. This is particularly problematic in large-scale applications with sparse inputs, as is common in robotics use cases. To address these issues, we first introduce a sparse structure, \emph{tri-quadtrees}, which represents the environment using learnable features stored in three planar quadtree projections. Secondly, we concatenate the learnable features with a Fourier feature positional encoding. The combined features are then decoded into signed distance values through a small multi-layer perceptron. We demonstrate that this approach facilitates smoother reconstruction with a higher completion ratio with fewer holes. Compared to two recent baselines, one implicit and one explicit, our approach requires only 10\%--50\% as much memory, while achieving competitive quality., Comment: ICRA2024
Published: 2024

43. Survey and Analysis of DNS Filtering Components

Author: Magnusson, Jonathan
Subjects: Computer Science - Cryptography and Security, Computer Science - Networking and Internet Architecture
Abstract: The Domain Name System (DNS) comprises name servers translating domain names into, commonly, IP addresses. Authoritative name servers hosts the resource records (RR) for certain zones, and resolver name servers are responsible for querying and answering DNS queries on behalf of their clients. Unfortunately, cybercriminals often use DNS for malicious purposes, such as phishing, malware distribution, and botnet communication. To combat these threats, filtering resolvers have become increasingly popular, employing various techniques to identify and block malicious requests. In this paper, we survey several techniques to implement and enhance the capabilities of filtering resolvers including response policy zones, threat intelligence feeds, and detection of algorithmically generated domains. We identify the current trends of each area and find missing intersections in the literature, which could be used to improve the effectiveness of filtering resolvers. In addition, we propose future work designing a framework for filtering resolvers using state-of-the-art approaches identified in this study.
Published: 2024

44. Paloma: A Benchmark for Evaluating Language Model Fit

Author: Magnusson, Ian, Bhagia, Akshita, Hofmann, Valentin, Soldaini, Luca, Jha, Ananya Harsh, Tafjord, Oyvind, Schwenk, Dustin, Walsh, Evan Pete, Elazar, Yanai, Lo, Kyle, Groeneveld, Dirk, Beltagy, Iz, Hajishirzi, Hannaneh, Smith, Noah A., Richardson, Kyle, and Dodge, Jesse
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Language models (LMs) commonly report perplexity on monolithic data held out from training. Implicitly or explicitly, this data is composed of domains$\unicode{x2013}$varying distributions of language. Rather than assuming perplexity on one distribution extrapolates to others, Perplexity Analysis for Language Model Assessment (Paloma), measures LM fit to 585 text domains, ranging from nytimes.com to r/depression on Reddit. We invite submissions to our benchmark and organize results by comparability based on compliance with guidelines such as removal of benchmark contamination from pretraining. Submissions can also record parameter and training token count to make comparisons of Pareto efficiency for performance as a function of these measures of cost. We populate our benchmark with results from 6 baselines pretrained on popular corpora. In case studies, we demonstrate analyses that are possible with Paloma, such as finding that pretraining without data beyond Common Crawl leads to inconsistent fit to many domains., Comment: Project Page: https://paloma.allen.ai/
Published: 2023

45. Catwalk: A Unified Language Model Evaluation Framework for Many Datasets

Author: Groeneveld, Dirk, Awadalla, Anas, Beltagy, Iz, Bhagia, Akshita, Magnusson, Ian, Peng, Hao, Tafjord, Oyvind, Walsh, Pete, Richardson, Kyle, and Dodge, Jesse
Subjects: Computer Science - Computation and Language
Abstract: The success of large language models has shifted the evaluation paradigms in natural language processing (NLP). The community's interest has drifted towards comparing NLP models across many tasks, domains, and datasets, often at an extreme scale. This imposes new engineering challenges: efforts in constructing datasets and models have been fragmented, and their formats and interfaces are incompatible. As a result, it often takes extensive (re)implementation efforts to make fair and controlled comparisons at scale. Catwalk aims to address these issues. Catwalk provides a unified interface to a broad range of existing NLP datasets and models, ranging from both canonical supervised training and fine-tuning, to more modern paradigms like in-context learning. Its carefully-designed abstractions allow for easy extensions to many others. Catwalk substantially lowers the barriers to conducting controlled experiments at scale. For example, we finetuned and evaluated over 64 models on over 86 datasets with a single command, without writing any code. Maintained by the AllenNLP team at the Allen Institute for Artificial Intelligence (AI2), Catwalk is an ongoing open-source effort: https://github.com/allenai/catwalk., Comment: technical report, work in progress
Published: 2023

46. Asynchronous Distributed Optimization with Delay-free Parameters

Author: Wu, Xuyang, Liu, Changxin, Magnusson, Sindri, and Johansson, Mikael
Subjects: Mathematics - Optimization and Control, Computer Science - Machine Learning
Abstract: Existing asynchronous distributed optimization algorithms often use diminishing step-sizes that cause slow practical convergence, or use fixed step-sizes that depend on and decrease with an upper bound of the delays. Not only are such delay bounds hard to obtain in advance, but they also tend to be large and rarely attained, resulting in unnecessarily slow convergence. This paper develops asynchronous versions of two distributed algorithms, Prox-DGD and DGD-ATC, for solving consensus optimization problems over undirected networks. In contrast to alternatives, our algorithms can converge to the fixed point set of their synchronous counterparts using step-sizes that are independent of the delays. We establish convergence guarantees for strongly and weakly convex problems under both partial and total asynchrony. We also show that the convergence speed of the two asynchronous methods adapts to the actual level of asynchrony rather than being constrained by the worst-case. Numerical experiments demonstrate a strong practical performance of our asynchronous algorithms., Comment: 15 pages. arXiv admin note: text overlap with arXiv:2303.18034
Published: 2023

47. Towards Human-Level Text Coding with LLMs: The Case of Fatherhood Roles in Public Policy Documents

Author: Lupo, Lorenzo, Magnusson, Oscar, Hovy, Dirk, Naurin, Elin, and Wängnerud, Lena
Subjects: Computer Science - Computation and Language, Computer Science - Computers and Society, J.4, I.2
Abstract: Recent advances in large language models (LLMs) like GPT-3.5 and GPT-4 promise automation with better results and less programming, opening up new opportunities for text analysis in political science. In this study, we evaluate LLMs on three original coding tasks involving typical complexities encountered in political science settings: a non-English language, legal and political jargon, and complex labels based on abstract constructs. Along the paper, we propose a practical workflow to optimize the choice of the model and the prompt. We find that the best prompting strategy consists of providing the LLMs with a detailed codebook, as the one provided to human coders. In this setting, an LLM can be as good as or possibly better than a human annotator while being much faster, considerably cheaper, and much easier to scale to large amounts of text. We also provide a comparison of GPT and popular open-source LLMs, discussing the trade-offs in the model's choice. Our software allows LLMs to be easily used as annotators and is publicly available: https://github.com/lorelupo/pappa.
Published: 2023

48. Misclassification of hypertrophic cardiomyopathy: validation of diagnostic codes

Author: Magnusson P, Palm A, Branden E, and Mörner S
Subjects: diagnostic error, diagnosis, epidemiology, hypertrophic cardiomyopathy, International Classification of Diseases, register, Infectious and parasitic diseases, RC109-216
Abstract: Peter Magnusson,1,2 Andreas Palm,2,3 Eva Branden,2,4 Stellan Mörner5 1Cardiology Research Unit, Department of Medicine, Karolinska Institutet, Stockholm, 2Centre for Research and Development, Uppsala University, Region Gävleborg, Gävle, 3Department of Medical Sciences, Respiratory, Allergy and Sleep Research, Uppsala University, Uppsala, 4Department of Medicine, Karolinska Institutet, Stockholm, 5Heart Center and Department of Public Health and Clinical Medicine, Umeå University, Umeå, Sweden Purpose: To validate diagnostic codes for hypertrophic cardiomyopathy (HCM), analyze misclassfications, and estimate the prevalence of HCM in an unselected Swedish regional cohort.Patients and methods: Using the hospitals’ electronic medical records (used for the Swedish National Patient Register), we identified 136 patients from 2006 to 2016 with the HCM-related codes 142.1 and 142.2 (International Classification of Diseases).Results: Of a total of 129 residents in the catchment area, 88 patients were correctly classified as HCM (positive predictive value 68.2%) and 41 patients (31.8%) were misclassified as HCM. Among the 88 HCM patients (52.2% males), 74 were alive and 14 were dead (15.9%). This yields an HCM prevalence of 74/183,337, that is, 4.0 diagnosed cases per 10,000 in the adult population aged ≥18 years. The underlying diagnoses of misclassified cases were mainly hypertension (31.7%) and aortic stenosis (22.0%). Other types of cardiomyopathies accounted for several cases of misclassification: dilated (nonischemic or ischemic), left ventricular noncompaction, and Takotsubo. Miscellaneous diagnoses were amyloidosis, pulmonary stenosis combined with ventricular septal defect, aortic insufficiency, athelete’s heart, and atrioventricular conduction abnormality. The mean age was not significantly different between HCM and misclassified patients (65.8±15.8 vs 70.1±13.4 years; P=0.177). There were 47.8% females among HCM and 60.8% females among misclassified (P=0.118).Conclusion: One-third of patients diagnosed as HCM are misclassified, so registry data should be interpreted with caution. A correct diagnosis is important for decision-making and implementation of optimal HCM care; efforts should be made to increase awareness of HCM and diagnostic competence throughout the health care system. Keywords: diagnostic error, diagnosis, epidemiology, hypertrophic cardiomyopathy, International Classification of Diseases, register
Published: 2017

49. The novel estuarine bioremediation target Gracilaria transtasmanica has high tolerance to light limitation, air-exposure and a broad range of salinities

Author: Ross, Bethany G., Magnusson, Marie, and Lawton, Rebecca J.
Published: 2024
Full Text: View/download PDF

50. Completion of Upper Secondary Mainstream School in Autistic Students in Sweden

Author: Stark, Isidora, Rast, Jessica E., Lundberg, Michael, Döring, Nora, Ohlis, Anna, Idring Nordström, Selma, Rai, Dheeraj, and Magnusson, Cecilia
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

6,585 results on '"Magnusson P"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources