1,430 results on '"Data Reuse"'
Search Results
2. A data reuse strategy based on deep learning for high dimensional data's pattern and instance similarity.
- Author
-
Wu, Feng, Lv, Hongwei, Fan, Tongrang, Zhao, Wenbin, and Wang, Jiaqi
- Subjects
- *
CONVOLUTIONAL neural networks , *TEXT mining , *DATA management , *DATA mapping , *DEEP learning - Abstract
Data reuse strategy is an effective method to save storage space and improve data utilization in data management. In view of the successful application of deep learning in the field of text mining, a data reuse strategy based on deep learning is proposed for high dimensional data's pattern and instance similarity. With traditional feature analysis and deep learning model of convolutional neural network, the pattern similarity of data dimension is analyzed so as to optimize the similar dimension pairs among high dimensional data sets. Combining inner-attention mechanism, a semantic similarity model IA-LSTM is designed for instance similarity, which can build the association mapping among data entities by the calculation of the similarity of short text. Based on the pattern and instance similarity in the proposed strategy, reusable data entities are discovered, and column storage is designed to improve data reuse efficiency. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. Barriers and facilitators to research data sharing: a lifecycle perspective.
- Author
-
He, Zilong and Fang, Wei
- Subjects
- *
PERCEIVED control (Psychology) , *TECHNOLOGY Acceptance Model , *STRUCTURAL equation modeling , *INFORMATION sharing , *OPEN scholarship - Abstract
Purpose: This study investigates the multifaceted barriers and facilitators affecting research data sharing across the research data lifecycle. It aims to broaden the understanding of data sharing beyond the publication phase, emphasizing the continuous nature of data sharing from generation to reuse. Design/methodology/approach: Employing a mixed-methods approach, the study integrates the Theory of Planned Behavior, the Technology Acceptance Model, and the Institutional Theory to hypothesize the influence of various factors on data sharing behaviors across the lifecycle. A questionnaire survey and structural equation modeling are utilized to empirically test these hypotheses. Findings: This study identifies critical factors influencing data sharing at different lifecycle stages, including perceived behavioral control, perceived effort, journal and funding agency pressures, subjective norms, perceived risks, resource availability, and perceived benefits. The findings highlight the complex interplay of these factors and their varying impacts at different stages of data sharing. Research limitations/implications: This study illuminates the dynamics of research data sharing, offering insights while recognizing its scope might not capture all disciplinary and cultural nuances. It highlights pathways for stakeholders to bolster data sharing, suggesting a collaborative push towards open science, reflecting on how strategic interventions can bridge existing gaps in practice. Practical implications: This study offers actionable recommendations for policymakers, journals, and institutions to foster a more conducive environment for data sharing, emphasizing the need for support mechanisms at various lifecycle stages. Originality/value: This study contributes to the literature by offering a comprehensive model of the research data lifecycle, providing empirical evidence on the factors influencing data sharing across this continuum. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. A Community Data Sharing Resource: The LDbase Data Repository.
- Author
-
Hart, Sara A., Schatschneider, Christopher, Reynolds, Tara, and Calvo, Favenzio
- Subjects
- *
MEDICAL information storage & retrieval systems , *DATABASE management , *INFORMATION storage & retrieval systems , *INFORMATION resources , *ELECTRONIC health records , *METADATA , *INFORMATION retrieval , *LEARNING disabilities , *ACCESS to information - Abstract
The purpose of this invited paper is to show the learning disabilities field what LDbase is, why it's important for the field, what it offers the field, and examples of how you can leverage LDbase in your own work. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. What Does It Mean to "Misuse" Research Data?
- Author
-
Pasquetto, Irene V, Thomer, Andrea, Acker, Amelia, Chtena, Natascha, and Desai, Meera
- Subjects
- *
DATA curation , *SCHOLARLY communication , *BRAINSTORMING , *DATA privacy , *INFORMATION sharing - Abstract
In this panel, we will discuss how "data misuse" is understood across different disciplines, and in particular digital curation, critical data studies, scholarly communication, and algorithmic fairness. The audience will be invited to contribute to the discussion by reporting on their own experience with data misuse, and brainstorming potential interventions to prevent misuse. Controversial reuses of open research data are emerging, including exploitation of marginalized communities, geo privacy violations, and perpetuation of harmful stereotypes. Incidents of data misuse hinder scientific progress and erode public trust, yet defining misuse remains challenging as one community's misuse might be another's best practice. The development of a shared framework to understand when, how, and why misuse of research data occurs can help science stakeholders decide when and how to release crucial research data, evaluate the potential for misuse, and tailor documentation of research data to prevent misuse. Our goal for this panel discussion is to take us a step closer to the development of such a theoretical framework for defining data misuse. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Primary Sources as Linked Data: Exploring Motives Across the Sciences and Social Sciences.
- Author
-
Marsh, Diana E., Fenlon, Katrina, Sorensen, Amanda H., and Wise, Nikki M.
- Subjects
- *
SOCIAL sciences , *LINKED data (Semantic Web) , *DATA integration , *ARCHIVES , *METADATA - Abstract
While long recognized in the humanities, there is growing recognition in the sciences and social sciences that primary sources—as diverse as manuscripts, photographs, cultural belongings, and specimens—hold vast data about scientific and human knowledge for use in scholarship, community research, and global knowledge. Yet, data embedded in these sources are largely disconnected from the systems of discovery, access, and structured data that support reuse and insights across globally dispersed repositories. In this paper, we share select findings of a systematic review to explore the use of primary sources, and the data embedded in them, via linked data across the sciences and social sciences. Our results confirm the use of a variety of primary source data across diverse disciplines, particularly those requiring longitudinal studies and data integration from diverse repositories and contexts. We highlight how linked data are understood to: connect collections to communities; support highly granular credit, attribution, and assessment of impact; and interrelate diverse sources of knowledge. While these results suggest the value of linked data for the specific research needs of anthropology, the effectiveness of linked data in achieving these objectives and the suitability of this approach for a diversity of institutions and communities need further study. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
7. Use It or Lose It: Facilitating the Use of Interactive Data Apps in Psychological Research Data Sharing
- Author
-
Franziska Usée, Christiane A. Melzig, and Dirk Ostwald
- Subjects
open research data ,data reuse ,interactive data apps ,innovative supplements ,dash ,gradio ,Psychology ,BF1-990 - Abstract
The value of open research data (ORD), a key feature of open science, lies in their reuse. However, the mere online availability of ORD does not guarantee their reuse by other researchers. Specifically, previous meta-scientific research has indicated that the underutilization of ORD is related to barriers at the level of the ORD themselves, potential reusers of ORD, and the broader academic ecosystem. At the same time, sharing large datasets in an understandable and transparent format that motivates researchers to explore these datasets remains a fundamental challenge. With the present work, we propose interactive data apps (IDAs) as innovative ORD supplements that provide a means to lower barriers of ORD reuse. We demonstrate the use of two open-source Python libraries (Dash, Gradio) for IDA development using two psychological research use cases. The first use case pertains to an experimental quantitative dataset acquired in a clinical psychology setting. The second use case concerns the familiarization with data analysis workflows that are characteristic of natural language processing (NLP). For both use cases, we provide easy-to-adapt Python code that can form the basis for IDA development in similar scenarios.
- Published
- 2024
- Full Text
- View/download PDF
8. Preparing multi-layered visualisations of Old Babylonian cuneiform tablets for a machine learning OCR training model towards automated sign recognition.
- Author
-
Hameeuw, Hendrik, De Graef, Katrien, Smidt, Gustav Ryberg, Goddeeris, Anne, Homburg, Timo, and Kumar Thirukokaranam Chandrasekar, Krishna
- Subjects
OPTICAL character recognition ,ARTIFICIAL intelligence ,TEXT recognition ,WEB-based user interfaces ,RECORDS management - Abstract
In the framework of the CUNE-IIIF-ORM project the aim is to train an Artificial Intelligence Optical Character Recognition (AI-OCR) model that can automatically locate and identify cuneiform signs on photorealistic representations of Old Babylonian texts (c. 2000–1600 B.C.E.). In order to train the model, c. 200 documentary clay tablets have been selected. They are manually annotated by specialist cuneiformists on a set of 12 still raster images generated from interactive Multi-Light Reflectance images. This image set includes visualisations with varying light angles and simplifications based on the dept information on the impressed signs in the surface. In the Cuneur Cuneiform Annotator, a Gitlab-based web application, the identified cuneiform signs are annotated with polygons and enriched with metadata. This methodology builds a qualitative annotated training corpus of approximately 20,000 cropped signs (i.e. 240,000 visualizations), all with their unicode codepoint and conventional sign name. It will act as a multi-layerd core dataset for the further development and fine-tuning of a machine learning OCR training model for the Old Babylonian cuneiform script. This paper discusses how the physical nature of handwritten inscribed Old Babylonian documentary clay tablets challenges the annotation and metadating task, and how these have been addressed within the CUNE-IIIF-ORM project to achieve an effective training corpus to support the training of a machine learning OCR model. Applied computing → Document management and text processing → Document capture → Optical character recognition; Applied computing → Arts and humanities → Language translation. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
9. Allocation of attention to metadata and retrieval functions: Implications for perceived value and open data discovery and reuse.
- Author
-
Wang, Ping, Xie, Yufei, Li, Xueyi, and Li, Qiao
- Subjects
FLOW theory (Psychology) ,MULTIPLE regression analysis ,INFORMATION retrieval ,METADATA ,RESEARCH personnel ,TECHNOLOGY Acceptance Model - Abstract
Metadata and retrieval functions play a vital role in aiding researchers in the discovery and reuse of open data. However, the diversity of metadata elements and retrieval functions poses a challenge to data searchers' limited attentional resources. This study aims to examine the allocation of attention to metadata elements and retrieval functions and its implications for perceived value and intentions to discover and reuse open data by drawing upon the attentional drift-diffusion model, flow theory, and perceived value literature. An experiment with 48 participants was conducted to explore the proposed relationships. Multiple linear regression analysis was performed to analyze the data. The results suggest that researchers' attention to high-value functions amplifies the perceived value and motivates data discovery intention. Attention to high-value metadata elements motivates data discovery and reuse intention. In contrast, attention to low-value metadata elements hampers the perceived value and inhibits data discovery and reuse intention. These findings put forward a new lens for exploring the attention mechanisms underlying perceived value, data discovery and reuse intention and highlight the important role of the value of metadata and retrieval functions in attention mechanisms. Additionally, this paper identifies the positive effect of perceived ease of use on users' intentions to find, evaluate, and access open data. Perceived usefulness positively affects users' intentions to evaluate open data. However, in contrast to perceived intentions to reuse open data assessed by self-reported measures, perceived value is not a salient motivator of open data reuse intention measured by behavioral indicators. These findings reveal the distinct effects of perceived value on perceived intention and intentional action in data reuse. With these insights, this study develops practical strategies to optimize the design of metadata and retrieval functions in data retrieval systems. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
10. Use It or Lose It: Facilitating the Use of Interactive Data Apps in Psychological Research Data Sharing.
- Author
-
Usée, Franziska, Melzig, Christiane A., and Ostwald, Dirk
- Subjects
NATURAL language processing ,OPEN scholarship ,CLINICAL psychology ,PSYCHOLOGICAL research ,PYTHON programming language - Abstract
The value of open research data (ORD), a key feature of open science, lies in their reuse. However, the mere online availability of ORD does not guarantee their reuse by other researchers. Specifically, previous meta-scientific research has indicated that the underutilization of ORD is related to barriers at the level of the ORD themselves, potential reusers of ORD, and the broader academic ecosystem. At the same time, sharing large datasets in an understandable and transparent format that motivates researchers to explore these datasets remains a fundamental challenge. With the present work, we propose interactive data apps (IDAs) as innovative ORD supplements that provide a means to lower barriers of ORD reuse. We demonstrate the use of two open-source Python libraries (Dash, Gradio) for IDA development using two psychological research use cases. The first use case pertains to an experimental quantitative dataset acquired in a clinical psychology setting. The second use case concerns the familiarization with data analysis workflows that are characteristic of natural language processing (NLP). For both use cases, we provide easy-to-adapt Python code that can form the basis for IDA development in similar scenarios. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
11. How do we measure the costs, benefits, and harms of sharing data from biomedical studies? A protocol for a scoping review [version 2; peer review: 2 approved]
- Author
-
Lauren Maxwell, Ankur Krishnan, and Priya Shreedhar
- Subjects
Scoping review ,data reuse ,research translation ,research impact ,ethics review committee ,biomedical research ,eng ,Science ,Social Sciences - Abstract
Introduction The benefits of sharing participant-level data, including clinical or epidemiological data, genomic data, high-dimensional imaging data, or human-derived samples, from biomedical studies have been widely touted and may be taken for granted. As investments in data sharing and reuse efforts continue to grow, understanding the cost and positive and negative effects of data sharing for research participants, the general public, individual researchers, research and development, clinical practice, and public health is of growing importance. In this scoping review, we will identify and summarize existing evidence on the positive and negative impacts and costs of data sharing and how they are measured. Methods and analysis Eligible studies will report on qualitative or quantitative approaches for measuring the cost of data sharing or its impact on participant privacy, individual or public health, researcher’s careers, clinical or public health practice, or research or development. The systematic search strategy uses MeSH and text terms and is tailored for Ovid Medline, Cumulative Index to Nursing and Allied Health Literature, and Web of Science. We will apply the Arskey and O’Malley scoping review methodology. We selected a scoping rather than a systematic review approach to address multiple related questions and provide guidance related to an emerging field. Two reviewers will conduct the title-abstract and full-text screening and data charting independently. Discrepancies will be resolved through consensus and results will be summarized in a narrative form. Conclusion Research participants, investigators, regulatory groups, ethics review committees, data protection officers, and funders cannot make informed decisions or policies about data reuse without appropriate means of measuring the effects, positive or negative, and cost of data sharing.
- Published
- 2025
- Full Text
- View/download PDF
12. Gaining Insight: Data Needs for Firearm Injury Prevention Researchers.
- Author
-
Million, A. J., Gidakovic, Sanja, and Bossaller, Jenny
- Subjects
- *
GUNSHOT wounds , *SHOOTINGS (Crime) , *CRIMINAL justice system , *CONVICTION rates , *PUBLIC health - Abstract
Between 1996 and 2018, there was little federally funded research on gun violence, even though it remains a perennial issue, resulting in considerable loss of life. Advocacy and academic groups piece together data to conduct their research, but the U.S. firearms data infrastructure is limited. A current National Institute of Health (NIH)‐funded coordinating center led by researchers at the University of Michigan aims to help confront that problem. This Quality Improvement (QI) study seeks to uncover the information needs of researchers in firearm injury prevention and community‐based interventions through interviews with public health, medical, and criminal justice researchers. Interviews reveal problems associated with existing databases and infrastructures, as well as what researchers need to support their research. We find fragmented data resources and a lag in public health data availability create obstacles to obtaining many datasets. The researchers also describe problems and opportunities for data harmonization across intervention‐focused studies. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
13. The Human‐Data Interaction Driven by Data Reuse.
- Author
-
Duan, Qingyu, Liang, Mengli, and Wang, Xiaoguang
- Subjects
- *
SCIENTIFIC community , *HUMAN-computer interaction , *INTERPERSONAL communication , *DATA analysis , *ERGONOMICS - Abstract
The data‐intensive research paradigm is sweeping through the scientific community, and new interaction challenges for interacting with data have emerged. Data reuse emerges as a critical driver of human‐data interaction, positioning data repositories at the forefront as the optimal solution for facilitating this process. This study adopts a diary study methodology to analyze the data repository‐supported human‐data interaction behaviors driven. The findings reveal that human‐data interaction extends beyond mere engagement with data, encapsulating elements of human‐computer interaction, engagement with literature, interactions with agent systems, and interpersonal communication. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
14. Prolonged increase in psychotropic drug use among young women following the COVID-19 pandemic: a French nationwide retrospective study
- Author
-
Antoine Lamer, Chloé Saint-Dizier, Mathieu Levaillant, Jean-François Hamel-Broza, Eiya Ayed, Emmanuel Chazard, Maxime Bubrovszky, Fabien D’Hondt, Michael Génin, and Mathilde Horn
- Subjects
COVID-19 ,Mental health ,Psychiatry ,Pharmacoepidemiology ,Psychotropic drugs ,Data reuse ,Medicine - Abstract
Abstract Background The COVID-19 pandemic has had a significant impact on mental health, with evidence suggesting an enduring mental health crisis. Studies worldwide observed increased usage of antidepressants, anxiolytics, and hypnotics during the pandemic, notably among young people and women. However, few studies tracked consumption post-2021. Our study aimed to fill this gap by investigating whether the surge in the number psychotropic drug consumers in France persisted 2 years after the first lockdown, particularly focusing on age and gender differences. Methods We conducted a national retrospective observational study based on the French national insurance database. We retrieved all prescriptions of anxiolytics, hypnotics, and antidepressants dispensed in pharmacies in France for the period 2015–2022. We performed interrupted time series analyses based on Poisson models for five age classes (12–18; 19–25; 26–50; 51–75; 76 and more) to assess the trend before lockdown, the gap induced and the change in trend after. Results In the overall population, the number of consumers remained constant for antidepressants while it decreased for anxiolytics and hypnotics. Despite this global trend, a long-term increase was observed in the 12–18 and 19–25 groups for the three drug classes. Moreover, for these age classes, the increases were more pronounced for women than men, except for hypnotics where the trends were similar. Conclusions The number of people using antidepressants continues to increase more than 2 years after the first lockdown, showing a prolonged effect on mental health. This effect is particularly striking among adolescents and young adults confirming the devastating long-term impact of the pandemic on their mental health.
- Published
- 2024
- Full Text
- View/download PDF
15. Improving the Usability of Archaeological Data through Written Guidelines.
- Author
-
Austin, Anne, Faniel, Ixchel M., Brannon, Brittany, and Kansa, Sarah Whitcher
- Subjects
- *
DATA management , *ARCHAEOLOGICAL excavations , *DATA quality , *ARCHAEOLOGISTS , *ACQUISITION of data - Abstract
Archaeologists frequently use written guidelines such as site manuals, recording forms, and digital prompts during excavations to create usable data within and across projects. Most written guidelines emphasize creating either standardized datasets or narrative summaries; however, previous research has demonstrated that the resulting datasets are often difficult to (re)use. Our study analyzed observations and interviews conducted with four archaeological excavation teams, as well as interviews with archaeological data reusers, to evaluate how archaeologists use and implement written guidelines. These excavation team and reuser experiences suggest that archaeologists need more specific best practices to create and implement written guidelines that improve the quality and usability of archaeological data. We present recommendations to improve written guidelines that focus on a project's methods, end-of-season documentation, and naming practices. We also present a Written Guidelines Checklist to help project directors improve their written guidelines before, during, and after fieldwork as part of a collaborative process. Ideally, these best practices for written guidelines will make it easier for team members and future reusers to incorporate their own and others' archaeological data into their research. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
16. Variations in Dispensing Psychotropic Drugs to Adolescents Depending on School Periods: A French Nationwide Retrospective Study.
- Author
-
SAINT-DIZIER, Chloé, BETREMIEUX, Julian, CHAZARD, Emmanuel, BUBROVSZKY, Maxime, and LAMER, Antoine
- Abstract
Seasonality patterns are reported for various psychiatric disorders. Concerning adolescents, there is an increased frequency of general emergency department visits for mental health disorders observed between March and May, as well as in October and November. We conducted a retrospective cohort study using the French health insurance medico-administrative database. We extracted psychotropic drug deliveries occurring between 2015 and 2019 for patients aged between 12 and 18 years old. Each drug delivery was classified as occurring during a school period (Sc), the summer holidays (SumH) or other shorter holidays periods (ShH). We compared the number of distinct patients, as well as the proportion of new consumers, according to week status. Anxiolytics and hypnotics were more frequently dispensed during the school periods and short breaks than during the summer holidays. Conversely, antidepressants were more commonly dispensed during the short breaks rather than school periods and summer holidays. The stressful effects induced by schooling appear to be addressed in the first line by anxiolytics and hypnotics, while antidepressants are more frequently introduced during school holidays. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
17. Reuse of Adverse Effect Reports from the French National Agency of Medicines: A Visual Analytic Tool to Improve Patient Safety.
- Author
-
SAINT-DIZIER, Chloé, DUFRENOIS, Florian, DAVID, Emeric, BUBROVSZKY, Maxime, and LAMER, Antoine
- Abstract
Adverse drug reaction are defined as "harmful or unpleasant reaction, resulting from an intervention related to the use of a medicinal product". In France, adverse effects due to medicines are reported to the French National Agency of Medicines (ANSM) by the healthcare professionals or consumers. The objective of this study was to implement a tool that facilitates the utilization of ANSM reports by synthesizing information to effectively inform prescribers and users. We focused on 3 psychotropic classes: antidepressants, antipsychotics and anxiolytics. We extracted relevant data from the ANSM website through a webscraping process, based on the names of molecules in these 3 classes: antidepressants, antipsychotics, and anxiolytics. We implemented a web interface with R Shiny that provides three panels: (i) a presentation of the active ingredient with the fewest reports for a selected adverse effect category, (ii) the adverse reactions for a selected active ingredient ranked in descending order, and (iii) a comparison of two active ingredients where, for each adverse effect, the active ingredient with the fewest reported adverse drug events (ADEs) is displayed. Our application allows for synthesizing information to effectively inform prescribers and users. In the ANSM existing interface, molecules can only be viewed one by one, and the ratio needs to be calculated manually, making it difficult to compare molecules. It is important to note that this is not a prescription assistance device but rather for informational purposes. In the future, the application may be expanded to include other categories of molecules. Finally, the indicators provided by our tool could be compared to those from other pharmacovigilance databases. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
18. Sustainability in Secondary Use of Health Data - A Scoping Review.
- Author
-
KILGUS, Tim, NOWAK, Alessia, GERSCH, Martin, and FÜRSTENAU, Daniel
- Abstract
This scoping review investigates sustainability in the reuse of health data on a technological, intra-organizational, inter-organizational, and regulatory level. Thereby, it focuses on the evolutionary, relational, and durational perspective of sustainability. The study highlights various challenges in achieving data sustainability, from regulatory norms such as FAIR principles towards data governance processes and responsibilities in organizations that facilitate data sharing. By highlighting the need for economic sustainability of health data sharing platforms and adapted principles for data sharing, this study aims to analyze current practices that aim for sustainability in the secondary use of health data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
19. Public Availability of Data Management and Analysis Scripts of Studies Conducted on Open-Access Intensive Care Databases.
- Author
-
ANDRIES, Coline, QUINDROIT, Paul, POPOFF, Benjamin, OUDARROUR, Ikram, and LAMER, Antoine
- Abstract
Intensive care units (ICUs) provide care for critical patients at high risk of morbidity and mortality, and require continuous monitoring of clinical, biological and, imaging parameters. Collaborative ventures have enabled the emergence of large open access databases for the secondary use of Electronic Health Records (EHRs). The objective of this work was to evaluate the availability of scripts and datasets in publications based on ICU open-access databases. We included 910 original articles based on four ICU open-access databases (Amsterdam University Medical Centers Database, eICU Collaborative Research Database, High time resolution ICU dataset, and Medical Information Mart for Intensive Care). The majority of the studies did not provide their data management scripts (n=839, 92.9%), neither the analysis script (n=843, 93.4%) in the article. Attempts to contact the 845 corresponding authors in question resulted in 89.11% (n=753) of our e-mail requests going unanswered over a two-month period. We received 51 automated messages (55.43%) indicating that emails have not been delivered, while 6 messages (6.52%) redirected to alternative email addresses. Only 20 corresponding authors (18.18%) answered, finally providing the requested materials. Despite scientific journals recommendations to share materials, our study unveils the absence of crucial components for the replication of studies by other research teams. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
20. Data Sharing Is Caring: Crisis-Induced Realisation of Open Access Policy in a PhD Project on Food Practices
- Author
-
Ciren, Baizhen, Fleer, Marilyn, Series Editor, Hedegaard, Mariane, Series Editor, Veresov, Nikolai, Series Editor, González Rey, Fernando, Founding Editor, Fragkiadaki, Glykeria, editor, Ødegaard, Elin Eriksen, editor, Rai, Prabhat, editor, and Sadownik, Alicja R., editor
- Published
- 2024
- Full Text
- View/download PDF
21. Research data management practices of faculty members in Ghanaian universities
- Author
-
Arthur, Beatrice and van der Walt, Thomas
- Published
- 2024
- Full Text
- View/download PDF
22. Identifying metadata commonalities across restricted health data sources: A mixed methods study exploring how to improve the discovery of and access to restricted datasets
- Author
-
Ambery Leahey, Grant Gibson, Julie Shi, Kelly Stathis, Kevin B. Read, Lynn Peterson, Sarah Rutley, and Victoria Smith
- Subjects
restricted data ,data discovery ,data access ,metadata ,data sharing ,data reuse ,Bibliography. Library science. Information resources - Abstract
Background: While open datasets are adopting FAIR principles to improve their discovery and use, restricted data—those only accessible via request or application—have fallen behind. Metadata is not an inherent characteristic of restricted data, which limits its ability to be found and used. To better understand discoverability and accessibility of restricted data, this study reviewed restricted health data sources to determine how they describe their datasets and access procedures, what descriptive commonalities exist across data sources, and to what extent the commonalities we found can be accommodated within existing metadata schemas. Methods: This study extracted dataset and access information provided by a sample of 48 restricted data sources, identified commonalities across these data sources to develop possible metadata elements for restricted data, and mapped these metadata elements to existing metadata schemas (e.g., DataCite) to evaluate how well they accommodate information supplied by restricted data sources. Results: Restricted data sources describe their datasets (35 commonalities) and access procedures (27 commonalities) in similar ways. Dataset descriptions aligned with existing metadata schemas, with the DDI-Lifecycle and -Codebook schemas receiving 91.4% and 85.7% exact matches respectively with the dataset elements we identified. Access procedures did not align with metadata available in existing schemas. Discussion: While descriptive dataset metadata for restricted data sources will make their data more findable, the accessibility of these datasets could be significantly improved by structured metadata capturing data access information. Presently, metadata schemas do not accommodate the level of detail restricted data sources provide about access procedures and requirements.
- Published
- 2024
- Full Text
- View/download PDF
23. Why don't we share data and code? Perceived barriers and benefits to public archiving practices
- Author
-
Gomes, Dylan GE, Pottier, Patrice, Crystal-Ornelas, Robert, Hudgins, Emma J, Foroughirad, Vivienne, Sánchez-Reyes, Luna L, Turba, Rachel, Martinez, Paula Andrea, Moreau, David, Bertram, Michael G, Smout, Cooper A, and Gaynor, Kaitlyn M
- Subjects
Agricultural ,Veterinary and Food Sciences ,Biological Sciences ,Environmental Sciences ,Generic health relevance ,Biological Science Disciplines ,Motivation ,Information Dissemination ,code reuse ,data reuse ,data science ,open science ,reproducibility ,transparency ,Agricultural and Veterinary Sciences ,Medical and Health Sciences ,Agricultural ,veterinary and food sciences ,Biological sciences ,Environmental sciences - Abstract
The biological sciences community is increasingly recognizing the value of open, reproducible and transparent research practices for science and society at large. Despite this recognition, many researchers fail to share their data and code publicly. This pattern may arise from knowledge barriers about how to archive data and code, concerns about its reuse, and misaligned career incentives. Here, we define, categorize and discuss barriers to data and code sharing that are relevant to many research fields. We explore how real and perceived barriers might be overcome or reframed in the light of the benefits relative to costs. By elucidating these barriers and the contexts in which they arise, we can take steps to mitigate them and align our actions with the goals of open science, both as individual scientists and as a scientific community.
- Published
- 2022
24. The Dataset Finder: A Tool Utilizing Data Management Plans as a Key to Data Discoverability
- Author
-
Soo-Yon Kim, Steffen Hillemacher, Max Kocher, Bernhard Rumpe, and Sandra Geisler
- Subjects
data management plans ,data findability ,data discoverability ,data sharing ,data reuse ,research data management ,Science (General) ,Q1-390 - Abstract
In the past years, there has been an increased interest in sharing and reusing research data. While the importance of sharing data is urgent for enabling collaboration, many research projects are currently struggling with setting up a strategy and the right infrastructure for enabling such data-driven collaboration among the project’s researchers. Through an analysis of the Cluster of Excellence project Internet of Production as a use case, we have found that to enable researchers to share and find research data, a suitable platform is needed, as well as processes that smoothly blend into existing research data management practices. We argue that leveraging data management plans from a medium of documentation to a dynamic knowledge source enhances overview and discoverability of data, while integrating easily into day-to-day workflows of researchers. We present a tool, the Dataset Finder, which is built on the basis of data management plans, and allows users to intuitively query available datasets. The current functionalities of the tools are discussed, results of a preliminary evaluation, as well as potential future features.
- Published
- 2024
- Full Text
- View/download PDF
25. Lessons learned from the development of a national registry on dementia care and support based on linked national health and administrative data.
- Author
-
van der Heide, Iris, Francke, Anneke L., Döpp, Carola, Heins, Marianne, van Hout, Hein P. J., Verheij, Robert A., and Joling, Karlijn J.
- Subjects
- *
DEMENTIA , *MEDICAL personnel , *SECONDARY care (Medicine) , *LONG-term health care , *PRIMARY care - Abstract
Introduction: This paper provides insight into the development of the Dutch Dementia Care and Support Registry and the lessons that can be learned from it. The aim of this Registry was to contribute to quality improvement in dementia care and support. Methods: This paper describes how the Registry was set up in four stages, reflecting the four FAIR principles: the selection of data sources (Findability); obtaining access to the selected data sources (Accessibility); data linkage (Interoperability); and the reuse of data (Reusability). Results: The linkage of 16 different data sources, including national routine health and administrative data appeared to be technically and legally feasible. The linked data in the Registry offers rich information about (the use of) care for persons with dementia across various healthcare settings, including but not limited to primary care, secondary care, long-term care and medication use, that cannot be obtained from single data sources. Conclusions: A key lesson learned is that in order to reuse the data for quality improvement in practice, it is essential to involve healthcare professionals in setting up the Registry and to guide them in the interpretation of the data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
26. Attitudes on data reuse among internal medicine residents.
- Author
-
LaPolla, Fred Willie Zametkin, Milliken, Genevieve, and Gillespie, Colleen
- Subjects
- *
SCALE analysis (Psychology) , *SECONDARY analysis , *KRUSKAL-Wallis Test , *PILOT projects , *HOSPITAL medical staff , *COMMUNICATION , *INTERNAL medicine , *STUDENT attitudes , *COMPARATIVE studies , *DATA analysis software - Abstract
Background: NYU Langone Health offers a collaborative research block for PGY3 Primary Care residents that employs a secondary data analysis methodology. As discussions of data reuse and secondary data analysis have grown in the data library literature, we sought to understand what attitudes internal medicine residents at a large urban academic medical center had around secondary data analysis. This case report describes a novel survey on resident attitudes around data sharing. Methods: We surveyed internal medicine residents in three tracks: Primary Care (PC), Categorical, and Clinician- Investigator (CI) tracks as part of a larger pilot study on implementation of a research block. All three tracks are in our institution's internal medicine program. In discussions with residency directors and the chief resident, the term "secondary data analysis" was chosen over "data reuse" due to this being more familiar to clinicians, but examples were given to define the concept. Results: We surveyed a population of 162 residents, and 67 residents responded, representing a 41.36% response rate. Strong majorities of residents exhibited positive views of secondary data analysis. Moreover, in our sample, those with exposure to secondary data analysis research opined that secondary data analysis takes less time and is less difficult to conduct compared to the other residents without curricular exposure to secondary analysis. Discussion: The survey reflects that residents believe secondary data analysis is worthwhile and this highlights opportunities for data librarians. As current residents matriculate into professional roles as clinicians, educators, and researchers, libraries have an opportunity to bolster support for data curation and education. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
27. Design of a Convolutional Neural Network Accelerator Based on On-Chip Data Reordering.
- Author
-
Liu, Yang, Zhang, Yiheng, Hao, Xiaoran, Chen, Lan, Ni, Mao, Chen, Ming, and Chen, Rong
- Subjects
CONVOLUTIONAL neural networks ,DYNAMIC random access memory ,COMPUTER vision ,VISUAL fields - Abstract
Convolutional neural networks have been widely applied in the field of computer vision. In convolutional neural networks, convolution operations account for more than 90% of the total computational workload. The current mainstream approach to achieving high energy-efficient convolution operations is through dedicated hardware accelerators. Convolution operations involve a significant amount of weights and input feature data. Due to limited on-chip cache space in accelerators, there is a significant amount of off-chip DRAM memory access involved in the computation process. The latency of DRAM access is 20 times higher than that of SRAM, and the energy consumption of DRAM access is 100 times higher than that of multiply–accumulate (MAC) units. It is evident that the "memory wall" and "power wall" issues in neural network computation remain challenging. This paper presents the design of a hardware accelerator for convolutional neural networks. It employs a dataflow optimization strategy based on on-chip data reordering. This strategy improves on-chip data utilization and reduces the frequency of data exchanges between on-chip cache and off-chip DRAM. The experimental results indicate that compared to the accelerator without this strategy, it can reduce data exchange frequency by up to 82.9%. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
28. Data preservation practices for enhancing agricultural research data usage among agricultural researchers in Tanzania.
- Author
-
Mwinami, Nolasko Victory, Dulle, Frankwell W., and Mtega, Wulystan Pius
- Subjects
RESEARCH personnel ,AGRICULTURAL research ,AGRICULTURE ,SERVER farms (Computer network management) ,BANKING industry - Abstract
The objective of this study was to investigate the role of research data preservation for enhanced data usage among agricultural researchers in Tanzania. Specifically, the study aimed to examine the data preservation methods used by agriculture researchers, find out how long agriculture researchers preserve their agriculture research data, and determine factors that influence agriculture researchers on their choice of data preservation methods for use. The study employed a cross-sectional research design. The study employed both qualitative and quantitative approaches. A survey was conducted to collect data in 11 research institutions. A simple random sampling technique was used to select 204 respondents from the study area while purposive sampling techniques were used to select 11 agriculture research institutions including 10 Tanzanian Agricultural Research Institution (TARI) centers, and Sokoine University of Agriculture (SUA). Also, 12 respondents were selected purposively for an in-depth interview as key informants. The study adopted Data Curation Centre (DCC) Lifecycle Model to explain data preservation process. Findings indicated that a majority of more than 90% of researchers preferred to preserve their data using different storage devices such as field notebooks, computers, and institutional libraries. Moreover, findings indicated that about 74% of agricultural researchers preferred to preserve their data for more than 6 years after the end of the project. Findings also indicated factors that influence researchers in the choice of data preservation methods to be easy to reach, cost-effective storage devices, support to use the devices, adequate infrastructure for data preservation, and reliable power supply. It can be concluded that there is yet a great role of research data preservation in enhancing data usage among researchers in Tanzania. It is recommended that the government should establish an agricultural research data bank to guarantee permanent availability of data at all times when needed. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
29. Data reusability for migration research: a use case from SoDaNet data repository.
- Author
-
Kondyli, Dimitra, Nisiotis, Constantinos-Symeon, and Klironomos, Nicolas
- Subjects
DATA libraries ,HUMANITARIAN assistance ,VALUE chains ,OPEN scholarship ,CASE-based reasoning - Abstract
This study highlights the role of Research Data Repositories (RDRs) in the concept of data reuse by examining a use case on migration research, a domain that requires up-to-date and accurate data for research and policy purposes. The survey from which the data for the use case was derived aimed to investigate the alignment of humanitarian assistance and social protection in Greece during the post-2015 refugee crisis. Through our analysis, we try to formulate a new corpus of variables and information that can create a value chain for research and policy purposes related to migration research, as well as to draw useful conclusions from this use case study in relation to the concept of data reuse. We address several issues related to data reuse, such as its definition, the role of research data repositories and research infrastructures in data reuse, as well as the limitations and advantages of reuse. We also present some specific features of the SoDaNet RDR, which hosts the primary data. We argue that comprehensive documentation of data adds value to the data and, through reuse, this value can be recycled to the RDR and, therefore, to potential new reusers. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
30. The European Digital Economy
- Author
-
Lubacha, Judyta, Mäihäniemi, Beata, and Wisła, Rafał
- Subjects
Data Extractivism ,Data Reuse ,Digitalisation ,Digital Economy and Society ,Digital Skills ,Digital transition ,bic Book Industry Communication::K Economics, finance, business & management::KC Economics::KCL International economics - Abstract
The “digital economy” is a conceptual umbrella referring to markets, organizations and their networks that are based on digital technologies, communication, data processing and e-commerce. It is multidimensional and its dynamic structure must be analysed from various dimensions, such as economic – changes in the nature of resources, production factors and economic processes; technological – technological progress viewed from a macroeconomic perspective vs. technological innovation viewed from a microeconomic perspective; regulatory – challenges facing regulators, new risks affecting the institutional order; and sociological – changes in society’s functioning principles, attitudes towards work and human relations. The purpose of this book is to analyse the effectiveness of digital technologies as well as the fundamental factors that contribute to technological progress in the long run. It also examines structural and qualitative shifts in economies and societies. It investigates many research questions, such as the gap between the level of digital economic development in European Union countries; digital transformation and its impact on workplace skills development patterns; and also the legal framework for data as resource. The book approaches these issues from a multidisciplinary perspective, from law to economics and sociology. It focuses on definitional discussions, the measurement challenges, drivers for digital transition, the impact on labour relations, digital skills and education, data reuse and data extractivism. This is a comprehensive introduction to the different contexts from which the digital economy can be addressed, offering an innovative method for studying this complex phenomenon, and as such, it will be a valuable resource for students, scholars and researchers across a range of disciplines.
- Published
- 2024
- Full Text
- View/download PDF
31. Data sharing and reuse practices: disciplinary differences and improvements needed
- Author
-
Khan, Nushrat, Thelwall, Mike, and Kousha, Kayvan
- Published
- 2023
- Full Text
- View/download PDF
32. Revisiting the impact of Liverpool as ECoC 2008: the lost opportunity to reconcile cultural policy and evaluation
- Author
-
Crone, Stephen and Ganga, Rafaela
- Published
- 2023
- Full Text
- View/download PDF
33. Supporting Data Discovery: Comparing Perspectives of Support Specialists and Researchers
- Author
-
Guangyuan Sun, Tanja Friedrich, Kathleen Gregory, and Brigitte Mathiak
- Subjects
research data management ,data discovery ,research data services ,data reuse ,Science (General) ,Q1-390 - Abstract
Purpose: Much of the research in data discovery is centered on the users’ viewpoint, frequently overlooking the perspective of those who develop and maintain the discovery infrastructure. Our goal is to conduct a comparative study on research data discovery, examining both support specialists’ and researchers’ views by merging new analysis with prior research insights. Methods: This work summarizes the studies the authors have conducted over the last seven years investigating the data discovery practices of support specialists from different disciplines. Although support specialists were not the main target of some of these studies, data about their perspectives was collected. Our corpus comprises in-depth interviews with 6 social science support specialists, interviews with 19 researchers and 3 support specialists from multiple disciplines, a global survey with 1630 researchers and 47 support specialists, and a use case analysis of 25 support specialists. In the analysis section, we juxtapose the fresh insights on support specialists’ views with the already documented perspectives of researchers for a holistic understanding. The latter is primarily discussed in the literature review, with references made in the analysis section to draw comparisons. Results: We found that support specialists’ views on data discovery are not entirely different from those of the researchers. There are, however, some differences that we have identified, most notably the interconnection of data discovery with general web search, literature search, and social networks. Conclusion: We conclude by proposing recommendations for different types of support work to better support researchers’ data discovery practices.
- Published
- 2024
- Full Text
- View/download PDF
34. Promoting FAIR Data Through Community-driven Agile Design: the Open Data Commons for Spinal Cord Injury (odc-sci.org).
- Author
-
Torres-Espín, Abel, Almeida, Carlos A, Chou, Austin, Huie, J Russell, Chiu, Michael, Vavrek, Romana, Sacramento, Jeff, Orr, Michael B, Gensel, John C, Grethe, Jeffery S, Martone, Maryann E, Fouad, Karim, Ferguson, Adam R, and STREET-FAIR Workshop Participants
- Subjects
STREET-FAIR Workshop Participants ,Humans ,Spinal Cord Injuries ,Reproducibility of Results ,Information Dissemination ,Ecosystem ,Biomedical Research ,Data sharing ,FAIR ,community repository ,data reuse ,neurotrauma ,spinal cord injury ,Spinal Cord Injury ,Neurodegenerative ,Traumatic Head and Spine Injury ,Networking and Information Technology R&D (NITRD) ,Physical Injury - Accidents and Adverse Effects ,Neurosciences ,Generic health relevance ,Biochemistry and Cell Biology ,Neurology & Neurosurgery - Abstract
The past decade has seen accelerating movement from data protectionism in publishing toward open data sharing to improve reproducibility and translation of biomedical research. Developing data sharing infrastructures to meet these new demands remains a challenge. One model for data sharing involves simply attaching data, irrespective of its type, to publisher websites or general use repositories. However, some argue this creates a 'data dump' that does not promote the goals of making data Findable, Accessible, Interoperable and Reusable (FAIR). Specialized data sharing communities offer an alternative model where data are curated by domain experts to make it both open and FAIR. We report on our experiences developing one such data-sharing ecosystem focusing on 'long-tail' preclinical data, the Open Data Commons for Spinal Cord Injury (odc-sci.org). ODC-SCI was developed with community-based agile design requirements directly pulled from a series of workshops with multiple stakeholders (researchers, consumers, non-profit funders, governmental agencies, journals, and industry members). ODC-SCI focuses on heterogeneous tabular data collected by preclinical researchers including bio-behaviour, histopathology findings and molecular endpoints. This has led to an example of a specialized neurocommons that is well-embraced by the community it aims to serve. In the present paper, we provide a review of the community-based design template and describe the adoption by the community including a high-level review of current data assets, publicly released datasets, and web analytics. Although odc-sci.org is in its late beta stage of development, it represents a successful example of a specialized data commons that may serve as a model for other fields.
- Published
- 2022
35. Issues and paths forward in the identification and reuse of historic analog records
- Author
-
Bethany G. Anderson, Erin Antognoli, Sandi L. Caldrone, Justin D. Derner, Shannon L. Farrell, Katrina Fenlon, John R. Hendrickson, Lois G. Hendrickson, Holly A. Johnson, Nicole E. Kaplan, Julia A. Kelly, Kristen L. Mastel, and Sarah C. Williams
- Subjects
historic data ,analog data ,data reuse ,preservation ,discoverability ,FAIR data ,Environmental sciences ,GE1-350 - Abstract
Introduction: Historic data, often in analog format, is a valuable resource for assessing effects of directional changes in climate and climatic variability. However, historic data can be difficult to locate, interpret, and reformat into a useful state.Methods: Teams of scientists, librarians, archivists, and data managers at four US institutions have undertaken various projects to gather, describe, and in some cases, transform historic data. They have also surveyed researchers who either possess historic data or have used it in their work.Results: Historic data projects involved locating data, writing data descriptions, and connecting with individuals who had knowledge about the data’s collection. The surveys and interviews found that researchers valued historic data and were worried that it was at risk of loss. They noted the lack of best practices.Discussion: Each project attempting to rescue or enhance access to historic data has a unique path but being guided by FAIR principles should be at the core whether or not the end result is machine-readable data. Working with a team incorporating librarians, archivists, and data managers can aid individual researchers’ in producing accessible, and reusable datasets. There is much work to be done in raising awareness about the value of historic data but motivating factors for doing so include its usefulness in environmental research and other disciplines and its risk of loss as researchers retire and are unsure of how to save historic data, both in analog and electronic formats.
- Published
- 2024
- Full Text
- View/download PDF
36. Editorial: Navigating the landscape of FAIR data sharing and reuse: repositories, standards, and resources
- Author
-
Maaike M. H. van Swieten and Christian Haselgrove
- Subjects
FAIR principles ,neuroinformatics ,data sharing ,data reuse ,repositories ,standards ,Neurosciences. Biological psychiatry. Neuropsychiatry ,RC321-571 - Published
- 2024
- Full Text
- View/download PDF
37. Curating for Contrarian Communities: Data Practices of Anthropogenic Climate Change Skeptics.
- Author
-
Wofford, Morgan F. and Thomer, Andrea K.
- Subjects
- *
CLIMATE change , *DIGITAL technology , *ARTIFICIAL intelligence , *INFORMATION sharing , *INFORMATION policy , *INFORMATION technology - Abstract
The open data movement is often touted as a sweeping strategy to democratize science, promote diverse data reuse, facilitate reproducibility, accelerate innovation, and much more. However, the potential perils of open data are seldom examined and discussed in equal measure to these promises. As we continue to invest in open data, we need to study the full spectrum of what open data facilitates in practice, which can then inform future policy and design decisions. This paper aims to address this gap by presenting an investigative digital ethnography of one contrarian community, anthropogenic climate change (ACC) skeptics, to describe how they process, analyze, preserve, and share data. Skeptics often engage in data reuse similar to conventional data reusers, albeit for unconventional purposes and with varying degrees of trust and expertise. The data practices of ACC skeptics challenge the assumption that open data is universally beneficial. These findings carry implications for data repositories and how they might curate data and design databases with this type of reuse in mind. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
38. DataChat: Prototyping a Conversational Agent for Dataset Search and Visualization.
- Author
-
Fan, Lizhou, Lafia, Sara, Li, Lingyao, Yang, Fangyuan, and Hemphill, Libby
- Subjects
- *
ARTIFICIAL intelligence , *INFORMATION policy , *DIGITAL technology , *INFORMATION technology , *INFORMATION sharing - Abstract
Data users need relevant context and research expertise to effectively search for and identify relevant datasets. Leading data providers, such as the Inter‐university Consortium for Political and Social Research (ICPSR), offer standardized metadata and search tools to support data search. Metadata standards emphasize the machine‐readability of data and its documentation. There are opportunities to enhance dataset search by improving users' ability to learn about, and make sense of, information about data. Prior research has shown that context and expertise are two main barriers users face in effectively searching for, evaluating, and deciding whether to reuse data. In this paper, we propose a novel chatbot‐based search system, DataChat, that leverages a graph database and a large language model to provide novel ways for users to interact with and search for research data. DataChat complements data archives' and institutional repositories' ongoing efforts to curate, preserve, and share research data for reuse by making it easier for users to explore and learn about available research data. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
39. "Garbage Bags Full of Files": Exploring Sociotechnical Perceptions of Formats within the Recovery and Reuse of Scientific Data.
- Author
-
Wagner, Travis L., Fenlon, Katrina, and Sorensen, Amanda
- Subjects
- *
TRASH bags , *DIGITAL technology , *INFORMATION sharing , *INFORMATION policy , *INFORMATION technology , *ARTIFICIAL intelligence , *COVID-19 pandemic - Abstract
This paper explores the social and technical perceptions of physical and digital formats as they relate to work in the recovery and reuse of scientific data, specifically historical, archival, and defunct data sources. Proprietary and obsolete formats, or formats that need significant transformation work, stand out as central challenges for scientists and data curators who are recovering reusable data from archival or legacy data sources. The challenges confronting data sharing and reuse of contemporary scientific data are already known to be myriad; formats often pose a major, compounding challenge to retrospective data curation research and practice. Based on 23 qualitative interviews with practitioners conducting data recovery and reuse, ranging from marine biologists to data librarians, we study how they understand, engage with, and utilize formats within their data curation work. This paper enumerates the formats deployed throughout the scientific data curation process and explores how practitioners creating and curating scientific data based on historical and archival materials encounter, make sense of, and utilize formats. The paper focuses on practitioner perceptions of formats around the following themes: how practitioners' historical relationships to certain challenging formats inform their ongoing curation practices; the importance of contexts in prioritizing or ignoring formats within scientific curation work; and how formats reveal larger sociotechnical issues. The paper concludes by with practical and theoretical implications of navigating formats within the recovery and reuse of scientific data and offers suggestions for reconfiguring formats within broader data curation lifecycles. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
40. Data reusability for migration research: a use case from SoDaNet data repository
- Author
-
Dimitra Kondyli, Constantinos-Symeon Nisiotis, and Nicolas Klironomos
- Subjects
data reuse ,secondary analysis ,repositories ,Open Science ,FAIR data ,use case ,Social Sciences - Abstract
This study highlights the role of Research Data Repositories (RDRs) in the concept of data reuse by examining a use case on migration research, a domain that requires up-to-date and accurate data for research and policy purposes. The survey from which the data for the use case was derived aimed to investigate the alignment of humanitarian assistance and social protection in Greece during the post-2015 refugee crisis. Through our analysis, we try to formulate a new corpus of variables and information that can create a value chain for research and policy purposes related to migration research, as well as to draw useful conclusions from this use case study in relation to the concept of data reuse. We address several issues related to data reuse, such as its definition, the role of research data repositories and research infrastructures in data reuse, as well as the limitations and advantages of reuse. We also present some specific features of the SoDaNet RDR, which hosts the primary data. We argue that comprehensive documentation of data adds value to the data and, through reuse, this value can be recycled to the RDR and, therefore, to potential new reusers.
- Published
- 2024
- Full Text
- View/download PDF
41. In France, the organization of perinatal care has a direct influence on the outcome of the mother and the newborn: Contribution from a French nationwide study.
- Author
-
Levaillant, Mathieu, Garabédian, Charles, Legendre, Guillaume, Soula, Julien, Hamel, Jean‐François, Vallet, Benoît, and Lamer, Antoine
- Subjects
- *
NEWBORN infants , *REGIONAL medical programs , *MOTHERS , *PREGNANT women , *HEALTH insurance , *ASPHYXIA neonatorum ,PERINATAL care - Abstract
Objective: To investigate maternal and neonatal outcomes after a delivery in France in 2019, according to hospital characteristics and the impact of distance and time of travel on mother and newborn. Methods: All parturients above 18 years of age who delivered in 2019 and were identified in the French health insurance database were included, with their newborns, in this retrospective cohort study. Main outcome measures were Severe Maternal Morbidity score and the Neonatal Adverse Outcome Indicator (NAOI). Results: Among the 733 052 pregnancies included, 10 829 presented a severe maternal morbidity (1.48%) and 77 237 had a neonatal adverse outcome (10.4%). Factors associated with an unfavorable maternal or neonatal outcome were Obstetric Comorbidity Index, primiparity, and cesarean or instrumental delivery. Prematurity was associated with less severe maternal morbidity but more neonatal adverse outcomes. Time of travel above 30 min was associated with a higher NAOI rate. Conclusions: Results suggest the efficiency of regionalization of perinatal care in France, although a difference in both outcomes persists according to unit volume, suggesting the need for a further step in concentrating perinatal care. Perinatal care organization should focus on mapping the territory with high‐level, high‐volume maternity throughout the territory; this suggests closing down high‐volume units and improving low‐volume ones to maintain coherent mapping. Synopsis: High‐volume facilities improve maternal and neonatal outcome after delivery. Distance from hospital is associated with a worse neonatal outcome. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
42. Advancing scientific inquiry through data reuse: Necessary condition analysis with archival data.
- Author
-
Dul, Jan, van Raaij, Erik, and Caputo, Andrea
- Subjects
SCIENTIFIC method ,DATA replication ,REGRESSION analysis ,RESEARCH questions ,STRATEGIC planning - Abstract
This article discusses the importance of reusing existing data in research. In addition to reuse data for replication of earlier findings and for answering extended or new research questions, we propose a third application of data reuse: studying the phenomenon from an alternative causal perspective. We focus on the reuse of data with a necessity causal perspective ("if not X, then not Y") as employed in necessary condition analysis (NCA). Such reuse of data offers additional insights compared with those obtained from the conventional probabilistic causal perspective ("if X, then probably Y") as employed in regression analysis. NCA is gaining recognition in various fields, including strategic management. Reusing data for conducting NCA is an efficient way to get new causal insights. We provide recommendations on how to use NCA with existing data and emphasize the importance of transparency when reusing data. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
43. Information Scientists' Motivations for Research Data Sharing and Reuse.
- Author
-
Shutsko, Aliaksandra and Stock, Wolfgang G.
- Subjects
- *
INFORMATION scientists , *INFORMATION sharing , *SHARING , *SELF-determination theory , *RESEARCH personnel , *MOTIVATION (Psychology) - Abstract
What are the motivations and sought gratifications leading information science researchers to share and to reuse research data? Research data are both datasets and supplementary materials such as interview guides or questionnaires. The theoretical backgrounds of this study are the Lasswell Formula of Communication, the Uses and Gratifications Theory, and the Self-determination Theory, which formed the basis for the construction of an interview guide and the interpretation of the interview transcripts. We performed 11 in-depth interviews with German information scientists, all with experiences with data. The results demonstrate that research data sharing is not a rare practice among information scientists. Due to problems with different information horizons of the sharing and the reusing researchers, the reusing of data sets is much rarer than the reuse of supplementary materials. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
44. 面向神经网络池化层的灵活高效硬件设计.
- Author
-
何增, 朱国权, and 岳克强
- Abstract
Copyright of Journal of Computer Engineering & Applications is the property of Beijing Journal of Computer Engineering & Applications Journal Co Ltd. and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2023
- Full Text
- View/download PDF
45. 学术图书馆数据文化建设的思考与探讨.
- Author
-
刘细文 and 涂志芳
- Abstract
With the emergence and development of the digital economy, digital society, and digital government, data culture has also emerged. Library data culture includes five aspects, namely library culture of data-related skills and attitudes, library culture of data sharing, library culture of data use and reuse, li- brary data ethics and governance culture, and library-specific data culture. Taking representative academic libraries at home and abroad as examples, the path of library data culture construction is analyzed from the perspectives of strategic planning and best practices, including data-driven research, data-driven services, data-driven management, and data-driven decision-making. The construction of library data culture faces risks such as data security and intellectual property, as well as challenges in digital technology development and application, capacity building and talent cultivation, and external competition. Libraries can sustainably carry out data culture construction by establishing a data mindset, strengthening capacity building, enhancing talent cultivation, prudently addressing process risks, and actively embracing external challenges. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
46. Reimagining Secondary Data in a Digital Age
- Author
-
Pritchard, Katrina
- Published
- 2023
- Full Text
- View/download PDF
47. Advantages of Data Reuse Based on Disciplinary Diversity and Citation Count
- Author
-
Ishita, Emi, Miyata, Yosuke, Kurata, Keiko, Goos, Gerhard, Founding Editor, Hartmanis, Juris, Founding Editor, Bertino, Elisa, Editorial Board Member, Gao, Wen, Editorial Board Member, Steffen, Bernhard, Editorial Board Member, Yung, Moti, Editorial Board Member, Goh, Dion H., editor, Chen, Shu-Jiun, editor, and Tuarob, Suppawong, editor
- Published
- 2023
- Full Text
- View/download PDF
48. In France, distance from hospital and health care structure impact on outcome after arthroplasty of the hip for proximal fractures of the femur
- Author
-
Mathieu Levaillant, Louis Rony, Jean-François Hamel-Broza, Julien Soula, Benoît Vallet, and Antoine Lamer
- Subjects
Volume–outcome relationship ,Hip arthroplasty ,Hospital ,Quality indicator ,Surgery ,Data reuse ,Orthopedic surgery ,RD701-811 ,Diseases of the musculoskeletal system ,RC925-935 - Abstract
Abstract Background Hip arthroplasty is a frequently performed procedure in orthopedic surgery, carried out in almost all health structures for two main issues: fracture and coxarthrosis. Even if volume–outcome relationship appeared associated in many surgeries recently, data provided are not sufficient to set surgical thresholds neither than closing down low-volumes centers. Question With this study, we wanted to identify surgical, health care-related and territorial factors influencing patient’ mortality and readmission after a HA for a femoral fracture in 2018 in France. Patients and methods Data were anonymously collected from French nationwide administrative databases. All patients who underwent a hip arthroplasty for a femoral fracture through 2018 were included. Patient outcome was 90-day mortality and 90-day readmission rate after surgery. Results Of the 36,252 patients that underwent a HA for fracture in France in 2018, 0.7% died within 90-day year and 1.2% were readmitted. Male and Charlson comorbidity index were associated with a higher 90-day mortality and readmission rate in multivariate analysis. High volume was associated with a lower mortality rate. Neither time of travel nor distance upon health facility were associated with mortality nor with readmission rate in the analysis. Conclusion Even if volume appears to be associated with lower mortality rate even for longer distance and time of travel, the persistence of exogenous factors not documented in the French databases suggests that regionalization of hip arthroplasty should be organized with caution. Clinical relevance As volume–outcome relationship must be interpreted with caution, policy makers should not regionalize such surgery without further investigation.
- Published
- 2023
- Full Text
- View/download PDF
49. Integration Management and Reuse of Research Data in the Next Generation University IR Resources
- Author
-
DU Pingping, LI Yuke, ZHANG Xueyuan, MU Yafeng
- Subjects
research data ,data integration ,data reuse ,next generation institutional knowledge resource ,Bibliography. Library science. Information resources ,Agriculture - Abstract
[Purpose/Significance] In 2021, the Ministry of Education of China issued the "Norms for the Construction of Digital Campuses in Higher Education Institutions (Trial)", which mentioned that universities should attempt to form a new model of academic exchange and resource sharing through the construction of institutional repositories (IRs). The construction of knowledge resources in university IRs is gradually developing towards the next generation IRs. Research data resources are an important component of the future resources. From the perspective of the generation, acquisition, and existence of information resources in universities, the institutional resources in universities primarily consist of resources generated by "industry, academia, research and application" activities, including teaching resources, scientific research resources, design resources, scientific activity resources, etc., which are specifically reflected in the activity process such as scholar resources, research activity resources, research process resources, and research achievements resources. Research data resources are studied as the research object in this article. [Method/Process] This paper analyzes the formation mechanism of institutional knowledge resources, and conduct research on the integration, management and reuse of research data resources, including: 1) internal factors of research data, data generation, source, type, structure, and dependency mode; 2) research on external elements of data, such as data standards, unique identifiers, data registration, data protocols, data rights, data reuse, and data sharing Through the study of internal and external factors, the influencing factors, functions and roles, responsibilities and rights, permissions and attribution of data integration management and data reuse were sorted and interpreted and feasible methods were designed for research data management, collection, implementation, and registration. In order to ensure the effectiveness of data management, a series of standards and schemes have been developed, such as data type and format standards, metadata schemes, and data guardianship demand survey templates. The purpose is to achieve data discovery, interoperability, and reuse through continuous monitoring of scientific data. The basic rules of reuse are mainly divided into: 1) reuse: the concept of reuse, reuse, sharing, and incomplete equivalence in reuse; 2) sharing: possible to be used; 3) protocol usage: discussion about how to use it; and 4) rights use: complying with the data copyright agreement. [Results/Conclusions] Through research courses, the development and implementation of favorable data management and reuse strategies have clarified the data objects, management set management services, and data reuse permissions of research data in university IRs. We have clarified the current situation and urgent issues to integrate research data into the long-term preservation, management, and sharing and reuse system of knowledge resources in university IRs. We have defined the responsibilities, management mechanisms, standardized business processes, permission attributes, data exchange platforms, data registration, and data reuse of research data subjects by category, region, and field, and provided some suggestions and guarantee measures for the establishment of a data management center in China.
- Published
- 2023
- Full Text
- View/download PDF
50. Research data sharing, reuse, and metrics : adoption and challenges across disciplines and repositories
- Author
-
Khan, Nushrat and Thelwall, Mike
- Subjects
data sharing ,data reuse ,data repositories ,research data - Abstract
Data sharing is widely believed to be beneficial to science and is now supported by digitization and new online infrastructures for sharing datasets. Nevertheless, differences in research cultures and the sporadic development of data repositories, support services, guidelines, and policies have resulted in uneven data sharing and reuse practices. An overall understanding of the current situation is therefore needed to identify gaps and next steps. In response, using two case studies and two surveys, this dissertation explores the current landscape and identifies challenges within data sharing and reuse practices. The results demonstrate how present systems and policies could be modified to support and encourage these activities. The researcher survey found that the type and format of data produced, as well as systematic data sharing varied between disciplines, with Physical Sciences and Earth and Planetary Sciences leading and Business and Economics, Engineering, and Medicine lagging in some respects. Surveys and observations were frequently produced in most fields, with samples and simulations being common in science and engineering and qualitative data being more prevalent in the social sciences, business, and humanities. Researchers who had prior data reuse experience shared data more frequently (56.8%, n=1,004) than those who only used their primary data for research (32.6%, n=396). The biodiversity case study and surveys show that secondary data are valuable for many purposes, but most struggle to find datasets to reuse. Data citations can incentivize data sharing, although a lack of appropriate data citations and reliable technologies make it difficult to efficiently track them. In biodiversity, where the sharing and reuse of open data via mature infrastructures is common, citing secondary datasets in references or data access statements has been increasing (48%, n=99). However, users simultaneously exploiting many data subsets in this field complicate the situation. This thesis makes recommendations for handling large numbers of biodiversity data subsets to attribute citations accurately. It also suggests further enhancements for the article-dataset linking service, Scholexplorer, to automatically capture such links. Based on responses from data repository managers, this research further identifies nine objectives for future repository systems. Specifically, 30% (n=34) of the surveyed managers would like integration and interoperability between data and systems, 19% (n=22) want better research data management tools, 16% (n=18) want tools that allow computation without downloading datasets, and 16% (n=18) want automated systems. It also makes 23 recommendations in three categories to support data sharing and promote further data reuse including 1) improved access and usability of data, as well as formal data citations; 2) improved search systems with suggested new features; and 3) cultural and policy-related issues around awareness and acceptance, incentives, collaboration, guidelines, and documentation. Finally, based on researcher feedback, this study proposes an alternative scoring model that combines a dataset quality score and a data reuse indicator that can be incorporated in academic evaluation systems. The outcomes from this research will help funders, policymakers and technology developers prioritize areas of improvement to incentivize data sharing and support data reuse with easily discoverable and usable data.
- Published
- 2021
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.