3 results on '"Dimitris Koureas"'
Search Results
2. ‘The Last Mile’: The registry behind the identifier
- Author
-
Claus Weiland, Dimitris Koureas, Alex Hardisty, Wouter Addink, and Larry Lannom
- Subjects
0106 biological sciences ,Persistent identifier ,0303 health sciences ,business.industry ,Computer science ,Digital Object Architecture ,Interoperability ,Search engine indexing ,handle ,Cloud computing ,General Medicine ,registry ,010603 evolutionary biology ,01 natural sciences ,Identifier ,World Wide Web ,03 medical and health sciences ,persistent identifier ,OBO Foundry ,Object model ,business ,Cyberspace ,030304 developmental biology - Abstract
Preserved specimens in natural science collections have lifespans of many decades and often, several hundreds of years. Specimens must be unambiguously identifiable and traceable in the face of changes in physical location, changes in organisation of the collection to which they belong, and changes in classification. When digitizing museum collections, a clear link must be maintained between the physical specimen itself and the information digitally representing that specimen in cyberspace. The idea of a Natural Science Identifier (NSId) as a neutral, unique, universal and stable long-term persistent identifier (PID) of a ‘Digital Specimen’ is central to museums’ ambitions for widening access. An NSId allows easy identification and referencing of specific Digital Specimens, regardless of type, location, owner or user. It provides a digital doorway to physical specimens through which services for arranging loans and visits can be accessed, as well as opening the door to innovative services for manipulating specimens’ information directly; for work reliant upon discovery of related third-party information; and for demanding 3D modelling and visualization of specimens. Because the work takes place within e-Infrastructures/Cyberspace, new possibilities for analysing hundreds of thousands of specimens simultaneously are opened by exploiting large-scale cloud computing capacity and deep mining/machine learning, for example. There are several established identifier mechanisms that could be used as a basis for NSId, but some variant of Handles is most appropriate over the very long-term because of their neutrality, resistance to change and sustainability. Adopted uses of the Handle system include identification of journal articles and datasets in education and research (using Digital Object Identifiers); film and television programme assets in the entertainment sector; financial derivatives; and for international shipping and construction. Aside from being stable and sustained over time, an essential requirement of a global PID mechanism is independence from the museums/institutions assigning identifiers. NSIds are opaque insofar as no information can or should be inferred solely by inspecting the identifier. Stakeholders change, collections move, and organisations evolve, merge or disappear. Even designations and descriptions of specimens and collections can change. Information should only be revealed when the identifier is resolved via a neutral index. One can debate the most appropriate instantiation of the Handle system but this is not useful. Relevance, ease of use and added-value of the supporting ‘NSId Registry’ (NSIdR) – the index of the different kinds of natural science object and their relations – are the decisive factors. This can be seen from the example of the Entertainment Identifier Registry (EIDR) founded by the major motion picture studios to create a reliable way to identify and track film and TV content distribution. Focus on the object model, promotional branding and value perception in the target user segment are the critical factors for success. Providing such a registry, seamlessly coupled to work practices and language of the professionals addresses the last mile challenge (Koureas et al. 2016). From specimens, class characteristics, storage containers and collections, to specific identifications, images, naming, literature references and more, the NSIdR’s triple-hierarchy object model, rooted in OBO Foundry’s Biological Collections Ontology, is the key to persistently identifying, relating and indexing the entire range of collection objects of interest to scientists and others working in the bio and geo realms. The NSIdR ‘knowledge graph’, interoperable with other identifier schemes, supports novel first- and third-party value-add services such as arranging loans and visits, curation and annotation, and machine-learning for relationship discovery and pattern exploration.
- Published
- 2019
3. FAIR Digital Objects for Science: From Data Pieces to Actionable Knowledge Units
- Author
-
Dimitris Koureas, Koenraad De Smedt, and Peter Wittenburg
- Subjects
0106 biological sciences ,data infrastructure ,Computer science ,Data management ,Interoperability ,Context (language use) ,Cloud computing ,Library and Information Sciences ,010603 evolutionary biology ,01 natural sciences ,lcsh:Communication. Mass media ,European Open Science Cloud ,03 medical and health sciences ,research infrastructure ,open science ,Media Technology ,information_technology_data_management ,Business and International Management ,030304 developmental biology ,Persistent identifier ,FAIR data ,0303 health sciences ,business.industry ,digital object ,Communication ,lcsh:Information resources (General) ,05 social sciences ,Data science ,lcsh:P87-96 ,Computer Science Applications ,Identifier ,Metadata ,EOSC ,persistent identifier ,The Internet ,data management ,data science ,0509 other social sciences ,050904 information & library sciences ,business ,lcsh:ZA3040-5185 - Abstract
Data science is facing the following major challenges: (1) developing scalable cross-disciplinary capabilities, (2) dealing with the increasing data volumes and their inherent complexity, (3) building tools that help to build trust, (4) creating mechanisms to efficiently operate in the domain of scientific assertions, (5) turning data into actionable knowledge units and (6) promoting data interoperability. As a way to overcome these challenges, we further develop the proposals by early Internet pioneers for Digital Objects as encapsulations of data and metadata made accessible by persistent identifiers. In the past decade, this concept was revisited by various groups within the Research Data Alliance and put in the context of the FAIR Guiding Principles for findable, accessible, interoperable and reusable data. The basic components of a FAIR Digital Object (FDO) as a self-contained, typed, machine-actionable data package are explained. A survey of use cases has indicated the growing interest of research communities in FDO solutions. We conclude that the FDO concept has the potential to act as the interoperable federative core of a hyperinfrastructure initiative such as the European Open Science Cloud (EOSC). publishedVersion
- Published
- 2020
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.