
Insights into Repository Registries and Metadata Aggregation
Explore the complex landscape of repository registries and metadata aggregation, delving into discussions on standardization, classification, and the need for trustful repositories. Discover the different types of registries and their significance in the data repository ecosystem.
Download Presentation

Please find below an Image/Link to download the presentation.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.
You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.
The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.
E N D
Presentation Transcript
Repository Registries Agenda 11.30 Welcome & State of the Discussion Is it all one is it all different? Peter & Herman and commenters 12.10 Actions to be taken for Collection Registry Bob and others 12.20 Towards a WG on Repository Registries Open Discussion - - - - Looking into existing schemas Integrating the approaches Making re3data schema an RDA recommendation Analyze various federation approaches and specify schema 12.45 Next Steps Peter & Herman 13.00 End
State of the Discussion Short History: P3 BoF P5 BoF Linking re3data and DataBib What is needed a start Contributions: re3data, gocdb, eudat, Biomed Regs (no further info on Biomed regs yet) VM: Docs, Schemas from CLARIN, Karmela agreement on classification and way to go lots of open questions P6 Session Summarizing and Actions
Where are we? What is special? Different MD? But there is an urgent need! That s what many people do. Collection MD records are just as other MD records Here we have two activities: Re3data type Federation type Curation responsibility Lots of mapping and curation
Where are we? highly granular MD Schema -based Granu- larity Harves- ting Hetero- geneity RDA Collection MD Y DO OAI-PMH high MD groups Y and NO Collec- tion MD groups Dom Rep OAI-PMH high Repository MD Light- weight Quick view Re3data be a standard ? - Heavy weight Incl. services To be ? high harmonized Federation MD Open questions: What to be done for Collection Registry? What to do to harmonize Federation MD? Where do we need to distinguish?
Repository Registries - Classification Type Description In RDA Registries that aggregate metadata records from MD providers (which are often data repositories) MD groups Metadata Registries Registries that indicate what kinds of collections repositories have; Collections are part of MD Domain Reps? Collection Registries Special registry where to find appropriate clinical trials; kind of collection registry Clinical Trial Registries? Karmela Special registry about where to find useful biomed information seems to be a kind of coll registry Biomed Reg IG Biomed Registries Make it standard Trustful Rep Reg Re3data registry to quickly find trustful repositories Registries to help automating operations in large federations WG to come Federation Registries Other registries out there There are different types of registries out there, do we need to look at them? If so who
Collection Registries To be taken care of by Bob
Joint Approach re3data Schema Trustful Repository Re3data service harvesting extracting GocDB+ Schema Trustful Repository GocDB+ service other Schema harvesting extracting Trustful Repository Overlap in schemas to be used. Make two schemas recommendations. Large Federation
Large Federations Federation Reg type EUDAT About 50 nodes GocDB DataONE ? ? EGI Grid Hundreds of nodes GocDB World Climate Modelling Centers ? ? CLARIN About 20 nodes own DNC/iRODS ? own ? Need to identify them to participate in further elaborations. WG needs to take an active role and time
Relations 1 1 GocDB+ Schema Rep Registry Service Trustful Repository 3 2 4 5 Metadata Service Simplified interaction: 1: harvest federation type metadata 2: request all reps that offer metadata (probably of certain type) 3: return list incl. port specs etc. 4: request metadata records 5: return all metadata records
Re3Data Schema descriptive Controlled vocabulary aspects usable for humans to find suitable repositories
Next Steps Let s setup a simple RDA Collection Registry WG to Quickly specify the Collection Registry Specify first partners and responsibilities Develop and Implement Simple Software Discuss with NDS and EUDAT about service Chairs: Bob & Herman/SJT Let s setup a Repository Registries WG to Define re3data schema as RDA recommendation study various federation use cases identify overlap in schemas define federation schema as RDA recommendation (GocDB seems to be a candidate to start from) define exposure protocols and perhaps some mappings Chairs: Peter/Johannes & ?