
Data Discovery Paradigms RDA Interest Group Activities Overview
Explore the activities of the Data Discovery Paradigms RDA Interest Group including goals, identified topics, task forces, and progress updates. The group aims to improve data discovery through best practices, use cases, and metadata enrichment, with a focus on stakeholder collaboration.
Presentation Transcript
DATA DISCOVERY PARADIGMS RDA Interest Group
GOALS OF THE DDPIG
Founding Co-Chairs: Siri Jodha Khalsa, Univ. of Colorado; Anita de Waard, Elsevier
Goal: Identify, study and make recommendations concerning issues related to improving data discovery
Stakeholders: Data producers, data repositories, data seekers
ACTIVITIES
23 topics were identified in the kickoff meeting at RDA#8.
74 people signed up for the group.
Later, these topics were refined and voted on, leading to 5 top picks:
  1. Best practices for making data findable
  2. Use cases, prototyping tools and test collections
  3. Metadata enrichment
  4. Cataloging common APIs
  5. Relevancy ranking
Task forces were formed and leads identified.
  Task forces 1, 2 and 5 got to work immediately.
  Leads of 3 and 4 have been slower to start.
Two very productive TF leads were asked to become co-chairs:
  Mingfang Wu, Australian National Data Service
  Fotis Psomopoulos, Aristotle University of Thessaloniki
IG SESSION AT RDA P9
Attendance: ~40
The first three Task Forces presented their progress and proposed next steps.
The Metadata Enrichment Task Force was formed with new leads.
Agreed follow-up actions leading to P10:
  Relevancy ranking: send out questionnaire, collect and prioritise collaborative projects, decide on platform for testbed
  Use cases: rank use cases, rewrite document, provide examples of platforms, write final report
  Best practices: make further edits to document, combine into a white paper, submit for publication
  Metadata enrichment: start regular telecons to plan next steps
BEST PRACTICES FOR MAKING DATA FINDABLE
Co-Leads: Anita de Waard, Jeffrey Grethe, William Michener, Mingfang Wu
Members: 26
Scope: Explore current practices of making data findable and recommend best practices to the data community
Activity to Date: Drafted 3 documents
  Best practices for Data Producers
  Best practices for Data Repositories
  Best practices for Data Seekers
Plan to submit to a journal for publication
USE CASES, PROTOTYPING TOOLS AND TEST COLLECTIONS
Leads: Anita de Waard, Antica Culina, Fotis Psomopoulos, Jens Klump, Mingfang Wu
Members: 15
Scope: Identify key requirements evident across data discovery use cases from various scientific domains
Activity to Date: Collected >60 use cases in the form of:
  As a (i.e. role)
  Theme (i.e. scientific domain/discipline)
  I want (i.e. requirement, missing feature, supported function)
  So that (i.e. what can be accomplished when the user need has been addressed)
  Comments
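The use case template above maps naturally onto a small record type. The following is a minimal sketch in Python, not the task force's actual collection format: the class name `UseCase`, the field names (`role`, `theme`, `requirement`, `benefit`, `comments`) and the example entry are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class UseCase:
    """One data discovery use case (hypothetical field names)."""
    role: str          # "As a ..."    (the kind of user)
    theme: str         # scientific domain/discipline
    requirement: str   # "I want ..."  (requirement, missing feature, supported function)
    benefit: str       # "So that ..." (what the user can then accomplish)
    comments: str = ""

    def as_sentence(self) -> str:
        """Render the record in the 'As a / I want / So that' form."""
        return (f"As a {self.role} ({self.theme}), "
                f"I want {self.requirement}, so that {self.benefit}.")


# Invented example entry, for illustration only.
example = UseCase(
    role="researcher",
    theme="hydrology",
    requirement="to search datasets by spatial extent",
    benefit="I can find observations for my study area",
)
print(example.as_sentence())
```

Keeping each template slot as a separate field would make it straightforward to group or rank the collected use cases by role or theme.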
RELEVANCY RANKING
Leads: Peter Cotroneo, Mingfang Wu, Siri Jodha Khalsa
Members: 11
Scope:
  Help with selection of appropriate technologies for improving search functionality
  Provide a means or forum for sharing experiences, tools and test collections related to relevancy ranking
  Work with the data search community to explore realistic yet reliable ways for data repositories to carry out relevancy ranking comparison and evaluation tasks
Activity to Date: Preparation of a survey on relevancy ranking systems, to be sent to a large list of repositories
METADATA ENRICHMENT
Leads: Beth Huffer, Ilya Zaslavsky
Members: TBD
Activity to Date: Two telecons since P9 to discuss scoping
OUTREACH TO OTHER RDA GROUPS
Prior to P8, emails were sent to these RDA Groups inviting feedback on interim outputs from the first 3 task forces:
  active-data-management-plans
  data-versioning
  domain-repositories
  education-and-training-handling-research-data
  health-data
  libraries-research-data
  metadata
  national-data-services
  pid
  preservation-e-infrastructure
  rdacodata-materials-data-infrastructure-interoperability
  rdawds-certification-digital-repositories
  rdawdsx-publishing-data
  repository-platforms-research-data
POTENTIALLY FRUITFUL COLLABORATIONS
Sharing of approaches for improving findability among domain repositories
Contributing data discovery use cases to a common database of use cases
Providing a testbed for experimentation with retrieval/ranking algorithms
  Have offers, suggestions from: US NDS, ANDS, Elsevier's AWS EC2