Understanding ISOcat: Data Category Registry and Linguistic Resources

www isocat org www isocat org n.w
1 / 12
Embed
Share

Explore the ISOcat Data Category Registry, its role in managing linguistic resources, and the fundamental concepts surrounding data categories. Learn about the importance of data categories in managing and organizing language resources effectively.

  • ISOcat
  • Data Category Registry
  • Linguistic Resources
  • Metadata
  • CMDI

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. www.isocat.org www.isocat.org The ISO-DCR Marc Kemps-Snijdersa, Menzo Windhouwerb, Sue Ellen Wrightc aMeertens Institute, bMPI for Psycholinguistics, cKent State University menzo.windhouwer@mpi.nl 17 January /2011 CMDI tutorial 1

  2. www.isocat.org www.isocat.org Outline ISOcat: a Data Category Registry The role of data categories in CMDI A glimpse of ISOcat Status of the metadata profile 17 January /2011 CMDI tutorial 2

  3. www.isocat.org www.isocat.org ISOcat: a Data Category Registry The reference implementation of ISO 12620:2009 Terminology and other content and language resources Specification of data categories and management of a Data Category Registry for language resources A data category is the result of the specification of a given data field an elementary descriptor in a linguistic structure or an annotation scheme 17 January /2011 CMDI tutorial 3

  4. www.isocat.org www.isocat.org Data categories and linguistic resources wordOrder grammaticalGender Language BWO genders Lexicon 1..* A (schema for a) typological database Lexical Entry partOfSpeech writtenForm Lemma 1..* 0..* Form Sense writtenForm 0..* Word Form grammaticalGender lexicalType A (schema for a) lexicon 17 January /2011 CMDI tutorial 4

  5. www.isocat.org www.isocat.org Data category specification Administrative information Identifier Version Origin Justification Status Descriptive information Names, definitions, examples and explanations in various languages (English is mandatory) Application (domain) specific names Conceptual domain Possible values (per profile) Linguistic information Examples and explanations for various languages Possible values for various languages 17 January /2011 CMDI tutorial 5

  6. www.isocat.org www.isocat.org The role of data categories in CMDI CMD components, elements and items can have links to concepts These links should be resolvable to a concept description This concept description gives explicitsemantics Elements and components can use different terminology but still have common semantics ISOcat provides resolvable links to the semantic description of data categories (DCs) CMD items: simple DCs CMD elements: complex DCs CMD components: container DCs (upcoming) 17 January /2011 CMDI tutorial 6

  7. www.isocat.org www.isocat.org Data category references in CMDI <CMD_Component name="HeadWordType"> <CMD_Element name="HeadWordType" ConceptLink="http://www.isocat.org/datcat/DC-2486"> <ValueScheme> <enumeration> <item ConceptLink="http://www.isocat.org/datcat/DC-286">Lemma</item> <item ConceptLink="http://www.isocat.org/datcat/DC-2948">Word form</item> <item ConceptLink="http://www.isocat.org/datcat/DC-350">Phrase</item> <item ConceptLink="http://www.isocat.org/datcat/DC-1386">Sentence</item> <item ConceptLink="http://www.isocat.org/datcat/DC-2599">Other</item> <item ConceptLink="http://www.isocat.org/datcat/DC-2592">Unspecified</item> </enumeration> </ValueScheme> </CMD_Element> </CMD_Component> 17 January /2011 CMDI tutorial 7

  8. www.isocat.org www.isocat.org A glimpse of ISOcat http://www.isocat.org/ 17 January /2011 CMDI tutorial 8

  9. www.isocat.org www.isocat.orgStatus of the metadata profile Initial set of data categories has been created (to never disappear) by the Athens Core group But your own components might need your own specific DCs Translations for many EU languages have been added No ISO Standardization yet TDG is working towards starting up the process Athens Core group will become more prominent The addition of container DCs to be linked to CMD components is planned 17 January /2011 CMDI tutorial 9

  10. www.isocat.org www.isocat.org Standardization Decision Group Submission group Thematic Domain Group Data Category Registry Board Stewardship group Evaluation Validation rejected rejected Publication 17 January /2011 CMDI tutorial 10

  11. www.isocat.org www.isocat.org Component Registry Interacts with ISOcat Access to public DCs in the metadata profile So to currently access your private DCs you ll have to make them public Working towards: The ability to access your private workspace from the Component Registry Still will have to make your DCs public if you make your component/profile public 17 January /2011 CMDI tutorial 11

  12. www.isocat.org www.isocat.org Thank you for your attention! Visit www.isocat.org Questions? isocat@mpi.nl or http://trac.clarin.nl/ / helpdesk@clarin.nl or The CLARIN-NL ISOcat tutorial 2011 17 January /2011 CMDI tutorial 12

More Related Content