CLARIN-NL: Contributions by NLP & Current Status

CLARIN-NL: Contributions by NLP & Current Status
Slide Note
Embed
Share

This content provides an overview of CLARIN-NL and its contributions to NLP, detailing the infrastructure, easy access to data, software developed, and facilitated services. It highlights the availability of NLP resources to humanities researchers, emphasizing user-friendly interfaces and tools. The content showcases various projects and databases within CLARIN-NL, offering insights into the current status and future prospects of the research infrastructure.

  • CLARIN-NL
  • NLP
  • Humanities research
  • Digital language resources
  • Research infrastructure

Uploaded on Mar 04, 2025 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. CLARIN-NL: Contributions by NLP & Current Status Jan Odijk CLIN-dag Antwerp, 2015-02-06 1

  2. Overview CLARIN CLARIN-NL and NLP CLARIN-NL Portal CLARIN-NL: Current Status and Near Future 2

  3. CLARIN Infrastructure A research infrastructure for humanities researchers who work with digital language resources Research infrastructure: facilities, resources and related services used by the scientific community to conduct top-level research Humanities researchers: linguists, historians, literary scholars, Language resources: lexicons, corpora, databases, ; text, audio, video, . 3

  4. CLARIN-NL and NLP NLP data and software developed by you now available to humanities researchers in general thanks to CLARIN-NL By specifically designed search engines and interfaces By user friendly interfaces to software Mostly as web applications (and web services) Most by NL, some by FL, some together 4

  5. CLARIN-NL and NLP Easy Access to data OpenSONAR (see Reynaert s presentation) GrETEL (Flanders) (see Augustinus et al. s poster ) Cornetto (lexico-semantic database) DuELME (multiword expressions database) PaQu (see Odijk s poster ) Autosearch (as of March 2015) 5

  6. CLARIN-NL and NLP Easy use of software INPOLDER/Adelheid (syntactic analysis of 13thcentury Dutch ) TTNWW (orthographic normalisation, pos-tagging, parsing, NER, co-reference assignment, speech transcription, .) (NL+FL) TiCCLops (orthographic normalisation) PICCL (see Reynaert s poster) Stylene (Flanders) stylistic analysis 6

  7. CLARIN-NL and NLP Facilitated by CLAM wrapper to easily build web services Increased syntactic interoperability: FoLiA increasingly a de facto standard format in NL for linguistically annotated corpora Increased semantic interoperability through ISOCAT and currently the CLARIN Concept Registry (poster by Schuurman et al.) 7

  8. CLARIN-NL Portal CLARIN-NL Portal: http://portal.clarin.nl Faceted search in data and services created by CLARIN-NL Search by research domain, language, tool task, linguistic annotation and several other facets Links to search facilities in the whole CLARIN infrastructure Educational packages: https://dev.clarin.nl/node/CLARIN%20Educational %20Packages And much more . 8

  9. Current Status CLARIN-NL finishes by April 1st, 2015 Final event: Friday March 13, 2015, (morning) Netherlands Institute for Sound & Vision, Hilversum Register via the CLARIN-NL website: http://www.clarin.nl/node/2074 9

  10. Near Future CLARIAH-CORE successor project Jan 1, 2015 Dec 31, 2018 12 m euro Kick-off: Friday March 13, 2015, (afternoon) Netherlands Institute for Sound & Vision, Hilversum Register via the CLARIAH-website http://www.clariah.nl/activiteiten/clariah-kick-off 10

  11. Thanks for Attention! 11

  12. Subdisciplines C:\Users\pl_user7\AppData\Local\Microsoft\Windows\Temporary Internet Files\Content.IE5\L8HBSUK3\arrow[1].png CLARIN-NL Linguistics subdisciplines Historical linguistics Dialectology Discourse Studies Language Acquisition Language Documentation Lexicology / Lexicography Morphology Morpho-syntax Phonetics Phonology Pidgin & Creole Studies Semantics Sign Language Specific Language Impairment (SLI) Syntax Typology 12

  13. Languages Covered C:\Users\pl_user7\AppData\Local\Microsoft\Windows\Temporary Internet Files\Content.IE5\L8HBSUK3\arrow[1].png CLARIN-NL Languages Dutch English German Frisian Dutch Sign Language Classical Greek French Hebrew Aramaic Syriac > 50 languages from Insular South East Asia and West New Guinea And more And many more (> 2,000) from all CLARIN countries 13

  14. What you can do C:\Users\pl_user7\AppData\Local\Microsoft\Windows\Temporary Internet Files\Content.IE5\L8HBSUK3\arrow[1].png CLARIN-NL Functionality Offered Searching Browsing Analysis Corpus Exploration Annotation Tokenization Pos-tagging Lemmatization Orthographic normalisation Grammatical relation assignment Co-reference assignment Named entity recognition Multiword unit assignment Chunking Parsing Visualisation Diarisation Speech recognition 14

Related


More Related Content