Structured and Unstructured Data in Information Retrieval

introduction to information retrieval n.w
1 / 5
Embed
Share

Exploring the differences between structured and unstructured data in information retrieval, this content delves into how structured data refers to information organized in tables, while unstructured data pertains to free text. It also discusses semi-structured data and its search capabilities, providing insights into various data types for effective information retrieval strategies.

  • Information Retrieval
  • Data Structuring
  • Structured Data
  • Unstructured Data
  • Data Search

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. Introduction to Information Retrieval Structured vs. Unstructured Data

  2. IR vs. databases: Structured vs unstructured data Structured data tends to refer to information in tables Employee Manager Salary Smith Jones 50000 Chang Smith 60000 Ivy Smith 50000 Typically allows numerical range and exact match (for text) queries, e.g., Salary < 60000 AND Manager = Smith. 2

  3. Unstructured data Typically refers to free text Allows Keyword queries including operators More sophisticated concept queries e.g., find all web pages dealing with drug abuse Classic model for searching text documents 3

  4. Semi-structured data In fact almost no data is unstructured E.g., this slide has distinctly identified zones such as the Title and Bullets to say nothing of linguistic structure Facilitates semi-structured search such as Title contains data AND Bullets contain search Or even Title is about Object Oriented Programming AND Author something like stro*rup where * is the wild-card operator 4

  5. Introduction to Information Retrieval Structured vs. Unstructured Data

More Related Content