Investigation of Digital Reference Interviews: Effective Information Retrieval Strategies

an investigation of digital reference interviews n.w
1 / 21
Embed
Share

Explore the linguistic properties of computer-mediated information-seeking conversations and the utilization of dialogue acts analysis and machine learning for automatic detection. Research involves online chat reference logs to analyze discourse and communication functions.

  • research
  • information retrieval
  • dialogue acts
  • linguistic analysis
  • machine learning

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. An Investigation of Digital Reference Interviews: A Dialogue Act Approach Bei Yu, Assistant Professor Keisuke Inoue, PhD Candidate

  2. The web is full of conversations

  3. How can we find information in conversations conversationseffectively? 3

  4. How can information retrieval systems effectively utilize a collection of conversations as an information resource? How can IR systems incorporate processes or structure of information-seekingconversations?

  5. Research Questions 1. What are the linguistic properties of computer-mediated information-seeking conversations? Dialogue acts analysis of digital reference interviews 2. How can such properties be detected automatically? Machine learning experiments of dialogue acts annotation 5

  6. Data Online chat reference log provided Online chat reference log provided by by OCLC, OCLC, courtesy of Dr. Radford courtesy of Dr. Radford and Dr. and Dr. Connaway Connaway 800 interview 800 interview sessions collected sessions collected from from April. 2004 to April. 2004 to Sept. Sept. 2006 2006 200 interviews were selected for 200 interviews were selected for discourse analysis based on the discourse analysis based on the questions asked. questions asked. 6

  7. Dialogue Act Classification The communicative function of utterances in dialogue-based interactions Popescu-Belis, 2008 Two levels of analysis: function and domain Two coals of dialogues: underlying goals and communicative goals 7

  8. Unit of Analysis Text Dialogue Act X Segment 1 Message 1 Session 1 Text Dialogue Act Y Segment l Message m Session n n = 210, m = 26 (average), l = 1.5 (average)

  9. Classification Scheme Sub Domain Function Domain Topic Information Problem Background Information Reference Info. Provision Object History Search Process Current Process Strategy

  10. Classification Scheme Structure Sub Domain Function Domain Topic Information Problem Background Information Reference Info. Provision Object History Search Process Current Process Strategy

  11. Example Info. Provision Topic Topic Which colleges did top fashion designers go? Info. Request Topic Topic You mean top fashion designers anywhere? ? Info. Provision Topic Topic Info. Provision Answer Answer ASK US! Yep, anywhere in U.S. Calvin Klein Graduated from NY s Fashion Institute of Technology in 1964 ? Info. Provision Feedback Feedback ASK US! I need more recent ones Info. Request Topic Topic Do you have anyone in mind? ? Info. Provision Topic / Background Topic / Background No I m deciding which school to go. ASK US! (continue ) (continue )

  12. Annotation Three MLIS students worked on approx. five sessions per week (20 weeks total). Approx. 8K messages, 12K segments. Approx. 20% overlap between two annotators. Approx. 10% overlap between three annotators. Kappa was confirmed satisfactory (> .8) except for the deepest layer.

  13. Results Example: Distribution of Dialogue Act Functions User Librarian Info Request Info Provision Comm Mgmt Social Rel Mgmt Task Mgmt

  14. Results Example: Information Domains over Time Librarian User Other Feedback Search Process Info. Object Info:Problem Start Mid End Start Mid End

  15. Observations DA analysis enabled: Confirming the theories/models of Communication, Linguistic, and information behavior. Characterizing the digital reference interviews Enabling comparisons with other types of information-seeking conversations. 15

  16. Machine Learning (Text Classification) Given a piece of text, find a label for the text. Different types of variables (features) to represent text. Various algorithms to find labels. 16

  17. Algorithm HM-SVM Combining the HMM (Hidden Markov Model) and SVM (Support Vector Machine) A few implementations available Proven to be effective for structured labels No applications for DA labeling yet 17

  18. Preliminary Results Classifying the Function (shallowest) Layer (with SVM): Class Info. Provision Info. Request Precision Recall F-Measure 0.861 0.697 0.894 0.687 0.877 0.692 Task Mgmt Dialogue Mgmt Social Rel. Mgmt Weighted Average 0.703 0.851 0.89 0.836 0.67 0.763 0.868 0.837 0.686 0.804 0.879 0.836

  19. Machine Learning The preliminary results are promising. The future work include: Experimenting with the Domain (deeper) layer Testing with HM-SVM Analyzing the results and testing with different features.

  20. Summary DA analysis: Confirmed the previous theories/models. Characterized the digital reference interviews Future Work Comparisons with other types of conversations Improving the Machine Learning and applying it to IR systems experiment (e.g. as a new feature for a ranking algorithm). 20

  21. Thank you to the ALISE / OCLC for the wonderful opportunity. Thank you to Dr. Lynn Connaway for all the work and support.

Related


More Related Content