Discretized Interpretation of Continuous Prompts: Insights into Sentiment Analysis and Language Models

prompt waywardness n.w
1 / 25
Embed
Share

Explore the intersection of discretized interpretation of continuous prompts in sentiment analysis and language models. Discover the challenges and nuances of converting continuous prompts into discrete text prompts for optimizing sentiment analysis models. Delve into the research questions surrounding meaningful interpretations and the fidelity of these interpretations to the original content.

  • Sentiment Analysis
  • Language Models
  • Continuous Prompts
  • Discretized Interpretation
  • Research Questions

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. Prompt Waywardness: On Discretized Interpretation of Continuous Prompts Daniel Khashabi Xinxi Lyu Sewon Min LianhuiQin Kyle Richardson Sean Welleck Hannaneh Hajishirzi Tushar Khot Ashish Sabharwal Sameer Singh Yejin Choi 1

  2. pre-trained language models (LM) LM [Peters+ 2018 , Radford+ 2019, Brown+ 2020, . ] 2

  3. LM Language prompt [Peters+ 2018 , Radford+ 2019, Brown+ 2020, . ] 3

  4. discrete (text) prompts: easy to interpret, but not easy to optimize What is the sentiment of the following review? (positive or negative) LM Sentence: That was a great fantasy movie. positive Language prompt [Peters+ 2018 , Radford+ 2019, Brown+ 2020, . ] 4

  5. discrete (text) prompts: easy to interpret, but not easy to optimize What is the sentiment of the following review? (positive or negative) LM Sentence: That was a great fantasy movie. positive 5

  6. discrete (text) prompts: easy to interpret, but not easy to optimize What is the sentiment of the following review? (positive or negative) LM Sentence: That was a great fantasy movie. positive Something related to sentiment analysis? continuous prompts: 0.9 0.1 -2.1 0.0 unclear how to interpret, but easy to optimize LM Sentence: That was a great fantasy movie. positive [Li and Liang 2021; Lester+ 2021] 6

  7. Research question: are there any meaningful discrete (textual) interpretations to continuous prompts? Opposite: how unfaithful can their interpretation be to what they do? Something related to sentiment analysis? 0.9 0.1 -2.1 0.0 LM Sentence: That was a great fantasy movie. positive 7

  8. Research question: are there any meaningful discrete (textual) interpretations to continuous prompts? Opposite: how unfaithful can their interpretation be to what they do? any arbitrary text: Flip the sentiment of the sentence Proj(.) 0.9 0.1 -2.1 0.0 LM Sentence: That was a great fantasy movie. positive 8

  9. Waywardness hypothesis (informal): One can find accurate continuous prompts such that they can be projected to any arbitrary text. Opposite: how unfaithful can their interpretation be to what they do? any arbitrary text: Flip the sentiment of the sentence Proj(.) 0.9 0.1 -2.1 0.0 LM Sentence: That was a great fantasy movie. positive 9

  10. Waywardness hypothesis (informal): One can find accurate continuous prompts such that they can be projected to any arbitrary text. any arbitrary text: Flip the sentiment of the sentence Proj(.) 0.9 0.1 -2.1 0.0 LM Sentence: That was a great fantasy movie. positive 10

  11. ?: optimized for the task LM Sentence: That was a great fantasy movie. positive 11

  12. definition of another task: Write down the conclusion you can reach by combining the given Fact 1 and Fact 2. ?: optimized for the task + project to a given text ? : optimized for the task LM Sentence: That was a great fantasy movie. positive 12

  13. definition of another task: nearest-neighbor mapping of continuous prompt onto the word embeddings Write down the conclusion you can reach by combining the given Fact 1 and Fact 2. ?: optimized for the task + project to a given text random sentence from web: int clamp(int val, int min_val) { return std::max(min_val, val); } ? : optimized for the task LM Sentence: That was a great fantasy movie. positive 13

  14. EXPECTATION ?: optimized for the task + project to a given text ? ? ? : optimized for the task accuracy LM Sentence: That was a great fantasy movie. positive 14

  15. EXPECTATION continuous prompts that project to any given text with tiny drop in task accuracy! ? ? accuracy ?: optimized for the task + project to a given text REALITY ? 91.8 ~0.6% ? ? : optimized for the task accuracy 92.4 100 40 60 80 accuracy LM Sentence: That was a great fantasy movie. positive 15

  16. 60 target prompts 5 benchmarks x accuracy accuracy accuracy 70 80 90 100 40 60 80 70 80 90 100 91.9 87.3 49.3 ? ? ? ~0.8% ~0.5% ~1.0% 88.1 92.4 ? ? ? 50.3 sentiment classification (SST-5) sentiment classification (SST-2) topic classification (AGNews) accuracy accuracy 70 80 90 100 70 80 90 100 86.0 89.2 ? ? ~2.0% ~1.3% 90.5 ? ? 88.0 subjectivity classification (Subj) question type classification (TREC) 16

  17. Continuous prompts on larger LMs show more wayward behavior! 17

  18. Making Sense of Waywardness (1) The mapping between continuous and discrete space is not one-to-one. 18

  19. Making Sense of Waywardness (1) The mapping between continuous and discrete space is not one-to-one. It is true for many choices of Proj(.) (2) Deep models give a lot of expressivity power to the earlier layers. [Telgarsky 2016; Raghu+ 2017] ? 19

  20. Implications of Waywardness 20

  21. Implications of Waywardness 1. Faithful interpretation of continuous prompts is difficult. Something related to sentiment analysis? continuous prompts: 0.9 0.1 -2.1 0.0 unclear how to interpret, but easy to optimize LM Sentence: That was a great fantasy movie. positive 21

  22. Implications of Waywardness 1. Faithful interpretation of continuous prompts is difficult. 2. Harms of [mis]interpreting continuous prompts. continuous prompt benign projection Proj(.) Rank the candidates ignoring their race or gender. malicious behavior LM < < 22

  23. Implications of Waywardness 1. Faithful interpretation of continuous prompts is difficult. 2. Harms of [mis]interpreting continuous prompts. 3. Difficulty of optimizing discrete prompts. 4. Sufficiency of task-independent initialization for continuous prompt tuning. 23

  24. Summary Waywardness hypothesis (informal): One can find accurate continuous prompts such that they can be projected to any arbitrary text. We provided empirical evidence and intuitions for this hypothesis. Concluded with implications of this hypothesis. We need algorithmic or architectural innovations for automatic discovery prompts that are faithful to what they solve. 24

  25. Experiment: effect of prompt length The relative accuracy drop is marginal when the prompt length is not too small (e.g. 7 or larger). 25

Related


More Related Content