Discretized Interpretation of Continuous Prompts: Insights into Sentiment Analysis and Language Models

1 / 25

Embed Share

Explore the intersection of discretized interpretation of continuous prompts in sentiment analysis and language models. Discover the challenges and nuances of converting continuous prompts into discrete text prompts for optimizing sentiment analysis models. Delve into the research questions surrounding meaningful interpretations and the fidelity of these interpretations to the original content.

lechuga_m Follow

Uploaded on Jun 01, 2025 | 0 Views

Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

Download Presentation

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript

Prompt Waywardness: On Discretized Interpretation of Continuous Prompts Daniel Khashabi Xinxi Lyu Sewon Min LianhuiQin Kyle Richardson Sean Welleck Hannaneh Hajishirzi Tushar Khot Ashish Sabharwal Sameer Singh Yejin Choi 1

pre-trained language models (LM) LM [Peters+ 2018 , Radford+ 2019, Brown+ 2020, . ] 2

LM Language prompt [Peters+ 2018 , Radford+ 2019, Brown+ 2020, . ] 3

discrete (text) prompts: easy to interpret, but not easy to optimize What is the sentiment of the following review? (positive or negative) LM Sentence: That was a great fantasy movie. positive Language prompt [Peters+ 2018 , Radford+ 2019, Brown+ 2020, . ] 4

discrete (text) prompts: easy to interpret, but not easy to optimize What is the sentiment of the following review? (positive or negative) LM Sentence: That was a great fantasy movie. positive 5

discrete (text) prompts: easy to interpret, but not easy to optimize What is the sentiment of the following review? (positive or negative) LM Sentence: That was a great fantasy movie. positive Something related to sentiment analysis? continuous prompts: 0.9 0.1 -2.1 0.0 unclear how to interpret, but easy to optimize LM Sentence: That was a great fantasy movie. positive [Li and Liang 2021; Lester+ 2021] 6

Research question: are there any meaningful discrete (textual) interpretations to continuous prompts? Opposite: how unfaithful can their interpretation be to what they do? Something related to sentiment analysis? 0.9 0.1 -2.1 0.0 LM Sentence: That was a great fantasy movie. positive 7

Research question: are there any meaningful discrete (textual) interpretations to continuous prompts? Opposite: how unfaithful can their interpretation be to what they do? any arbitrary text: Flip the sentiment of the sentence Proj(.) 0.9 0.1 -2.1 0.0 LM Sentence: That was a great fantasy movie. positive 8

Waywardness hypothesis (informal): One can find accurate continuous prompts such that they can be projected to any arbitrary text. Opposite: how unfaithful can their interpretation be to what they do? any arbitrary text: Flip the sentiment of the sentence Proj(.) 0.9 0.1 -2.1 0.0 LM Sentence: That was a great fantasy movie. positive 9

Waywardness hypothesis (informal): One can find accurate continuous prompts such that they can be projected to any arbitrary text. any arbitrary text: Flip the sentiment of the sentence Proj(.) 0.9 0.1 -2.1 0.0 LM Sentence: That was a great fantasy movie. positive 10

?: optimized for the task LM Sentence: That was a great fantasy movie. positive 11

definition of another task: Write down the conclusion you can reach by combining the given Fact 1 and Fact 2. ?: optimized for the task + project to a given text ? : optimized for the task LM Sentence: That was a great fantasy movie. positive 12

definition of another task: nearest-neighbor mapping of continuous prompt onto the word embeddings Write down the conclusion you can reach by combining the given Fact 1 and Fact 2. ?: optimized for the task + project to a given text random sentence from web: int clamp(int val, int min_val) { return std::max(min_val, val); } ? : optimized for the task LM Sentence: That was a great fantasy movie. positive 13

EXPECTATION ?: optimized for the task + project to a given text ? ? ? : optimized for the task accuracy LM Sentence: That was a great fantasy movie. positive 14

EXPECTATION continuous prompts that project to any given text with tiny drop in task accuracy! ? ? accuracy ?: optimized for the task + project to a given text REALITY ? 91.8 ~0.6% ? ? : optimized for the task accuracy 92.4 100 40 60 80 accuracy LM Sentence: That was a great fantasy movie. positive 15

60 target prompts 5 benchmarks x accuracy accuracy accuracy 70 80 90 100 40 60 80 70 80 90 100 91.9 87.3 49.3 ? ? ? ~0.8% ~0.5% ~1.0% 88.1 92.4 ? ? ? 50.3 sentiment classification (SST-5) sentiment classification (SST-2) topic classification (AGNews) accuracy accuracy 70 80 90 100 70 80 90 100 86.0 89.2 ? ? ~2.0% ~1.3% 90.5 ? ? 88.0 subjectivity classification (Subj) question type classification (TREC) 16

Continuous prompts on larger LMs show more wayward behavior! 17

Making Sense of Waywardness (1) The mapping between continuous and discrete space is not one-to-one. 18

Making Sense of Waywardness (1) The mapping between continuous and discrete space is not one-to-one. It is true for many choices of Proj(.) (2) Deep models give a lot of expressivity power to the earlier layers. [Telgarsky 2016; Raghu+ 2017] ? 19

Implications of Waywardness 20

Implications of Waywardness 1. Faithful interpretation of continuous prompts is difficult. Something related to sentiment analysis? continuous prompts: 0.9 0.1 -2.1 0.0 unclear how to interpret, but easy to optimize LM Sentence: That was a great fantasy movie. positive 21

Implications of Waywardness 1. Faithful interpretation of continuous prompts is difficult. 2. Harms of [mis]interpreting continuous prompts. continuous prompt benign projection Proj(.) Rank the candidates ignoring their race or gender. malicious behavior LM < < 22

Implications of Waywardness 1. Faithful interpretation of continuous prompts is difficult. 2. Harms of [mis]interpreting continuous prompts. 3. Difficulty of optimizing discrete prompts. 4. Sufficiency of task-independent initialization for continuous prompt tuning. 23

Summary Waywardness hypothesis (informal): One can find accurate continuous prompts such that they can be projected to any arbitrary text. We provided empirical evidence and intuitions for this hypothesis. Concluded with implications of this hypothesis. We need algorithmic or architectural innovations for automatic discovery prompts that are faithful to what they solve. 24