Directional Stimulus Prompting

Directional Stimulus Prompting

方向性刺激提示

Li et al., (2023) (opens in a new tab) proposes a new prompting technique to better guide the LLM in generating the desired summary.

Li等人,(2023) (opens in a new tab) 提出了一種新的提示技術,以更好地指導LLM產生所需的摘要。

A tuneable policy LM is trained to generate the stimulus/hint. Seeing more use of RL to optimize LLMs.

一種可調的策略語言模型被訓練用於產生刺激/提示。看到越來越多使用強化學習來優化低級語言模型。

The figure below shows how Directional Stimulus Prompting compares with standard prompting. The policy LM can be small and optimized to generate the hints that guide a black-box frozen LLM.

下面的圖表顯示了定向刺激提示與標準提示的比較。政策LM可以很小,並且可以優化以產生提示,以指導黑盒子凍結LLM。

DSP

Image Source: Li et al., (2023) (opens in a new tab)

DSP

圖片來源:Li et al.,(2023) (opens in a new tab)

Full example coming soon!

完整的範例即將推出!