Directional Stimulus Prompting
方向性刺激提示
Li et al., (2023) (opens in a new tab) proposes a new prompting technique to better guide the LLM in generating the desired summary.
Li等人,(2023) (opens in a new tab) 提出了一種新的提示技術,以更好地指導LLM產生所需的摘要。
A tuneable policy LM is trained to generate the stimulus/hint. Seeing more use of RL to optimize LLMs.
一種可調的策略語言模型被訓練用於產生刺激/提示。看到越來越多使用強化學習來優化低級語言模型。
The figure below shows how Directional Stimulus Prompting compares with standard prompting. The policy LM can be small and optimized to generate the hints that guide a black-box frozen LLM.
下面的圖表顯示了定向刺激提示與標準提示的比較。政策LM可以很小,並且可以優化以產生提示,以指導黑盒子凍結LLM。
Image Source: Li et al., (2023) (opens in a new tab)
圖片來源:Li et al.,(2023) (opens in a new tab)
Full example coming soon!
完整的範例即將推出!