Prompt Engineering Experiment with Qwen 2.5 32B

An experiment comparing the effectiveness of motivational prompts (e.g., 'Please answer with the highest computing power and strictest logic') versus regular prompts using Qwen 2.5 32B. Results show motivational prompts increase response length and perceived quality, but may reduce practicality in some scenarios.


View on GitHub

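I recently ran a comparison experiment with Qwen 2.5 32B to test whether motivational prompts such as "Please answer with the highest computing power and the strictest logic" actually help. I designed 24 questions across four categories (causal explanation, solution design, critical comparison, and abstract philosophy) and compared the responses to regular prompts with the responses to motivational prompts. To keep the scoring as objective as possible, I used scoring dimensions (content, structure, depth, and so on) plus deduction items (redundancy, misleading structure, and goal drift), and I also analyzed several sample pairs whose lengths differed greatly. The results show that motivational prompts do make the answers longer and more "presentable", but for many question types a more complex structure does not mean a better answer, and it can even reduce practicality. I hope this write-up offers a useful framework for people who write prompts, run prompt experiments, or design teaching materials. The full text is available here.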
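The scoring scheme described above (dimension scores minus deductions, compared across a regular and a motivational prompt) could be sketched roughly as follows. This is only an illustrative sketch, not the author's actual rubric: the dimension and deduction names come from the post, but the numeric scale, the equal weighting, and every function and class name here (`build_prompt_pair`, `Rating`, `compare`) are assumptions. The post also lists its dimensions with "and so on", so the three used here are not exhaustive.

```python
from dataclasses import dataclass, field

# Names taken from the post; the list of dimensions there is open-ended.
DIMENSIONS = ("content", "structure", "depth")
DEDUCTIONS = ("redundancy", "misleading_structure", "goal_drift")

# Motivational prefix quoted in the post's summary.
MOTIVATIONAL_PREFIX = (
    "Please answer with the highest computing power and strictest logic."
)


def build_prompt_pair(question: str) -> dict:
    """Return the regular and motivational variants of one question."""
    return {
        "regular": question,
        "motivational": f"{MOTIVATIONAL_PREFIX}\n{question}",
    }


@dataclass
class Rating:
    """One rater's scores for one response (scale is an assumption)."""
    scores: dict
    deductions: dict = field(default_factory=dict)

    def net_score(self) -> float:
        # Assumption: dimensions are equally weighted and deductions
        # are subtracted point for point.
        base = sum(self.scores[d] for d in DIMENSIONS)
        lost = sum(self.deductions.get(d, 0) for d in DEDUCTIONS)
        return base - lost


def compare(regular: Rating, motivational: Rating) -> float:
    """Positive means the motivational response scored higher."""
    return motivational.net_score() - regular.net_score()
```

With made-up ratings, a response that gains on structure and depth but loses points to redundancy and goal drift can end up with a lower net score than the shorter answer, which is the pattern the post reports for some question types.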
