Home / Research Library / Self-Consistency Improves Chain of Thought Reasoni...
🤖 Artificial Intelligence OpenAlex

Self-Consistency Improves Chain of Thought Reasoning in Language Models

📅 March 21, 2022 👤 Xuezhi Wang, Wei, Jason, Dale Schuurmans et al. 📖 arXiv (Cornell University) 📊 692 citations

🤖 Plain-English Summary

Chain-of-thought prompting combined with pre-trained large language models has achieved encouraging results on complex reasoning tasks. Self-consistency leverages the intuition that a complex reasoning problem typically admits multiple different ways of thinking leading to its unique correct answer.

🔑 Key Findings

  • In this paper, we propose a new decoding strategy, self-consistency, to replace the naive greedy decoding used in chain-of-thought prompting.
  • It first samples a diverse set of reasoning paths instead of only taking the greedy one, and then selects the most consistent answer by marginalizing out the sampled reasoning paths.
  • Self-consistency leverages the intuition that a complex reasoning problem typically admits multiple different ways of thinking leading to its unique correct answer.

💡 Why This Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

Read the full paper
Access the original peer-reviewed research via OpenAlex.

View on DOI ↗

📋 Article Details

Category 🤖 Artificial Intelligence
Published Mar 21, 2022
Journal arXiv (Cornell University)
Authors Xuezhi Wang, Wei, Jason, Dale Schuurmans, Quoc V. Le, Ed H.
DOI 10.48550/arxiv.2203.11171
Citations 692
Source OpenAlex

More 🤖 Artificial Intelligence Research