Home / Research Articles Hub / Pre-train, Prompt, and Predict: A Systematic Surve...
🤖 Artificial Intelligence OpenAlex

Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing

📅 Published: September 14, 2022 👤 Pengfei Liu, Weizhe Yuan, Jinlan Fu et al. 📖 ACM Computing Surveys 📊 3,594 citations
AI-Generated Summary

This article surveys and organizes research works in a new paradigm in natural language processing, which we dub “prompt-based learning.” Unlike traditional supervised learning, which trains a model to take in an input x and predict an output y as P ( y|x ), prompt-based learning is based on language models that model the probability of text directly. In this article, we introduce the basics of this promising paradigm, describe a unified set of mathematical notations that can cover a wide variet...

⚡ This is an original paraphrased summary — not copied from the abstract. Full paper available at the source link below.

Key Findings
  • 1 To use these models to perform prediction tasks, the original input x is modified using a template into a textual string prompt x′ that has some unfilled slots, and then the language model is used to probabilistically fill the unfilled information to obtain a final string x̂ , from which the final output y can be derived.
  • 2 This framework is powerful and attractive for a number of reasons: It allows the language model to be pre-trained on massive amounts of raw text, and by defining a new prompting function the model is able to perform few-shot or even zero-shot learning, adapting to new scenarios with few or no labeled data.
  • 3 In this article, we introduce the basics of this promising paradigm, describe a unified set of mathematical notations that can cover a wide variety of existing work, and organize existing work along several dimensions, e.g., the choice of pre-trained language models, prompts, and tuning strategies.
Why It Matters

This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.

This summary is based on publicly available metadata and abstract. For the full research paper, visit the original source:

Read Full Paper at OpenAlex
More Artificial Intelligence Papers ← Back to Hub 📚 Learning Hub
Article Details
Source OpenAlex
Category 🤖 Artificial Intelligence
Published Sep 14, 2022
Journal ACM Computing Surveys
DOI 10.1145/3560815
Citations 3,594
Authors Pengfei Liu, Weizhe Yuan, Jinlan Fu, Zhengbao Jiang, Hiroaki Hayashi