How many examples should I use for few-shot prompting?

Most tasks work well with 3 to 5 examples. This provides enough variety to establish patterns without consuming too much of your context window. Start with 3 examples and add more only if you're seeing inconsistent results.

What's the difference between few-shot prompting and fine-tuning?

Few-shot prompting provides examples within the prompt itself, and the learning is temporary. Fine-tuning actually updates the model's parameters based on training data, creating permanent changes. Few-shot is faster and easier but less powerful for specialized tasks. Fine-tuning requires more resources but can produce better results for specific domains.

When should I use one-shot instead of few-shot prompting?

Use one-shot when zero-shot is close to working but needs slight clarification, when you don't have time to craft multiple examples, or when a single demonstration is sufficient to convey your intent. It's a quick middle ground before committing to full few-shot prompting.

Can I combine few-shot prompting with chain-of-thought?

Yes, and it's often very effective. Instead of just showing input-output pairs, you show the reasoning process that leads to each output. This teaches the model not just what answer to produce, but how to think through problems step by step.

Zero-Shot vs Few-Shot Prompting: 2026 Guide

Q: What is zero-shot prompting?

Zero-shot prompting is when you give an AI model a task without providing any examples. The model relies entirely on its pre-trained knowledge to generate a response. It works best for common tasks like basic translation, simple questions, or straightforward summarization.

You've probably typed a prompt into ChatGPT or Claude and gotten a response that was close but not quite right. Maybe the format was off. Maybe the tone didn't match what you needed. Or maybe the AI just didn't understand the specific type of output you were looking for.

That's where zero-shot prompting, one-shot prompting, and few-shot prompting come into play. These techniques control how many examples you give the AI before asking it to complete a task, and they can dramatically change the quality of what you get back.

The core idea is straightforward: the more context you provide, the better the AI can understand exactly what you want. But there's more nuance here than just "more examples = better results." Each approach has trade-offs, and picking the right one depends on your task, your time, and how much control you need over the output.

If you're working on understanding prompt engineering basics, these shot-based techniques are foundational skills that will improve nearly everything you do with AI.

What Is Zero-Shot Prompting?

Zero-shot prompting means asking an AI to complete a task without giving it any examples. You describe what you want, and the model uses its pre-trained knowledge to figure out the rest.

Here's what a zero-shot prompt looks like:

Prompt:
Classify this review as positive, negative, or neutral: "The battery life on this phone is surprisingly good, but the camera quality disappointed me."

Response:
Neutral

The AI has never seen this specific review before, but it understands what sentiment classification means from its training data. It applies that understanding directly to your input.

Zero-shot prompting works well for:

Common tasks the model has seen many times during training (translation, summarization, sentiment analysis)
General knowledge questions
Simple instructions that don't require specific formatting
Quick queries where speed matters more than precision

The limitation? Zero-shot prompting relies entirely on what the model already knows. If your task is unusual, domain-specific, or requires a particular output structure, the AI might not deliver what you need.

What Is One-Shot Prompting?

One-shot prompting (sometimes called one-shot learning in the prompt context) gives the AI exactly one example before presenting the actual task. It's a middle ground that provides just enough context without the overhead of multiple demonstrations.

Prompt:
Classify reviews as positive, negative, or neutral.

Review: "This laptop is blazing fast and handles everything I throw at it."
Classification: Positive

Review: "The battery life on this phone is surprisingly good, but the camera quality disappointed me."
Classification:

Response:
Neutral

That single example does two things. First, it shows the model the exact format you want (Review followed by Classification). Second, it gives a concrete demonstration of how you're applying the labels.

One-shot prompting is useful when:

Zero-shot produces inconsistent formatting
You need to clarify an ambiguous task
You have limited time to craft multiple examples
The task is straightforward but benefits from one concrete demonstration

The drawback is that a single example can only show so much. If your task has multiple variations or edge cases, one example won't cover them all.

What Is Few-Shot Prompting?

Few-shot prompting provides the AI with multiple examples, typically between 2 and 5, though you can go higher if your tokens and context window limits allow it. These few-shot examples teach the model patterns it can apply to new inputs.

Prompt:
Classify reviews as positive, negative, or neutral.

Review: "This laptop is blazing fast and handles everything I throw at it."
Classification: Positive

Review: "Terrible customer service. Waited 3 hours on hold."
Classification: Negative

Review: "It works as expected. Nothing special."
Classification: Neutral

Review: "The battery life on this phone is surprisingly good, but the camera quality disappointed me."
Classification:

Response:
Neutral (or could be classified as mixed/positive depending on the model's interpretation of the examples)

With three examples covering positive, negative, and neutral cases, the model has a much clearer picture of your classification criteria. It can recognize patterns and apply them consistently.

Few-shot prompting excels at:

Complex classification tasks with multiple categories
Domain-specific work where general knowledge isn't enough
Tasks requiring precise output formatting (JSON, specific templates)
Situations where consistency across many inputs matters
Content generation where you need to match a particular style or tone

This is where in-context learning really shines. The model isn't being retrained. It's learning from the examples you provide within the prompt itself, adapting its behavior without any parameter updates.

Understanding In-Context Learning

In-context learning (ICL) is the mechanism that makes shot-based prompting work. When you understand how LLMs learn in-context, you realize that the model isn't actually "learning" in the traditional machine learning sense. It's not updating its weights or storing new information permanently.

Instead, ICL works through pattern recognition. Large language models are trained on massive amounts of text that includes countless examples of tasks being demonstrated and then completed. When you provide examples in your prompt, the model recognizes this pattern and continues it.

Think of it like this: if you show someone three examples of how you want a table formatted, then hand them new data, they'll format it the same way. The AI does something similar. It identifies the pattern from your examples and applies that pattern to the new input.

Key insights about in-context learning:

The learning is temporary. Once the conversation ends or the context window clears, the "learning" disappears
Example quality matters more than quantity. Three well-chosen examples often outperform ten mediocre ones
Format consistency in your examples helps the model recognize patterns more easily
The order of examples can affect results, though this varies by model and task

This is fundamentally different from pre-training versus post-training methods like fine-tuning, where the model's weights are actually updated based on new data.

Zero-Shot vs Few-Shot: When to Use Each

The choice between zero-shot and few-shot prompting isn't about which is "better." It's about matching the technique to your specific situation.

Use zero-shot prompting when:

The task is common and well-understood (basic translation, simple summarization)
Speed matters more than perfect accuracy
You're doing exploratory work and don't know exactly what you need yet
The model already handles similar tasks well out of the box
You want to minimize token usage and costs

Use few-shot prompting when:

Zero-shot produces inconsistent or incorrect results
You need precise control over output format or structure
The task involves domain-specific terminology or concepts
You're handling edge cases that require explicit demonstration
Consistency across multiple inputs is critical

Use one-shot as a bridge:

When zero-shot is almost working but needs slight guidance
When you don't have time to craft multiple examples
When one clear demonstration is enough to convey your intent

A practical approach: start with zero-shot. If the results aren't what you need, add one example. Still not working? Add two or three more. This iterative process helps you find the minimum number of examples required for your specific task.

Real Examples of Few-Shot Prompting

Let's look at some concrete few-shot examples across different use cases.

Example 1: Sentiment Analysis with Nuance

Standard sentiment analysis often misses nuance. Zero-shot might classify "I don't hate it" as negative because of the word "hate." Few-shot can fix this.

Prompt:
Classify the sentiment as positive, negative, or neutral.

Text: "I don't hate this restaurant."
Sentiment: Positive

Text: "The ending was terrible, but in the best way possible."
Sentiment: Positive

Text: "It's not the worst I've seen."
Sentiment: Neutral

Text: "I wouldn't say I dislike working from home."
Sentiment:

By showing examples with negation and complex phrasing, you teach the model to handle similar constructions correctly.

Example 2: Code Documentation

If you're using AI writing and content tools or code assistants, few-shot prompting can enforce documentation standards.

Prompt:
Write a docstring for each function following this format.

Function: def add(a, b): return a + b
Docstring: """Adds two numbers and returns the sum. Args: a (int): First number. b (int): Second number. Returns: int: Sum of a and b."""

Function: def multiply(x, y): return x * y
Docstring: """Multiplies two numbers and returns the product. Args: x (int): First multiplier. y (int): Second multiplier. Returns: int: Product of x and y."""

Function: def get_user_age(user_id): return database.query(user_id).age
Docstring:

Example 3: Data Extraction

Pulling structured data from unstructured text is a perfect few-shot use case.

Prompt:
Extract the name, company, and role from each sentence.

"Sarah joined Microsoft as a product manager."
Name: Sarah | Company: Microsoft | Role: Product Manager

"After 5 years at Tesla, James left to pursue consulting."
Name: James | Company: Tesla | Role: Not specified (former employee)

"Dr. Chen leads the AI research team at DeepMind."
Name: Dr. Chen | Company: DeepMind | Role: AI Research Team Lead

"Marcus was recently promoted to VP of Engineering at Stripe."

The examples show exactly how to format output and handle variations in how information is presented.

How Many Examples Should You Use?

Research and practical experience suggest that 3 to 5 examples hit the sweet spot for most tasks. Here's why:

2 examples establish a pattern but may not cover enough variation
3 to 5 examples typically provide enough diversity without overwhelming the context
More than 5 examples often show diminishing returns and eat into your context window

However, there are exceptions. Complex tasks with many categories might benefit from more examples. Tasks with a very specific format might need only 2. And recent research suggests that for modern reasoning-focused models, sometimes zero-shot with explicit instructions outperforms few-shot, especially for mathematical and logical problems.

The key factors to consider:

Task complexity: More complex tasks generally benefit from more examples
Output variation: If you need many different output types, show more variety
Context window size: Larger context windows give you more room for examples
Cost considerations: More examples mean more tokens, which can add up

When mastering prompt engineering techniques, learning to calibrate example quantity is a skill that improves with practice.

Combining Few-Shot with Chain-of-Thought

Few-shot prompting becomes even more powerful when combined with chain-of-thought for better reasoning. Instead of just showing input-output pairs, you show the reasoning process that leads to the output.

Standard few-shot:
Q: If there are 3 cars in a parking lot and 2 more arrive, how many cars are there?
A: 5

Q: A store has 15 apples. If they sell 7 and receive a shipment of 12, how many do they have?
A:

Few-shot with chain-of-thought:
Q: If there are 3 cars in a parking lot and 2 more arrive, how many cars are there?
A: Starting with 3 cars. 2 more arrive. 3 + 2 = 5 cars total.

Q: A store has 15 apples. If they sell 7 and receive a shipment of 12, how many do they have?
A:

The second approach teaches the model to show its work, making reasoning more transparent and often more accurate for multi-step problems.

For even more complex reasoning scenarios, techniques like tree-of-thought advanced reasoning build on these foundations to explore multiple solution paths.

Common Mistakes to Avoid

When implementing shot-based prompting, watch out for these pitfalls:

Inconsistent formatting across examples
If your examples use different formats, the model won't know which pattern to follow. Keep structure identical across all demonstrations.

Examples that are too similar
Showing five examples of positive sentiment doesn't help the model understand negative cases. Cover the range of possibilities you expect to encounter.

Overly complex examples
Long, convoluted examples can confuse the model. Keep demonstrations clear and focused on the specific pattern you're teaching.

Ignoring the order
Example order can influence results. Generally, put your most representative examples first and save edge cases for later in the sequence.

Too many tokens on examples, too few on the actual task
Balance matters. If your examples consume most of your context window, you won't have room for the actual content you're processing.

Shot-Based Prompting in Production

When you move from experimentation to production, shot-based prompting requires additional considerations.

Consistency: The same prompt should produce consistent results. Test extensively before deploying.

Edge cases: Your examples need to cover the weird cases users will inevitably throw at your system.

Cost management: Few-shot prompting uses more tokens per request. Calculate whether the accuracy improvement justifies the cost increase.

Latency: More examples mean longer prompts, which can increase response time. For real-time applications, this matters.

Maintenance: When your requirements change, you'll need to update examples. Build systems that make this easy.

Understanding how system prompts shape responses helps you decide what belongs in a system prompt versus what should be few-shot examples within user prompts.

Ready to find the right AI tool for your prompting needs? Browse our AI tools directory to explore options that fit your workflow, from chatbots to specialized writing assistants.

When Shot-Based Prompting Isn't Enough

Sometimes, no amount of examples will get you where you need to go. Signs that you might need a different approach:

Few-shot produces wildly inconsistent results even with good examples
The task requires knowledge the model simply doesn't have
You need guaranteed accuracy that prompting alone can't provide
Token costs for extensive examples become prohibitive

In these cases, consider:

Fine-tuning: Training a model specifically on your task
RAG (Retrieval-Augmented Generation): Providing relevant documents at query time
Agentic workflows: Chaining multiple prompts together for complex multi-step tasks
Hybrid approaches: Combining prompting with other techniques

Shot-based prompting is powerful, but it's one tool in a larger toolkit.

What's Next for Shot-Based Prompting

As models get more capable, the dynamics of shot-based prompting continue to evolve. Recent research suggests that the most advanced models sometimes perform better with zero-shot chain-of-thought than traditional few-shot approaches, particularly for reasoning tasks.

This doesn't make few-shot obsolete. It remains valuable for:

Format alignment (making outputs match specific templates)
Style matching (getting the right tone and voice)
Domain adaptation (teaching models about specialized fields)
Edge case handling (showing how to deal with unusual inputs)

The trend seems to be toward models that need fewer examples for reasoning but still benefit from demonstrations for formatting and style. The best practitioners will stay flexible, testing both approaches for their specific use cases.

Conclusion

Zero-shot, one-shot, and few-shot prompting are foundational techniques that give you control over how AI models respond to your requests. Zero-shot works when tasks are straightforward and the model's default behavior is close to what you need. One-shot adds just enough context to clarify ambiguous instructions. Few-shot provides the pattern recognition power to handle complex, specialized, or format-specific tasks.

The practical approach: start simple, add examples when needed, and always test with real data. In-context learning makes these techniques possible, allowing models to adapt their behavior based on demonstrations without any retraining.

Whether you're classifying sentiment, extracting data, generating content, or building AI-powered applications, understanding when and how to use each prompting approach will make your results more accurate, more consistent, and more useful.

Zero-shot vs One-shot vs Few-shot Prompting Explained

Key takeaways