Blog / Prompt tips / AI Prompts for Market Research: The Work…

AI Prompts for Market Research: The Workflow I Use to Go From "Vibes" to Evidence

A practical prompt workflow for market research: scoping, sourcing, synthesizing, simulating, and stress-testing insights without fooling yourself.

Ilia Ilinskii
Rephrase · Mar 02, 2026

Prompt tips10 min

On this page

Step 1: Start with a research contract, not a question Step 2: Turn "the market" into a set of evidence buckets Step 3: Use the model to update a prior, not to "generate insights"Step 4: Synthesize with decision-aware outputs (not pretty prose)Step 5: Use synthetic personas carefully (good for exploration, bad for proof)Practical example: generating survey questions that don't poison your data Closing thought: prompts don't replace research, they replace blank pages References

Market research used to mean one of two painful modes: either you paid a firm for a glossy deck, or you did the work yourself and drowned in tabs, transcripts, and half-finished spreadsheets.

Now you can ask an LLM for "market research" and get something that sounds like a deck in 30 seconds.

That's the trap.

LLMs are great at producing fluent narratives, but market research is not a narrative task. It's an evidence task. If you don't design prompts that force the model to separate what it knows from what it's making up, you'll get confident fiction. And you'll make decisions on it.

So the way I think about prompting for market research is: don't prompt for "answers." Prompt for a repeatable research system. One that ingests context, produces hypotheses, tags uncertainty, and keeps you honest.

Below is the workflow I use, and the prompt patterns that make it work.

Step 1: Start with a research contract, not a question

The fastest way to waste tokens is to start with "research X market." You'll get generic segmentation and a Porter's Five Forces impression.

Instead, I write a "research contract" prompt that pins down scope, decision, and what counts as evidence. This is basically the same idea as treating a forecast as an update problem: start from a prior, then revise based on evidence, instead of "predicting from scratch." That framing matters a lot when you want calibrated outputs rather than vibes [1].

Here's the prompt I actually use:

You are a market research lead supporting a product decision.

Decision to support:
- Decision: [e.g., "Should we launch an AI meeting notes product for small legal firms?"]
- Time horizon: [e.g., 6 months]
- Geography: [e.g., US + Canada]
- Buyer: [role, budget owner]
- Alternative choices: [do nothing / build feature A / target segment B]

Research outputs I need:
1) A list of hypotheses ranked by decision impact.
2) For each hypothesis: what evidence would confirm vs. disconfirm it.
3) A sourcing plan: what public sources to check first, and what to ask in primary research.
4) A "risk of hallucination" note: where you are most likely to guess.

Rules:
- If you are unsure, say "unknown" and propose how to verify.
- Don't invent statistics. If you use a number, label it as an assumption.
- Use short, skimmable sections.

What I noticed: once you force the model into "decision support," it stops trying to be a Wikipedia page and starts acting like a research operator.

Step 2: Turn "the market" into a set of evidence buckets

Market research is usually a mix of four evidence types: what people say, what they do, what competitors sell, and what the environment makes possible (pricing, regulation, distribution).

I prompt the model to build buckets, then fill each bucket with questions, not conclusions. This keeps your research open-ended longer, which is good. Premature certainty is the enemy.

Given the decision above, create an evidence map with 4 buckets:
A) Customer pain & willingness-to-pay
B) Competitive landscape & substitutes
C) Distribution & acquisition constraints
D) Macro constraints (compliance, procurement, security, switching costs)

For each bucket:
- list 5-8 research questions
- identify which questions can be answered with public data vs. require interviews/surveys
- list "likely pitfalls" (ways an LLM might overgeneralize)

This is also where you start defending against "synthetic respondents" misuse. There's a growing body of work warning that LLMs can be tempting as survey replacements, but that treating simulated responses as interchangeable with human data is not safe for confirmatory claims [3]. In practice: use the model to shape what to ask, not to pretend you've already asked it.

Step 3: Use the model to update a prior, not to "generate insights"

If you're doing market sizing, competitive comparisons, or trend calls, you should think like a forecaster: you have a prior belief, and new evidence should update it. That's not philosophy; it's an operational prompt design that improves calibration.

The mention-markets paper is a nice concrete demonstration: when the prompt explicitly tells the model to treat a baseline probability as a prior and revise it using provided text, calibration improves versus naive prompting [1]. Different domain, same principle.

So I use a "prior → evidence → posterior" template for claims like "Segment X will pay $Y" or "Competitor Z is most at risk."

We are evaluating this hypothesis:

H: "[state hypothesis]"

My current prior belief:
- Prior probability (0-100): [e.g., 55]
- Why: [2-3 bullets, can be rough]

Evidence provided (delimited):
"""
[paste notes, quotes, competitor pages, pricing screenshots, links, etc.]
"""

Task:
1) Extract only the evidence relevant to H (quote it back).
2) List arguments that increase confidence and arguments that decrease confidence.
3) Provide an updated probability (0-100) and explain the update.
4) Flag what additional evidence would move the number the most.

Rules:
- Don't invent new facts.
- If evidence is weak, say so and keep the posterior close to the prior.

That "keep the posterior close to the prior when evidence is weak" line is doing real work. It nudges the model away from overreacting to a single spicy anecdote.

Step 4: Synthesize with decision-aware outputs (not pretty prose)

Most "AI market research" outputs fail because they optimize for readability, not usefulness.

One research thread I like here comes from operations/decision-making: if you use LLMs to generate distributions (e.g., willingness-to-pay samples) the right evaluation is decision quality, not whether the distribution "looks similar" under a distance metric [2]. Translation: your synthesis should be shaped by what decision you'll take next.

So I ask for synthesis in a format that can be acted on: decision options, expected upside, key assumptions, and the minimum viable validation plan.

Create a decision memo draft with these sections:

1) Recommendation: choose one of [Option A/B/C] and state confidence (low/med/high)
2) Decision drivers: the 3 variables that matter most
3) What we know (grounded): only statements supported by the evidence pasted
4) What we assume: list assumptions with expected impact if wrong
5) Fast validation plan: 5 cheapest tests (landing page, cold outreach, pricing interview, etc.)
6) Kill criteria: what results would make us stop

Rules:
- Separate "grounded" vs "assumed" explicitly.
- No generic market fluff.

You're basically building a small "research-to-action" pipeline. If the output can't tell you what to do Monday morning, it's not market research.

Step 5: Use synthetic personas carefully (good for exploration, bad for proof)

People love prompting: "Pretend you're a CFO at a 200-person company. Would you buy this?"

It can be useful. But treat it like improv, not measurement.

The validation paper makes the key point I wish more founders internalized: heuristic prompt-tweaks can make LLM simulations look human-like, but that doesn't give you the statistical guarantees you'd want for confirmatory research [3]. In other words: personas are great for generating objections you forgot, not for estimating demand.

Here's a safer persona prompt:

Act as 6 distinct buyer personas for [product]. Your job is NOT to answer whether you'd buy.
Your job is to generate objections and buying criteria I should test in real interviews.

For each persona:
- context (role, company size, current tool stack)
- top 3 "must-have" requirements
- top 3 objections / reasons to ignore this product
- what proof would change your mind (evidence, security review, ROI calc, reference, etc.)

Rules:
- Avoid invented statistics or "market facts."
- Keep each persona realistic and internally consistent.

Now you're using the model as an adversarial brainstorming partner, not a fake survey panel.

Practical example: generating survey questions that don't poison your data

One of the most concrete "prompt as market research tool" moves is using the LLM to draft survey/interview instruments.

A community example I've seen shared is a "Market Research Question Generator" prompt that forces the model to produce a mix of demographic, behavioral, intent, open-ended, and Likert questions [4]. That structure is useful, because it stops you from writing five versions of the same leading question.

I'd adapt it like this:

You are a survey methodologist helping me design unbiased questions.

Research goal:
- [e.g., "Understand why trial users of our API churn in the first 7 days"]

Audience:
- [e.g., "backend engineers at startups, 5-200 employees"]

Generate:
- 2 screening questions
- 8 core questions split into: behavioral (past), situational (recent), alternatives/substitutes, willingness-to-pay, and open-ended
- 2 attention/quality checks

Rules:
- Avoid leading phrasing and double-barreled questions.
- Each question must state what decision it informs.
- Use simple language.

Then I run a second prompt that red-teams the survey for bias and ambiguity.

Closing thought: prompts don't replace research, they replace blank pages

If you take one thing from this: use prompts to create structure, not certainty.

The best prompting workflows for market research behave like a loop. You define a decision. You state a prior. You gather evidence. You update. You design the next test. Repeat.

If your prompts don't force the model to label assumptions, quote evidence, and admit uncertainty, you're not doing "AI market research." You're doing AI storytelling.

References

Documentation & Research

Forecasting Future Language: Context Design for Mention Markets - arXiv cs.CL
https://arxiv.org/abs/2602.21229
Evaluating LLM-persona Generated Distributions for Decision-making - arXiv cs.LG
https://arxiv.org/abs/2602.06357
This human study did not involve human subjects: Validating LLM simulations as behavioral evidence - arXiv cs.AI
https://arxiv.org/abs/2602.15785

Community Examples

The "Market Research Question Generator" prompt: Instantly creates 5 structured questions for any survey - r/PromptEngineering
https://www.reddit.com/r/PromptEngineering/comments/1qpp77m/the_market_research_question_generator_prompt/

Blog / Prompt tips / AI Prompts for Market Research: The Work…

← All notes

AI Prompts for Market Research: The Workflow I Use to Go From "Vibes" to Evidence

A practical prompt workflow for market research: scoping, sourcing, synthesizing, simulating, and stress-testing insights without fooling yourself.

Ilia Ilinskii
Rephrase · Mar 02, 2026

Prompt tips10 min

On this page

Market research used to mean one of two painful modes: either you paid a firm for a glossy deck, or you did the work yourself and drowned in tabs, transcripts, and half-finished spreadsheets.

Now you can ask an LLM for "market research" and get something that sounds like a deck in 30 seconds.

That's the trap.

Below is the workflow I use, and the prompt patterns that make it work.

Step 1: Start with a research contract, not a question

The fastest way to waste tokens is to start with "research X market." You'll get generic segmentation and a Porter's Five Forces impression.

Here's the prompt I actually use:

You are a market research lead supporting a product decision.

Decision to support:
- Decision: [e.g., "Should we launch an AI meeting notes product for small legal firms?"]
- Time horizon: [e.g., 6 months]
- Geography: [e.g., US + Canada]
- Buyer: [role, budget owner]
- Alternative choices: [do nothing / build feature A / target segment B]

Research outputs I need:
1) A list of hypotheses ranked by decision impact.
2) For each hypothesis: what evidence would confirm vs. disconfirm it.
3) A sourcing plan: what public sources to check first, and what to ask in primary research.
4) A "risk of hallucination" note: where you are most likely to guess.

Rules:
- If you are unsure, say "unknown" and propose how to verify.
- Don't invent statistics. If you use a number, label it as an assumption.
- Use short, skimmable sections.

What I noticed: once you force the model into "decision support," it stops trying to be a Wikipedia page and starts acting like a research operator.

Step 2: Turn "the market" into a set of evidence buckets

Market research is usually a mix of four evidence types: what people say, what they do, what competitors sell, and what the environment makes possible (pricing, regulation, distribution).

I prompt the model to build buckets, then fill each bucket with questions, not conclusions. This keeps your research open-ended longer, which is good. Premature certainty is the enemy.

Given the decision above, create an evidence map with 4 buckets:
A) Customer pain & willingness-to-pay
B) Competitive landscape & substitutes
C) Distribution & acquisition constraints
D) Macro constraints (compliance, procurement, security, switching costs)

For each bucket:
- list 5-8 research questions
- identify which questions can be answered with public data vs. require interviews/surveys
- list "likely pitfalls" (ways an LLM might overgeneralize)

Step 3: Use the model to update a prior, not to "generate insights"

So I use a "prior → evidence → posterior" template for claims like "Segment X will pay $Y" or "Competitor Z is most at risk."

We are evaluating this hypothesis:

H: "[state hypothesis]"

My current prior belief:
- Prior probability (0-100): [e.g., 55]
- Why: [2-3 bullets, can be rough]

Evidence provided (delimited):
"""
[paste notes, quotes, competitor pages, pricing screenshots, links, etc.]
"""

Task:
1) Extract only the evidence relevant to H (quote it back).
2) List arguments that increase confidence and arguments that decrease confidence.
3) Provide an updated probability (0-100) and explain the update.
4) Flag what additional evidence would move the number the most.

Rules:
- Don't invent new facts.
- If evidence is weak, say so and keep the posterior close to the prior.

That "keep the posterior close to the prior when evidence is weak" line is doing real work. It nudges the model away from overreacting to a single spicy anecdote.

Step 4: Synthesize with decision-aware outputs (not pretty prose)

Most "AI market research" outputs fail because they optimize for readability, not usefulness.

So I ask for synthesis in a format that can be acted on: decision options, expected upside, key assumptions, and the minimum viable validation plan.

Create a decision memo draft with these sections:

1) Recommendation: choose one of [Option A/B/C] and state confidence (low/med/high)
2) Decision drivers: the 3 variables that matter most
3) What we know (grounded): only statements supported by the evidence pasted
4) What we assume: list assumptions with expected impact if wrong
5) Fast validation plan: 5 cheapest tests (landing page, cold outreach, pricing interview, etc.)
6) Kill criteria: what results would make us stop

Rules:
- Separate "grounded" vs "assumed" explicitly.
- No generic market fluff.

You're basically building a small "research-to-action" pipeline. If the output can't tell you what to do Monday morning, it's not market research.

Step 5: Use synthetic personas carefully (good for exploration, bad for proof)

People love prompting: "Pretend you're a CFO at a 200-person company. Would you buy this?"

It can be useful. But treat it like improv, not measurement.

Here's a safer persona prompt:

Act as 6 distinct buyer personas for [product]. Your job is NOT to answer whether you'd buy.
Your job is to generate objections and buying criteria I should test in real interviews.

For each persona:
- context (role, company size, current tool stack)
- top 3 "must-have" requirements
- top 3 objections / reasons to ignore this product
- what proof would change your mind (evidence, security review, ROI calc, reference, etc.)

Rules:
- Avoid invented statistics or "market facts."
- Keep each persona realistic and internally consistent.

Now you're using the model as an adversarial brainstorming partner, not a fake survey panel.

Practical example: generating survey questions that don't poison your data

One of the most concrete "prompt as market research tool" moves is using the LLM to draft survey/interview instruments.

I'd adapt it like this:

You are a survey methodologist helping me design unbiased questions.

Research goal:
- [e.g., "Understand why trial users of our API churn in the first 7 days"]

Audience:
- [e.g., "backend engineers at startups, 5-200 employees"]

Generate:
- 2 screening questions
- 8 core questions split into: behavioral (past), situational (recent), alternatives/substitutes, willingness-to-pay, and open-ended
- 2 attention/quality checks

Rules:
- Avoid leading phrasing and double-barreled questions.
- Each question must state what decision it informs.
- Use simple language.

Then I run a second prompt that red-teams the survey for bias and ambiguity.

Closing thought: prompts don't replace research, they replace blank pages

If you take one thing from this: use prompts to create structure, not certainty.

The best prompting workflows for market research behave like a loop. You define a decision. You state a prior. You gather evidence. You update. You design the next test. Repeat.

If your prompts don't force the model to label assumptions, quote evidence, and admit uncertainty, you're not doing "AI market research." You're doing AI storytelling.

References

Documentation & Research

Forecasting Future Language: Context Design for Mention Markets - arXiv cs.CL
https://arxiv.org/abs/2602.21229
Evaluating LLM-persona Generated Distributions for Decision-making - arXiv cs.LG
https://arxiv.org/abs/2602.06357
This human study did not involve human subjects: Validating LLM simulations as behavioral evidence - arXiv cs.AI
https://arxiv.org/abs/2602.15785

Community Examples

The "Market Research Question Generator" prompt: Instantly creates 5 structured questions for any survey - r/PromptEngineering
https://www.reddit.com/r/PromptEngineering/comments/1qpp77m/the_market_research_question_generator_prompt/