What is the difference between 'interesting' and 'I would buy it' in research?

In VAD terms, 'interesting' is moderate positive valence with low to moderate arousal — a mild positive response with no urgency. 'I would buy it' is high positive valence with high arousal and high dominance — a strong positive response with a sense of wanting and agency. Traditional concept testing cannot distinguish these two states. Emotional AI captures both in real time from the involuntary facial response at first exposure.

How long does emotional AI concept testing take?

A standard concept validation study — including research design, participant recruitment, sessions, analysis and reporting — typically completes within 10–15 working days from agreement. Where existing participant panels are available, this can reduce to 7–10 days. A proof-of-concept applying EchoDepth to an existing concept dataset or stimulus set can complete within 5 working days of data receipt.

Concept testing & NPD

How to validate a product concept before you launch — without trusting the focus group.

Traditional concept testing produces approval. It cannot produce purchase signal. The two are not the same — and the gap between them is where most new product launches fail. Here is how emotional AI closes that gap, and how to get a genuine answer in days rather than weeks.

Published April 2026 · By Jonathan Prescott, Cavefish Ltd · Part of the EchoDepth Insights series

The context: Approximately 80–90% of new consumer product launches fail within 12 months, despite positive preceding research (Nielsen, Cambridge Innovation Advisors). The majority of these failures were preceded by concept test scores that predicted success. Something is systematically wrong with how we validate ideas before committing to launch.

The problem: concept testing measures liking, not wanting

Standard concept testing — whether run as a focus group, an online survey, or a monadic test — asks participants to evaluate a concept and indicate how much they like it, how interested they are, or how likely they are to purchase. The responses look like purchase signal. They are not.

What concept testing actually measures is a combination of social approval, intellectual interest, and hypothetical purchase intent. All three of these are different from genuine desire. Genuine desire is a physiological state: elevated arousal, strong positive valence, a sense of agency and wanting. You cannot reliably access it by asking someone to predict what they would do.

Research consistently shows that purchase intention scores from concept tests are poor predictors of actual purchase behaviour. The correlation between "top-2-box purchase intent" and in-market conversion is low enough that experienced researchers treat concept scores as directional indicators at best. The reason is not that the researchers are doing something wrong. It is that the methodology cannot measure what it needs to measure.

The "interesting but wouldn't buy" problem

There is a specific failure mode that FMCG, innovation and NPD teams encounter repeatedly. A concept goes through research. It scores well — participants describe it as interesting, different, something they'd try. It passes the internal gate. It launches. It fails.

In VAD terms, "interesting but wouldn't buy" has a recognisable signature. Moderate positive valence: the concept is received positively. Low to moderate arousal: there is no urgency, no pull, no wanting. Low dominance: the participant does not feel a strong sense of agency toward the concept. This emotional pattern — mild positive interest without genuine desire — produces exactly the kind of positive concept test scores that lead to failed launches.

Genuine purchase desire has a different signature. High positive valence. High arousal — the response is immediate, strong, and physiologically activated. High dominance — the participant feels drawn toward the concept rather than passively interested in it. This is what EchoDepth captures from the involuntary facial response in the 200–500ms window after first exposure to a concept.

How to validate a concept with emotional AI

The process is straightforward. It does not require new technology infrastructure, specialist facilities, or an extended research programme. It works alongside your existing concept testing methodology, or as a standalone validation step.

Define the research question

Agree the specific decision this validation needs to support — which concept to progress, which target segment to prioritise, or which of several executions has the strongest emotional pull. The tighter the question, the faster and cheaper the research.

Prepare stimulus materials

Concept boards, packaging designs, product descriptions, advertising storyboards, or rendered product visuals. EchoDepth evaluates any stimulus presented on screen — no physical prototypes needed.

Capture individual emotional response

Each participant views the concepts individually, via a browser link on their own device. No facility, no group, no facilitator. EchoDepth scores 44 facial Action Units per frame in real time — capturing the genuine emotional response at first exposure, before social moderation or verbal rationalisation has occurred. 20–30 participants per segment is typically sufficient for directional ranking.

Receive structured concept ranking

Within five working days: VAD scores per concept, emotional ranking across the set, identification of which concept produced genuine desire vs which produced "interesting," and specific recommendations about which to progress and why — data to information to knowledge the team can act on.

What you get that traditional concept testing cannot provide

Genuine first-exposure signal

The emotional response captured in the 200–500ms after concept exposure — before any verbal processing, social adjustment, or rationalisation has occurred. This is the closest thing to a prediction of genuine market desire available from any research method.

Concept ranking by real desire

Which concept produced high-valence, high-arousal response — genuine wanting — vs which produced moderate positive interest. This distinction is invisible to any survey-based concept test and is the primary commercial differentiator.

Speed

A complete concept validation study — design, recruitment, sessions, analysis, report — in 10–15 working days. A proof-of-concept on existing stimulus materials in 5 working days. No facility booking, no moderator scheduling, no travel.

Actionable output

Not raw data to interpret — a structured report with specific, prioritised recommendations. Which concept to progress, which to refine, which to discontinue. Data to information to knowledge the team can act on before the next gate.

Client perspective

"Using EchoDepth from Cavefish allows us to validate ideas quickly, minimising the risk of launching a product or idea."

Gethin Thomas

CEO, Iterate

Common questions

How do you validate a product concept quickly?

Emotional AI concept validation captures genuine first-exposure emotional response to product concepts in real time — individually, remotely, without focus groups. Participants view concepts via a browser link; EchoDepth scores their facial response per frame. A structured concept ranking report is delivered within five working days. Total study time from agreement to report is typically 10–15 working days.

Why does traditional concept testing fail to predict launch success?

Traditional concept testing produces approval ratings and hypothetical purchase intent, not genuine purchase signal. Peer pressure, social desirability bias, and the fundamental difficulty of predicting one's own behaviour all distort responses. The result: concepts test well but fail at launch because the method measured liking, not wanting. Approximately 80–90% of FMCG new product launches fail despite positive preceding research (Nielsen).

How many participants do I need?

For directional concept ranking across 3–5 concepts within a single target segment, 20–30 participants produces reliable findings. For studies requiring significance across multiple segments or geographies, larger samples are agreed at the research design stage. Because the methodology is individual and remote, recruiting across multiple geographies simultaneously is straightforward.

What is the "interesting but wouldn't buy" gap?

The "interesting but wouldn't buy" gap is the most common failure mode in concept research. It describes concepts that receive positive responses in research — participants describe them as interesting, different, worth trying — but fail at launch because the research cannot distinguish mild positive interest from genuine purchase desire. In VAD terms, "interesting" is moderate valence with low arousal; genuine desire is high valence with high arousal. EchoDepth captures this distinction from the involuntary facial response at first exposure.

Can emotional AI work alongside traditional concept testing?

Yes. EchoDepth runs in parallel with existing concept test methodology — adding the emotional response layer to a qual debrief, a monadic survey, or any other concept evaluation approach. The combination of stated response (what participants say) and felt response (what EchoDepth measures) is significantly more predictive than either alone, and the discrepancies between the two layers are often the most commercially valuable finding.

Validate your next concept before you commit to launch.

Send us the concept stimulus. We will show you which has genuine desire behind it — and which is just interesting — within five working days.

Book a discovery call FMCG research

How to validate a product concept before you launch — without trusting the focus group.

The problem: concept testing measures liking, not wanting

The "interesting but wouldn't buy" problem

How to validate a concept with emotional AI

What you get that traditional concept testing cannot provide

Common questions

FMCG consumer insights software

Why focus groups fail to predict market success

Pricing & engagement models

Validate your next concept before you commit to launch.