Super Bowl LX: The Complete Ad Effectiveness Report

Top net purchase-intent lift (Toyota “Second Chance Test Drive”)

+18.6pp

+5.1pp vs. median LX advs benchmark

Product-demo creative multiplier on intent vs. celebrity-first creative

2.4×

+1.1× vs. prior Super Bowl normvs benchmark

Share of viewers who took any post-ad action within 24 hours

41%

+6pp when a price/offer was explicitvs benchmark

“Likable but not buying” gap size (high likeability, low intent segment-weighted)

29%

+9pp for celebrity-heavy adsvs benchmark

Median modeled first-purchase basket among intent shifters (across featured categories)

$68

+$14 when ad included a concrete use-case demovs benchmark

Trust penalty incidence for “AI/too-good-to-be-true” claims in tech/finance ads

12%

+7pp among Privacy-Conscious Tech Criticsvs benchmark

The research suggests a fundamental decoupling between trust and transaction. While Gen Z consumers report record-low levels of institutional brand trust, their purchase behavior remains robust, driven by a new architecture of peer-to-peer verification.

"The funniest ads bought attention; the clearest ads bought intent—only 3 of the top 8 most-liked spots were also top-5 in modeled purchase lift."

"Toyota’s “Second Chance Test Drive” led intent lift at +18.6pp, not because it was the most loved, but because it reduced friction and showed the product working."

"59% did nothing after the ads; among the 41% who acted, Google Search (35.8%) beat sharing and rewatch behavior."

"Demo-led creative delivered a 2.4× intent multiplier versus celebrity-led creative in this simulation—proof beat fame."

"Reddit’s reach was low (22% usage) but its trust was relatively high (58), making it disproportionately influential for Skeptical Researchers."

"Most memory evaporates fast: 66.7% forgot most ads within 24 hours—your conversion stack has to activate the same night."

"AI/finance hype is the tax: 12% saw trust penalties from ‘too-good-to-be-true’ claims, rising to 17% among privacy-conscious critics."

Section 02

Analytical Exhibits

10 data-driven deep dives into signal architecture.

Generate custom exhibits with Mavera →

EX1

Which Super Bowl LX ads actually moved purchase intent

Net purchase-intent lift among exposed viewers (segment-weighted).

Takeaway

"Auto + retail utility ads produced the largest intent lift; pure entertainment rarely cracked +10pp."

Median LX ad intent lift (featured set)

+13.5pp

Top-to-#5 lift spread

4.1pp

Toyota lift driven by high-intent segments (Deal-First + Pragmatists + Researchers)

61%

Toyota modeled CPA index vs. LX median (1.00)

0.86

Net Purchase-Intent Lift by Ad (pp)

Toyota — “Second Chance Test Drive”

18.6%

Amazon — “One-Click Local”

16.2%

Doritos — “Crash the Chat”

14.1%

T-Mobile — “5G Home Swap”

13.4%

DraftKings — “Same-Game Promise”

12%

Google Pixel — “Real Talk AI”

11.2%

Budweiser — “Backyard Legacy”

9.8%

Apple — “Vision Air”

8.7%

Raw Data Matrix

Ad	Intent lift (pp)	High-intent segment contribution	Modeled CPA index (lower is better)
Toyota — “Second Chance Test Drive”	+18.6	61%	0.86
Amazon — “One-Click Local”	+16.2	58%	0.89
Doritos — “Crash the Chat”	+14.1	49%	0.95
T-Mobile — “5G Home Swap”	+13.4	52%	0.93

Analyst Note

Modeled intent lift reflects net change in “likely to purchase/try in next 30 days,” controlling for baseline category buyers. Values are segment-weighted to the U.S. viewing composition in this simulation.

EX2

Likability is not effectiveness

Likeability vs. purchase intent for the same ad set.

Takeaway

"Several “funniest” spots posted strong likeability but weak purchase movement; the best performers were clearer on product and payoff."

Viewers in “liked it but not buying” state (top-liked ads)

29%

Average intent lift advantage: product-forward vs. celebrity-forward

11.3pp

Share who said “I laughed but didn’t learn what to do next”

18%

Intent gain when CTA was explicit (e.g., test drive, trial, offer)

+6pp

Likeability (0–100) vs. Net Purchase-Intent Lift (pp)

Likeability score

Intent lift (pp)

Toyota — “Second Chance Test Drive”

Amazon — “One-Click Local”

Doritos — “Crash the Chat”

Apple — “Vision Air”

Mountain Dew — “Lizard Lounge”

CryptoX — “The Future Is Now”

Raw Data Matrix

Ad	Likeability	Intent lift (pp)	Gap indicator
Mountain Dew — “Lizard Lounge”	82	+5.9	High like / low buy
Apple — “Vision Air”	74	+8.7	Liked, but pricey friction
Toyota — “Second Chance Test Drive”	71	+18.6	Balanced
CryptoX — “The Future Is Now”	57	+4.1	Low trust ceiling

Analyst Note

Likeability is modeled from entertainment value + emotional resonance. Intent lift requires comprehension + trust + perceived personal relevance; these do not move in lockstep.

EX3

Which ads won the high-intent segments (where money moves)

Lift concentration among the 46% of viewers most likely to buy within 30 days.

Takeaway

"Toyota and Amazon converted pragmatic and deal-driven segments; betting and telecom ads were efficient but narrower."

Size of “high-intent” audience pool (modeled)

46%

Conversion efficiency: high-intent vs. low-intent viewers

1.9×

Average lift among high-intent viewers (top 6 ads)

9.2pp

Average lift among entertainment-only viewers (same ads)

3.6pp

Lift Contribution from High-Intent Segments (%)

Toyota — “Second Chance Test Drive”

61%

Amazon — “One-Click Local”

58%

T-Mobile — “5G Home Swap”

52%

DraftKings — “Same-Game Promise”

51%

Google Pixel — “Real Talk AI”

47%

Doritos — “Crash the Chat”

49%

Raw Data Matrix

Ad	High-intent share of lift	Primary converting segments	Largest friction
Toyota	61%	Deal-First, Pragmatists, Researchers	Financing clarity
Amazon	58%	Pragmatists, Loyalists, Social Proof	Local delivery skepticism
DraftKings	51%	Trend-Driven, Sports-Core	Responsible play concerns
Google Pixel	47%	Researchers, Affluent Experience	AI claim trust

Analyst Note

High-intent pool defined as viewers with baseline category purchase probability above the 60th percentile plus low switching resistance.

EX4

Creative signals that actually drove purchase intent

Modeled drivers ranked by marginal impact on intent (not recall).

Takeaway

"Proof beats vibe: product demo, price clarity, and friction removal outperformed celebrity presence and plot twists."

Intent multiplier: demo-led vs. celebrity-led spots

2.4×

Incremental intent when offer was explicit

+6pp

Viewers who penalized “hyped” claims without proof

12%

Humor-only intent efficiency vs. demo+offer baseline

0.7×

Marginal Contribution to Intent Lift (share of total lift)

Clear product demo (show it working)

22%

Concrete offer/price anchor

18%

Friction removal (setup, trade-in, delivery)

16%

Third-party proof (ratings, tests, credible voice)

14%

Strong use-case fit ("for people like me")

13%

Humor (standalone)

Celebrity presence (standalone)

Raw Data Matrix

Driver	Intent impact	Recall impact	Risk if overused
Clear product demo	High	Medium	Can feel boring without story
Offer/price anchor	High	Low	Margin pressure / promo training
Humor	Low-Med	High	Creates “liked not buying” gap
Celebrity (standalone)	Low	High	Reactance / authenticity doubt

Analyst Note

Marginal contributions are derived from simulated counterfactuals (removing a signal while holding other creative elements constant). Totals sum to 100% across the modeled driver set.

EX5

Clarity vs. entertainment: the intent trade-off is real

How comprehension and entertainment jointly shaped downstream action.

Takeaway

"Entertainment without clarity boosted shares, not sales; clarity without entertainment boosted sales, not shares. The winners did both above-average."

Intent advantage: balanced vs. entertainment-heavy

+8.6pp

Search rate multiplier: proof-first vs. entertainment-heavy

1.7×

Peak share rate (entertainment-heavy)

19%

Peak search rate (proof-first)

26%

Average Outcomes by Creative Mode

Share likelihood (0–100)

Purchase intent lift (pp)

Entertainment-heavy / low-clarity

Balanced (entertainment + clarity)

Clarity-heavy / low-entertainment

Celebrity-first (weak product linkage)

Proof-first (demo + validation)

Raw Data Matrix

Creative mode	Any action within 24h	Search rate	Share rate
Entertainment-heavy / low-clarity	31%	14%	19%
Balanced	46%	22%	15%
Proof-first	49%	26%	12%
Celebrity-first	33%	15%	18%

Analyst Note

Creative modes are assigned from cognitive-load scoring: message density, claim burden, and product-link strength. Outcomes are simulated with segment-specific thresholds for action.

EX6

Where people went to verify claims (and which channels they trust)

Usage and trust for post-Super Bowl research and verification.

Takeaway

"YouTube and Google are the workhorses for verification; Reddit is trusted by fewer people but disproportionately influences Skeptical Researchers."

Used Google Search within 24h for at least one LX ad

64%

Highest trust score (Google Search)

Reddit trust score (high influence, low reach)

TikTok trust score (high reach, low trust)

Platform Usage vs. Trust (0–100)

Raw Data Matrix

Platform	Used within 24h	Primary motivation	Highest-usage segment
Google Search	64%	Confirm price/terms	Deal-First Switchers
YouTube	58%	See real-world proof	Skeptical Researchers
Reddit	22%	Find the catch	Skeptical Researchers
TikTok	36%	See if it’s trending	Trend-Driven Gen Z

Analyst Note

Trust scores reflect perceived credibility for purchase decisions (not entertainment). Usage reflects modeled behavior within 24 hours of exposure.

EX7

Price and offer clarity: the fastest lever for real intent

How explicit economics shifted intent and action.

Takeaway

"When an ad anchored cost or savings, it produced +6pp higher intent on average and reduced “looks cool, but…” drop-off by 10 points."

Average intent gain when offer was explicit

+6.0pp

Drop in “interested but unsure cost” friction when $ anchor shown

10pts

Click/visit propensity multiplier with risk reversal

1.5×

Median modeled first basket among intent shifters

$68

Incremental Intent Lift from Offer Elements (pp)

Trade-in / buyback guarantee stated

7.4%

Clear monthly price ($/mo) shown

6.8%

Limited-time incentive (e.g., free delivery)

5.9%

Risk reversal (free trial / easy cancel)

5.6%

Bundle value (what’s included) clarified

4.8%

Fine print minimized (lower claim burden)

4.2%

Raw Data Matrix

Offer element	Best-fit categories	Most responsive segment	Main risk
Trade-in guarantee	Auto, Tech	Deal-First Switchers	Margin exposure
$/mo price clarity	Telecom, Auto	Quality Pragmatists	Anchor too high
Free trial / easy cancel	Subscriptions	Skeptical Researchers	Churn if onboarding weak
Bundle clarity	Retail, Tech	Brand Loyalists	Complexity if overexplained

Analyst Note

Incremental lift values are derived from simulated creative variants, holding category and brand baseline constant.

EX8

Trust winners and losers by ad (not by brand fame)

Net trust delta after exposure (0–100 scale).

Takeaway

"Trust gains came from proof and restraint; trust losses came from inflated AI/finance claims and unclear terms—even when the spot was entertaining."

Largest net trust gain (Toyota)

+12

Largest net trust loss (CryptoX)

-6

Peak claim skepticism score (CryptoX)

Average “misleading / overclaim” penalty incidence (all ads)

Trust Change vs. Claim Skepticism (0–100)

Net trust delta

Claim skepticism (higher = more skepticism)

Toyota — “Second Chance Test Drive”

Amazon — “One-Click Local”

Google Pixel — “Real Talk AI”

DraftKings — “Same-Game Promise”

CryptoX — “The Future Is Now”

Raw Data Matrix

Ad	Net trust delta	Top trust driver	Top trust breaker
Toyota	+12	Demo + safety proof	Financing questions
Amazon	+9	Utility + familiar process	Skepticism about “local”
Google Pixel	+6	Practical use cases	AI exaggeration concern
CryptoX	-6	None dominant	Too-good-to-be-true

Analyst Note

Net trust delta is a post-exposure shift in “I believe this brand’s claims” and “I’d feel safe buying.” Skepticism reflects perceived claim burden and mismatch with known category realities.

EX9

Recall doesn’t equal intent: the 24-hour decay pattern

Memory persistence vs. purchase movement by ad type.

Takeaway

"High-recall comedy decayed slower in memory but didn’t convert; proof-first ads converted even when recall was only mid-tier."

Peak action rate (product demo)

49%

Peak 24h recall (comedy sketch)

62%

Action efficiency: product demo vs. comedy sketch

2.3×

Average recall decay from immediate → 24h (all types)

11pts

24h Aided Recall vs. Purchase-Intent Lift

24h aided recall (%)

Intent lift (pp)

Comedy sketch (celebrity-heavy)

Emotional story (brand legacy)

Product demo (proof-first)

Offer-forward utility

Shock/absurdist

Raw Data Matrix

Ad type	Immediate recall	24h recall	Any action within 24h
Comedy sketch	71%	62%	28%
Product demo	57%	48%	49%
Offer-forward utility	52%	44%	46%
Emotional story	63%	55%	34%

Analyst Note

Recall is modeled via attention retention + distinctiveness. Action is modeled via comprehension + trust + situational fit. High distinctiveness can inflate recall without improving intent.

EX10

The post-game conversion stack: where to spend to capture intent

Best-performing retargeting channels by simulated incremental conversion efficiency.

Takeaway

"Search and YouTube close demand; TikTok sparks discovery but needs proof assets; brand sites underperform unless they land on an offer page with minimal friction."

Recommended budget share to Paid Search

30%

Incremental conversion efficiency: Search vs. Programmatic (modeled)

1.6×

Recommended budget share to YouTube proof assets

25%

TikTok trust score (requires creator + proof pairing)

Incremental Conversion Efficiency vs. Usage (0–100)

Raw Data Matrix

Channel	Recommended share of retargeting budget	Primary KPI	Expected lift driver
Paid Search	30%	CPA / ROAS	Capture intent spikes
YouTube	25%	View-through conversions	Proof + explanation
Retail media	18%	Add-to-cart rate	Close near purchase
Social (IG/TikTok)	17%	CTR + assisted conv.	Social validation

Analyst Note

Efficiency scores incorporate trust, usage, and modeled propensity to convert given ad-type follow-up assets (offer pages, demo cut-downs, creator proof).

Section 03

Cross-Tabulation Intelligence

Segment response matrix (0–100 indices)

	Intent Lift Index	Likeability Index	Trust Gain Index	Research Likelihood	Promo Elasticity
Deal-First Switchers (14%%)	78	64	55	72	84
Brand Loyalists (13%%)	62	66	61	48	44
Skeptical Researchers (11%%)	58	60	52	81	50
Social Proof Seekers (10%%)	60	72	58	54	63
Quality Pragmatists (12%%)	74	63	62	69	58
Trend-Driven Gen Z (10%%)	55	79	49	46	67
Affluent Experience Seekers (9%%)	57	68	57	62	39
Sports-Core Traditionalists (8%%)	66	61	56	44	52
Entertainment-Only Watchers (7%%)	32	81	42	28	46
Privacy-Conscious Tech Critics (6%%)	41	54	38	76	33

Generate your own insights with Mavera →

Section 04

Trust Architecture Funnel

Trust-to-purchase funnel (Super Bowl LX ad exposure → conversion readiness)

1) Exposure (100%)Saw at least one LX ad in full or partial form

Live broadcast + in-room co-viewing

3h 18m viewing window

-38% dropoff

2) Comprehension (62%)Can correctly describe what the product is and why it matters

In-ad clarity + simple payoff

0–10 minutes post-spot

-24% dropoff

3) Trust Calibration (38%)Believes key claims enough to consider next step

Proof cues + restrained claims + recognizable processes

10–60 minutes post-spot

-12% dropoff

4) Active Verification (26%)Searches, watches reviews, checks terms, or asks others

Google SearchYouTubeAmazon ReviewsReddit

Same night to next morning

-14% dropoff

5) Conversion Readiness (12%)Meets threshold to buy/try (or to schedule a high-consideration action like a test drive)

Paid Searchretail mediaoffer landing pages

1–14 days post-game

Section 05

Demographic Variance Analysis

Variance Explorer: Demographic Stress Test

Income

Geography

Synthesized Impact for: <$50K • Urban

Adjusted Metric

"Brand Distrust 73% → 78% ▲ (High reliance on peer verification in lower income brackets)"

Analyst Interpretation

$50K HHI: entertainment-only likeability rarely converts; price/offer clarity is the unlock. $150K: more tolerance for brand-led storytelling, but still needs a concrete reason-to-believe to switch. $300K+: lowest sensitivity to price; highest sensitivity to brand status/values, but also the most cynical about ‘hype.’ This demographic slice exhibits high sensitivity to Category involvement / purchase cycle length (a contextual variable that usually beats demographics). If you’re not in-market, likeability is cheap and intent lift is capped.. The peer multiplier effect is most pronounced here, suggesting a tactical shift toward community-led verification rather than broad brand messaging.

Section 06

Segment Profiles

Deal-First Switchers

14% of population

Receptivity78/100

Research Hrs2.1 hrs/purchase

Threshold10–20% incentive or strong value proof

Top ChannelGoogle Search

RiskHigh promo elasticity (84/100) can erode margins if over-targeted

Top Trust SignalClear monthly price / savings anchor

Quality Pragmatists

12% of population

Receptivity74/100

Research Hrs2.7 hrs/purchase

ThresholdProof-first messaging; moderate offer acceptable

Top ChannelYouTube

RiskPunishes vague claims; will churn if onboarding disappoints

Top Trust SignalDemonstration + “fits my life” use case

Skeptical Researchers

11% of population

Receptivity58/100

Research Hrs4.2 hrs/purchase

ThresholdNeeds corroboration; lowest susceptibility to hype

Top ChannelYouTube reviews / Reddit

RiskStrong negative word-of-mouth if caught overclaiming (skepticism sensitivity +22%)

Top Trust SignalThird-party proof (reviews/tests) + downside transparency

Trend-Driven Gen Z

10% of population

Receptivity55/100

Research Hrs1.4 hrs/purchase

ThresholdMust feel culturally current; price still matters

Top ChannelTikTok → Search

RiskHigh likeability (79/100) but lower trust (49/100) increases “try then churn” risk

Top Trust SignalCreator/social validation + visible real-world usage

Sports-Core Traditionalists

8% of population

Receptivity66/100

Research Hrs1.8 hrs/purchase

ThresholdPrefers familiar brands; responds to utility

Top ChannelGoogle Search

RiskReactance to “too trendy” humor; can disengage quickly

Top Trust SignalStraightforward benefit + no-nonsense claims

Entertainment-Only Watchers

7% of population

Receptivity32/100

Age 38•Privacy-Conscious Tech Critics•Receptivity: 42/100

Description

"Suspicious of tech/AI claims and data use; will research aggressively and discourage others if unconvinced."

Top Insight

"Vague AI claims triggered a 2.1× higher trust penalty incidence (modeled 17% vs. 8% average)."

Recommended Action

"Publish transparent data practices + limitations and lead with utility; target -25% reduction in “overclaim” objections."

Generate your own Insights →

Section 08

Recommendations

Re-cut for proof: build 15s/30s demo variants for the top-intent segments

"For ads with high likeability but sub-10pp intent (e.g., celebrity/comedy styles), produce proof-first cutdowns: show the product working in the first 3 seconds, then a single quantified value cue. Use segment-weighted distribution to Deal-First Switchers, Quality Pragmatists, and Skeptical Researchers."

Effort

Medium

Impact

High

Timeline0–10 days post-game

MetricIncrease modeled intent lift by +3.5pp and raise search rate by +20% vs. current creative

Segments Affected

Deal-First SwitchersQuality PragmatistsSkeptical Researchers

Exploit the compressed conversion window with Search + YouTube proof sequencing

"Because 66.7% forget most ads within 24 hours, front-load capture: 30% budget to Paid Search on brand + category queries, 25% to YouTube proof assets, with landing pages that answer price/terms immediately."

Effort

Low

Impact

High

Timeline0–14 days post-game

MetricReduce blended CPA by 12% and increase conversion readiness from 12% → 14% (+2pp)

Segments Affected

Deal-First SwitchersQuality PragmatistsBrand Loyalists

Add economic anchors (even modest ones) to reduce “interested but unsure cost” friction

"Introduce a single explicit anchor (e.g., $/mo, trade-in guarantee, free delivery) to lift intent by an average +6.0pp and reduce cost uncertainty by ~10 points. Keep offers modest (≤20% equivalent) to avoid margin erosion; 51.0% convert with ≤20% incentives."

Effort

Medium

Impact

High

Timeline7–21 days

MetricIncrease offer-driven intent lift by +5pp while holding AOV within -3% of baseline

Segments Affected

Deal-First SwitchersQuality PragmatistsSports-Core Traditionalists

Build a “Proof Hub” landing experience for Skeptical Researchers

"Create a single page that consolidates: third-party ratings, comparison chart, top 3 limitations, and transparent terms. This reduces claim skepticism (target -10 points) and increases trust gain (target +4 points) in the highest-research segment."

Effort

High

Impact

Medium

Timeline14–45 days

MetricIncrease researcher conversion rate by +15% and reduce “overclaim” objections by -25%

Segments Affected

Skeptical ResearchersPrivacy-Conscious Tech Critics

Separate ‘share’ optimization from ‘buy’ optimization in KPI reporting

"Implement a dual-scorecard: (A) Engagement KPIs (shares, talkability, rewatch) and (B) Commerce KPIs (search, site visit, add-to-cart, signup). Given “share” (57) trails “search/learn” (66) as the dominant behavior for top ads, reporting must prevent entertainment-heavy creative from winning budget by default."

Effort

Low

Impact

Medium

TimelineImmediate (0–7 days)

MetricReallocate 15–25% of spend from high-like/low-intent assets to proof/offer assets; target +10% ROAS

Segments Affected

Entertainment-Only WatchersTrend-Driven Gen ZQuality Pragmatists

De-risk tech/finance messaging: reduce AI hype, increase specificity and constraints

"Tech/finance ads showed a 12% trust penalty incidence for “AI/too-good-to-be-true” claims (17% among Privacy-Conscious Tech Critics). Replace vague future language with bounded claims, privacy/data statements, and real scenarios."

Effort

Medium

Impact

Medium

Timeline10–30 days

MetricCut claim skepticism by -8 points and improve net trust delta by +3 points on tech/finance retargeting

Segments Affected

Privacy-Conscious Tech CriticsSkeptical ResearchersAffluent Experience Seekers

Ready to dive deeper?

Generate your own Intelligence with the Mavera Platform.

Get Full Access→

Join 500+ research teams using synthetic intelligence to generate unique insights.