Lumio Arc — Q2 2026 awareness (UK · Germany · UAE)

Lumio Arc premium wellness wearable: six-week brand awareness flight ahead of Q3 retail. UK is Meta-only; Germany and UAE split Meta + TikTok. Monitoring ties internal briefs to reach, impressions, frequency discipline, CPM vs benchmarks, Meta EARL, TikTok VCR, and GA4 branded search — with UAE cross-platform overlap called out as a known data gap.

Scenario variables

Campaign

Brand: Lumio
Product: Lumio Arc

Run metadata

Flight code: LUM-Q2-2026-AWARENESS-UK-DE-AE
Flight: 28 Apr — 8 Jun 2026
Mid-campaign review: 19 May 2026 (week 3)
KPI tables as of: 19 May 2026 (weeks 1–3 actuals)

Economics

Total budget: £850,000

Policy

Brand safety: ASA / Werberat / NMC + UAE modesty

Inputs — documents

Lumio — Campaign brief

Necessity

Q2 2026 brand awareness launch · UK, Germany, UAE.

Brand overview: Brand: Lumio. Product: Lumio Arc — premium wellness wearable (stress, sleep, recovery via biometrics + AI coaching). Positioning: “Feel better, perform better.” Premium lifestyle tech, professionals 25–40. Tone: calm, confident, aspirational — never clinical or aggressive.
Campaign objective: Drive brand awareness and consideration for Lumio Arc in three priority markets ahead of Q3 2026 retail launch.
Primary KPIs: Unique reach: 5M per market over 6-week flight. Impressions: 20M per market over 6-week flight. Frequency: 3–5 per person over full campaign flight (not per week). Estimated ad recall lift (Meta): minimum +8% lift.
Secondary KPIs: Video completion rate (TikTok): minimum 50%. Branded search lift: +15% by end of week 4. CPM: stay within ±20% of market benchmarks.
Markets & budget %: UK — Tier 1 — 40% — highest brand-affinity potential, competitive. Germany — Tier 1 — 35% — privacy-conscious; TikTok growing fast. UAE — Tier 2 — 25% — affluent early adopters; cross-platform overlap risk.
Platforms: Meta (Instagram + Facebook): UK, Germany, UAE — primary reach & frequency. TikTok: Germany, UAE only — discovery & cultural relevance (UK not on TikTok this flight).
Flight dates: Start: 28 Apr 2026. End: 8 Jun 2026. Mid-campaign review: week 3 — 19 May 2026. Final review: week 6 — 8 Jun 2026.
Budget (GBP): Total: £850,000. UK £340,000 · Germany £297,500 · UAE £212,500.
Target audience: Primary: professionals 25–40, health-conscious, tech-savvy, HH income £50k+. Secondary: fitness enthusiasts & biohackers 28–45. Signals: wellness / fitness / productivity / mindfulness interests; health-tech & lifestyle brands; early adopters (latest phone, wellness apps).
Success definition: By end of week 6, Lumio Arc recognisable to ≥1 in 4 of target audience across all three markets; unaided brand recall up measurably in post-campaign brand lift study.

Lumio — Media plan

Necessity

Weekly budgets · platform allocation · caps · rotation · dayparting · escalation.

Weekly budget (£): W1: UK 45,000 · DE 39,375 · AE 28,125 — launch broad reach W2: UK 55,000 · DE 48,125 · AE 34,375 — scale top ad sets W3: UK 60,000 · DE 52,500 · AE 37,500 — mid-campaign review W4: UK 65,000 · DE 56,875 · AE 40,625 — peak awareness W5: UK 65,000 · DE 56,875 · AE 40,625 — maintain W6: UK 50,000 · DE 43,750 · AE 31,250 — wind down, retarget prep Totals: UK 340,000 · DE 297,500 · AE 212,500
UK — Meta only: Broad awareness 25–40 wellness UK — Video 15s — 50% — freq cap 3 / 7d. Lookalike 1% site visitors — Video 15s — 30% — 3 / 7d. Interest retargeting engaged content — Static/Carousel — 20% — 2 / 7d.
Germany — Meta + TikTok: Meta broad awareness DE — Video 15s — 40% — 2 / 7d. Meta LAL 1% DE — Video 15s — 20% — 2 / 7d. TikTok in-feed 25–35 wellness — Video 9–15s — 30% — platform-managed freq. TikTok TopView reserved — Video 15s — 10% — 1/user/day.
UAE — Meta + TikTok: Meta broad awareness AE — Video 15s — 45% — 3 / 7d. Meta affluent interest — Carousel — 15% — 2 / 7d. TikTok in-feed discovery — Video 9–15s — 40% — platform-managed freq.
Frequency policy: Meta: max 3 impressions / person / 7d across all ad sets combined. TikTok: platform-managed delivery — monitor VCR as fatigue proxy. If frequency > 4.0 any market any 7d window → flag for immediate review. If CPM +20% WoW → flag for budget reallocation review.
Creative rotation: W1–2: Hero video v1 “Morning routine” — all markets. W3: v1 + Static v1 — all. W4: Hero v2 “Performance” — all if v1 fatigue. W5–6: v2 + UGC creator — UK & UAE only.
Dayparting: UK: 7–9am, 12–1pm, 7–10pm. Germany: 6–8am, 12–2pm, 6–9pm (evening targeting limits apply). UAE: 8–10am, 1–3pm, 8–11pm (Ramadan timing rules if applicable).
Escalation (sign-off before exec): Budget reallocation >15% between markets. Creative pause mid-flight. Frequency cap removal. Audience expansion beyond agreed parameters.

Lumio — Brand guidelines

Necessity

Paid media application · identity · messaging · market rules · creative approvals.

Identity: Tagline: “Feel better, perform better.” Personality: calm, intelligent, premium, human — never cold, clinical, or aggressive. Colours: Deep Sage #4A6741 (primary), Warm Cream #F5F0E8 (backgrounds), Soft Gold #C9A96E (accents). Never: bright reds, neons, heavy blacks. Type: clean sans headlines (Neue Haas Grotesk class); body Inter-class. No decorative/script in paid.
Messaging framework: Hero: Lumio Arc helps you understand your body so you can perform at your best. Supporting (priority): (1) Stress tracking that makes sense of your day. (2) Sleep intelligence for how you really live. (3) Recovery insights for when to push vs rest. Tone: trusted health-knowledgeable friend, not a doctor; confident not arrogant; aspirational but grounded; use “you” not “our users”.
Approved claims (examples): “Understand your stress before it overwhelms you.” · “Sleep better, recover faster, perform at your best.” · “Designed for how real people actually live.” · “Your body has answers. Lumio helps you hear them.” Science/biometrics OK if not overly clinical.
Prohibited: No medical claims (treats / diagnoses / prevents). No before/after implying dramatic transformation. No extreme fitness imagery. No pricing in awareness creatives. No competitor comparisons. No distress/pain imagery. No weight-loss claims. Device never on someone who appears unwell.
Format rules (summary): Video 15s awareness: logo first 3s; tagline in final 3s; subtitles all markets; calm ambient music cleared; ratios 9:16 Stories/TikTok, 1:1 feed, 16:9 desktop where used. Static: logo corner; max 20% text overlay; product visible; brand photography only. Carousel: consistent treatment; each frame standalone; max 5 frames awareness.
UK market rules: Standard guidelines. Influencer: #Ad. No health claims beyond general wellbeing.
Germany market rules: Extra data/privacy sensitivity — avoid “data collection” framing in copy. Prefer functional, honest vs hyper-aspirational lifestyle. All copy in German in DE assets. Avoid luxury-showoff settings.
UAE market rules: Modest imagery — no exposed skin beyond arms. No alcohol references. Arabic subtitles required on TikTok. Fri–Sat weekend differs from UK/DE.
Creative change approvals: Minor copy edit → Performance Lead, same day. New creative version → Performance + Brand, 48h. Format change → Performance Lead, same day. New visual treatment → full brand review, 5 business days. Market-specific adaptation → Performance + Local Market Lead, 48h.

Lumio — Creative asset log

Necessity

Q2 2026 live assets · localised DE · week 3 performance notes · deadlines.

Asset register (live): LUM-V01 Morning Routine 15s video 9:16 — UK, DE, UAE — 28 Apr — Meta Stories, TikTok — Active LUM-V02 same 1:1 — UK, DE, UAE — 28 Apr — Meta Feed — Active LUM-V03 16:9 — UK only — 28 Apr — Meta Feed desktop — Active LUM-S01 Product Hero static 1:1 — UK, UAE — 5 May — Meta Feed — Active LUM-S02 Lifestyle static 9:16 — UK — 5 May — Meta Stories — Active LUM-C01 3-frame carousel 1:1 — UK — 12 May — Meta Feed — Active
Scheduled / pending: LUM-V04 / LUM-V05 Performance v2 15s — scheduled 19 May — Meta + TikTok (formats per sheet). LUM-UGC01 UK creator 9:16 — scheduled 26 May — TikTok + Meta. LUM-UGC02 UAE creator 9:16 — scheduled 26 May — TikTok.
Germany localised: LUM-DE-V01 / LUM-DE-V02 Morning Routine DE — video 9:16 & 1:1 — German subtitles & copy — Active. LUM-DE-S01 Product Hero static DE — 1:1 functional copy — Active from 5 May.
Performance notes (19 May, W3): LUM-V01 UK Meta: VCR 67%, freq (7d) 3.8, CPM £9.20 — CPM +31% vs W1; recall lift still climbing. LUM-V01 DE TikTok: VCR 41%, CPM €5.80 — VCR fell from 62% in 4 days; algo signal weakening. LUM-DE-V01 DE Meta: VCR 71%, freq 2.1, CPM €6.40 — within benchmarks. LUM-V01 UAE Meta: VCR 69%, freq 3.2, CPM AED 28.50 — strong; TikTok overlap suspected. LUM-V01 UAE TikTok: VCR 58%, CPM AED 19.80 — good completion; cross-platform overlap unconfirmed.
Refresh decision log: 19 May 2026 — All markets — Under review — Mid-campaign checkpoint — Pending approval.
Upcoming deadlines: Performance v2 final — all — due 17 May — Creative agency. DE localisation v2 — due 18 May — Local agency. UAE Arabic subs v2 — due 18 May — Local agency. UGC briefing UK+UAE — due 20 May — Partnerships.

Lumio — Market context

Necessity

UK · Germany · UAE profiles, competition, benchmarks, regs, key dates, cross-market ops.

United Kingdom: Audience: strong wellness-tech adoption; IG/FB primary; TikTok growing but not primary for 30+; high category saturation — creative quality differentiates. Competition: Oura Ring significant Meta UK awareness since Mar 2026; Whoop +estimated 40% UK spend Q2; Apple Watch Series 10 health push — high SOV. Meta CPM benchmark £6.50–7.50; campaign tracking £9.20 W3 (above benchmark) — likely competitor auction pressure vs saturation. Stories > Feed for recall. Freq sweet spot ~3–4 exposures/week for wellness. ASA health claims + influencer #Ad. Key dates: May Bank Holiday 26 May 2026 (higher CPM); Euro 2026 qualifiers distraction risk in June.
Germany: Audience: privacy-first / GDPR-aware; sceptical of hyper-aspirational ads — functional honest messaging wins; TikTok fast in 25–35 but behind Meta reach; German-language mandatory. Competition: Garmin brand campaign Meta DACH; Fitbit/Google lighter than expected — opportunity; Polar strong organic. Meta CPM bench €5.50–6.50; TikTok €4.00–5.50. TikTok wellness VCR bench 55–65%; current TikTok VCR 41% — possible deprioritisation; in-feed weakening — TopView / Spark-class tests noted. Avoid biometric “collection” copy; Werberat + strict cookie consent. Pentecost 19 May 2026; May public holiday cluster — CPM volatility.
UAE: Audience: affluent early adopters; bilingual EN/AR performs best; TikTok very popular including 30+; IG strong for premium lifestyle; very high cross-daily Meta+TikTok usage — overlap risk. Competition: Samsung Galaxy Watch heavy UAE spend; local wellness brands minimal — luxury wearable positioning expected. Meta CPM bench AED 22–28; TikTok AED 16–22. Meta–TikTok audience overlap estimated 60–70% for target demo — incremental reach risk. No third-party dedup tool connected — overlap unconfirmed. NMC regs; no political/religious content; modest imagery; Ramadan ended Mar 2026 (out of flight). Arabic subs TikTok GCC best practice. Eid Al Adha Jun 2026 — higher CPMs; campaign ends before late-June summer slowdown.
Cross-market ops: If any market >25% below reach target by W3 → budget reallocation review. Moves between markets need Performance Lead; max 15% reallocation without extra sign-off. Reporting: weekly Mon 9am; mid review 19 May 2026; final report 15 Jun 2026. Data gaps: UAE cross-platform overlap (proxy only); DE TikTok freq not comparable to Meta; UK competitor CPM impact estimated from auction insights only.

Past campaign benchmarks

Nice to have

Use live market-context CPM / VCR bands as this flight’s auction baseline.

Bench bands (this plan): UK Meta £6.50–7.50 · Germany Meta €5.50–6.50 · Germany TikTok €4.00–5.50 (VCR cat. 55–65%) · UAE Meta AED 22–28 · UAE TikTok AED 16–22.
How monitoring uses them: Flag when live CPM exits ±20% vs band without documented calendar cause; pair with freq + reach to decide auction vs creative vs overlap.

Brand safety & placement

Nice to have

Aligns to guidelines + UAE modesty + DE privacy tone + political/religious avoidance AE.

Global: Block politics, religious controversy, adult adjacency, unapproved UGC, negative-news adjacency where list available.
Per market: UAE: modesty + no alcohol + NMC sensitivity + Arabic subs on TikTok. Germany: no biometric-collection framing; cookie-heavy environments — conservative messaging. UK: ASA + #Ad on influencer.

Approval & autonomy

Nice to have

Media-plan escalations + creative approval matrix (condensed).

Always needs sign-off first: >15% budget move between markets · mid-flight creative pause · removing frequency caps · audience expansion past agreed parameters · new visual treatment (full brand) · new creative version (Brand + Performance).
Typically autonomous: Minor copy edit (Performance Lead); format change (Performance Lead); bid/daypart tweaks inside plan; rotate among approved asset IDs; pause line items breaching caps pending review.

Platform integrations

Meta Ads Manager

Must have

connectedFrequency, CPM, reach, EARL, impressions by ad set and market

TikTok Ads Manager

Must have

connectedVideo completion rate, reach, CPM, impressions by market (DE, AE)

GA4

Must have

previewBranded search volume by market — +15% by W4 target — connect property

Third-party overlap tool

Good to have

missingUAE 60–70% overlap estimated; not connected — incremental reach risk

Organic social listening

Good to have

missingSentiment by market — optional

Input pipeline (stages 1–5)

Stage 1 — Data intake

Layer 3 — pipeline reality

Show

Time window applied: 2026-04-28T00:00:00Z → 2026-05-19T23:59:59Z (calendar weeks 1–3; aligned to mid-campaign review).
Decision contract vs default window: No override — run used default monitoring window from the active decision contract.
Feeds: Attempted 9 · succeeded 7 · skipped 1 · auth_error 1

Feed IDs

composio_meta_ads_insightsok
composio_tiktok_reportingok
ga4_branded_search_propertyok
ga4_admin_api_propertyauth error
integration_tools_catalogok
tenant_doc_campaign_briefok
tenant_doc_media_planok
tenant_doc_creative_logok
third_party_overlap_uaeskipped

Composio sweep · connected apps considered

Meta Marketing API
TikTok for Business
Google Analytics 4
Google Sheets (read)
Composio connected-account index

Explain

This stage is “what raw measurements we could pull,” not the diagnosis yet. Layer 3 (pipeline reality): we tried to hydrate these sources for this tenant in this window.

Stage 2 — Data normalization

Layer 2 — stable metric identity

Show

Canonical KPI signals produced: 47
Custom metric defs applied (tenant overlay): 0

Definitions used (preview)

Metric ID	Label	Unit
reach_unique_7d	Unique reach (7d rollup)	people
impressions_7d	Impressions delivered	count
frequency_7d	Frequency (7-day rolling)	avg imps / user
cpm_local	CPM	GBP \| EUR \| AED / 1k imps
earl_meta	Estimated ad recall lift	% (Meta only)
vcr_tiktok	Video completion rate	% (TikTok)
branded_search_sessions	Branded search (sessions)	count / week

Explain

Layer 2 for commerce (stable metric identity). This card answers: “Are we comparing apples to apples this run?”

Stage 3 — Context mapping

Domain profile / ontology

Show

KPI classes activated: Reach · Delivery · Efficiency · Engagement · BrandSearchProxy · CrossPlatformOverlapRisk
Connected integration coverage (this tenant): Meta Ads (UK, DE, AE) · TikTok Ads (DE, AE) · GA4 property (see Stage 1 feeds for admin path) · Doc corpus (brief, plan, creative log, market context).

Explain

Which business vocabulary we mapped these metrics into for downstream detection. Coverage summary reflects connected integration reach for those classes.

Stage 4 — Business context

Non-sensitive overlays

Show

Merged context sources: Campaign flight boundaries (brief) · weekly budget pacing (media plan) · UK/DE/AE public holiday proximity · category CPM benchmark bands (market context doc).

Context merged

UK: May Bank Holiday 26 May 2026 within lookahead window.
Germany: Pentecost 19 May 2026 (review date) — leisure scroll / CPM volatility noted.
UAE: Eid Al Adha June 2026 flagged in long-range calendar; flight ends 8 Jun.

Explain

What contextual lenses could explain movement besides “something broke.” Kept non-sensitive and short.

Stage 5 — Pattern detection

Thresholded deltas vs baseline

Show

Baseline reference: Per-market platform benchmarks + prior-week same-metric rollups (W2 → W3).

Anomalies

UK · Meta · CPM — up — +22% WoW (£7.55 → £9.20); +31% vs week 1 (£7.02)
UK · Meta · Unique reach (W3) — down — 310K vs 820K prior week (plateau signal)
UK · Meta · Frequency (7d) — up — 3.8 vs soft threshold 3.0 / 7d
Germany · TikTok · Video completion rate — down — −21 pp in 4 days (62% → 41%)
Germany · TikTok · Estimated frequency — up — 12.0 vs ~4.4–4.6 prior weeks
Germany · TikTok · Engagement rate — down — 3.2% → 1.9% WoW

Explain

What changed enough to be worth investigating under this run’s thresholds — KPI + direction + magnitude vs baseline, not a wall of raw numbers.

KPIs to monitor

Primary

Unique reach per market per platform — 5M / market / flight
Impressions delivered vs target — 20M / market / flight
Frequency per market per platform — 3–5 / person / campaign
CPM trend week on week per market — within ±20% vs bench
Estimated ad recall lift (Meta only) — min +8% lift
Video completion rate (TikTok) — min 50%

Secondary

Audience overlap signal between platforms in same market
Branded search volume lift per market (GA4) — +15% by W4
CPM relative to market auction benchmarks

Campaign KPI data (tables)

Targets (from campaign brief)

KPI	Target per Market	Per Week Implied
Unique Reach	5,000,000	~833,000
Impressions	20,000,000	~3,333,000
Frequency (campaign total)	3.0–5.0	~0.5–0.8 per week
EARL (Meta)	8% lift	measured at week 6
TikTok VCR	50% minimum	weekly
CPM vs benchmark	within 20%	weekly
Branded search lift	15% by week 4	measured weekly

UK — Meta only · weekly KPI actuals

Metric	Week 1	Week 2	Week 3	Target W3 Cumulative	Status
Unique Reach	780,000	820,000	310,000	2,490,000	🔴 BELOW — reach plateaued
Impressions	2,100,000	2,450,000	1,820,000	9,999,000	🟡 ON TRACK total but slowing
Frequency (7-day)	1.8	2.6	3.8	max 3.0/7d	🔴 ABOVE threshold
CPM (£)	7.02	7.55	9.20	max £8.40 (20% over £7.00 benchmark)	🔴 31% above W2
EARL (%)	4.2%	6.1%	7.8%	8.0% by W6	🟢 TRACKING well
CTR (%)	0.82%	0.79%	0.71%	n/a (awareness)	🟡 Slight decline
Video Completion Rate (%)	68%	64%	61%	60% minimum	🟢 OK
Budget Spent (£)	44,800	54,600	58,900	158,300	🟢 ON BUDGET

Germany — Meta · weekly KPI actuals

Metric	Week 1	Week 2	Week 3	Target W3 Cumulative	Status
Unique Reach	620,000	710,000	690,000	2,100,000	🟢 ON TRACK
Impressions	1,680,000	2,050,000	2,100,000	7,000,000	🟢 ON TRACK
Frequency (7-day)	1.6	2.1	2.3	max 2.0/7d	🟡 SLIGHTLY ABOVE
CPM (€)	5.90	6.10	6.40	max €6.60 (20% over €5.50)	🟢 WITHIN RANGE
EARL (%)	3.8%	5.2%	6.9%	8.0% by W6	🟢 TRACKING
Video Completion Rate (%)	72%	69%	71%	60% minimum	🟢 STRONG

Germany — TikTok · weekly KPI actuals

Metric	Week 1	Week 2	Week 3 (W3 day 1-4)	Target W3 Cumulative	Status
Unique Reach	410,000	480,000	198,000	1,320,000	🔴 LAGGING — reach not scaling
Impressions	1,820,000	2,210,000	2,380,000	8,400,000	🟢 HITTING TARGET
Frequency (estimated)	4.4	4.6	12.0 (estimated)	n/a	🔴 SAME USERS REPEATEDLY
CPM (€)	4.20	4.50	5.80	max €5.40 (20% over €4.50)	🔴 ABOVE BENCHMARK
Video Completion Rate (%)	62%	58%	41%	50% minimum	🔴 DROPPED 17pp in 4 days
Engagement Rate (%)	3.2%	2.8%	1.9%	n/a	🔴 DECLINING — algorithm signal
Budget Spent (€)	38,200	47,100	51,800	137,100	🟢 ON BUDGET

UAE — Meta · weekly KPI actuals

Metric	Week 1	Week 2	Week 3	Target W3 Cumulative	Status
Unique Reach	480,000	510,000	520,000	1,575,000	🟢 ON TRACK
Impressions	1,380,000	1,620,000	1,760,000	5,250,000	🟢 ON TRACK
Frequency (7-day)	2.1	2.4	3.2	max 3.0/7d	🟡 APPROACHING LIMIT
CPM (AED)	22.40	24.10	28.50	max AED 28.80 (20% over AED 24)	🟡 NEAR LIMIT
EARL (%)	5.1%	6.8%	8.4%	8.0% by W6	🟢 AHEAD OF TARGET
Video Completion Rate (%)	71%	69%	69%	60% minimum	🟢 STRONG

UAE — TikTok · weekly KPI actuals

Metric	Week 1	Week 2	Week 3	Target W3 Cumulative	Status
Unique Reach	390,000	420,000	440,000	1,312,500	🟢 ON TRACK
Impressions	1,560,000	1,890,000	2,020,000	6,300,000	🟢 ON TRACK
CPM (AED)	17.20	18.40	19.80	max AED 21.60	🟢 WITHIN RANGE
Video Completion Rate (%)	61%	59%	58%	50% minimum	🟢 OK
Engagement Rate (%)	4.1%	3.9%	3.8%	n/a	🟢 STABLE

UAE — cross-platform overlap signals (proxy data only)

Signal	Value	Notes
Meta audience size (target demo UAE)	~1,200,000	Platform reported
TikTok audience size (target demo UAE)	~980,000	Platform reported
Combined unique reach claimed (Meta + TikTok)	~1,480,000	Sum of both platforms
Estimated actual UAE target audience size	~900,000–1,100,000	Industry estimate for target demo
Implied overlap	35–55%	Inferred — not confirmed by tool
Incremental reach from TikTok over Meta	Unknown	No cross-platform deduplication
Budget allocated to TikTok UAE	AED 85,000	~40% of UAE budget
Estimated wasted impressions (if 50% overlap)	~1,010,000	Based on W1-W3 TikTok impressions

Cross-market summary — week 3 status

Market	Platform	Reach Status	Impressions Status	Primary Issue	Root Cause
UK	Meta	🔴 Plateaued	🟡 Slowing	CPM spike 31%	Auction competition (Oura Ring / Whoop)
Germany	Meta	🟢 On track	🟢 On track	None	None
Germany	TikTok	🔴 Lagging	🟢 On track	VCR dropped 17pp	Algorithm deprioritisation — format mismatch
UAE	Meta	🟢 On track	🟢 On track	Frequency approaching limit	Cross-platform overlap inflating numbers
UAE	TikTok	🟢 On track	🟢 On track	Overlap unconfirmed	Cross-platform audience duplication

Branded search (GA4 — all markets)

Market	Week 1 Baseline	Week 2	Week 3	Lift vs Baseline	Target
UK	1,240 searches/week	1,380	1,590	+28.2%	+15% by W4
Germany	890 searches/week	940	1,020	+14.6%	+15% by W4
UAE	620 searches/week	710	830	+33.9%	+15% by W4

Monitoring pipeline (stages 6–16)

Run ID awareness-reach-imp-20260511T234348

Scenario S5-4 · wearable_awareness

Harness: runFullStage1to14 + runProductionPipeline (6a→10, Stage 13 baselines in runner). UI labels match mental model; not only stage6b_* filenames.

Verbatim LLM rows: docs/S5-4-wearable-awareness-verbatim-llm-output.md
JSONL: server/scripts/output/llm-full/awareness-reach-imp-20260511T234348-S5-wearable_awareness.jsonl

Stage 6 — Cause analysis (hypothesis formation)

Stage 6 — sources

Source	Detail
6a (deterministic)	8 root-cause hypotheses generated (RootCauseService.hypothesise) — not expanded here; downstream 6b narrowed to 4 branches below.
6b Phase 1 (LLM)	Four competing branches with priors + feed/metric hooks (table).

Stage 6 — hypotheses (6b Phase 1)

Hypothesis (short)	Confidence (prior)	Evidence hooks (feeds / metrics / flags)
MacroAuctionPressure_SocialCommerceExplosion — category demand surge → cross-platform CPM inflation	0.55	TikTokCPM z=+2.50; MetaAdsCPM z=+2.40 (same window); L1 :SocialCommerceExplosion @ 40%; no deploy/creative/budget flags
AudienceExhaustion_FrequencyCapSaturation — audience pool contracting / frequency saturation	0.45	MetaAdsFrequency z=+2.30; MetaAdsAudienceReachPct z=-2.20; case facts: UK Meta freq 3.8 / 7d, reach 310k; AE overlap ~45% (proxy — dedup tool not connected)
CreativeFatigue_EngagementDegradation — engagement down → quality / CPM pressure	0.20	TikTokCTR z=-2.20; Meta MetaAdsCTR not anomalous; Meta VCR case fact DE 0.71 (weak primary Meta creative story)
MeasurementAttributionArtifact — pipeline / attribution artifact	0.20	meta_ads not connected; runSignalMode=synthetic_scenario; TotalPaidMediaROAS unsafe for causal claims; window misalignment flags in audit

Explain

Plausible stories consistent with the signals—still pre-commit. Priors are from Stage 6b Phase 1 (submit_evidence_brief); no locked primary cause yet.

Stage 7 — Diagnosis synthesis (“committed diagnosis”)

Stage 7 — synthesis fields

Field	Value
Committed primary cause (canonical 6b commit)	Not produced — Stage 6b Phase 2 failed with Anthropic 400: tools.0.custom: For array type, property maxItems is not supported → runner used stub diagnosis (low / failed diagnostic step in downstream 6c text).
confidenceAssessment (effective)	Low for a formal locked diagnosis; directional evidence narrative from 6b2 still applied.
confirmationDataSources	In-product: anomaly KPIs + caseStudyFacts + monitoringDataDisclosure. External: 6b2 web searches (6 queries) + tool results; no live Ads Manager / Shopify / dedup tool rows in this harness.

Stage 7 — decision ledger (summary)

Row	Outcome	Confidence after evidence
Macro auction / social-commerce explosion	Rejected as primary — search found no acute May-2026 category spike; Meta CPMs cited down YoY Q1’26	0.20
Audience exhaustion / frequency saturation	Leading — frequency thresholds + exhaustion triad match; week-3 flight timing	0.65
Creative fatigue	Secondary (TikTok-weighted) — TikTok CTR pattern; not primary cross-platform (Meta CTR not anomalous)	0.25
Measurement / attribution artifact	Caveat retained — Meta outages May 5–6; synthetic + meta_ads connectivity ambiguity	0.20

Expand — evidence hooks by ledger row (no chain-of-thought)

Macro (loser): Queries e.g. social commerce CPM inflation Meta TikTok 2026, TikTok Shop wearables wellness ad spend surge May 2026, wearables wellness brands social commerce UK advertising May 2026 → no acute competitor surge; secular growth only.
Audience exhaustion (winner): Query Meta ads frequency saturation threshold awareness campaigns CPM → freq 3–4+ cold-audience danger zone aligns UK 3.8; triad CPM↑ freq↑ reach↓.
Creative (partial): Query TikTok ads CTR decline high frequency audience saturation ecommerce 2026 → TikTok-specific decay pattern; Meta engagement metrics not in anomaly set.
Measurement (caveat): Query Meta ads platform status issues May 2026 → Meta Ads Delivery outages May 5 (1h33m) & May 6 (22h37m); March 2026 attribution reclassification noted.

Explain

What we’re willing to stand behind for this run: an evidence-backed working story (audience exhaustion best supported at 0.65 after 6b2) inside an auditable ledger—but not a fully committed 6b locked diagnosis because Phase 2 commit hit an API schema error.

Stage 8 — Evidence checks

Show

External context snapshot (6b2 commit): period: normal_Q2_2026 · platformStatus: degraded Meta Ads Delivery (May 5–6); attribution reclassification Mar 2026; TikTok nominal · competitorActivity: quiet (no acute May-2026 surge found) · suppressAnomalyEscalation: false

Stage 8 — evidence checks

Check	What was tried	Outcome
Macro CPM story	Web search (social / TikTok / UK wearables angles)	No material acute corroboration; treat macro as background, not driver of z≈2.4–2.5 spike
Frequency / saturation	Web search (Meta freq vs CPM)	Corroborated vs UK freq 3.8 + triad
Platform health	Web search (Meta status May 2026)	Contradiction / caveat: outages + attribution changes → qualify Meta-side certainty
TikTok engagement	Web search (CTR + saturation)	Partial corroboration for TikTok path
Competitor surge	Searches + narrative	Quiet — no May-2026 spend spike found for category/geo

Details

Still open: live reach ÷ audience size (Ads Manager)
Still open: AE ~45% overlap — unverified (dedup tool not connected)
Still open: TikTok frequency for account
Still open: whether May 5–6 outage touched this account’s reporting
Still open: Shopify / revenue truth — not connected

Explain

What we tried to falsify or corroborate outside the first-pass story: web retrieval + structured commit (commit_evidence_findings). Several items did not change (macro acute story); others materially changed confidence (audience exhaustion up, macro down).

Stage 9 — Policy checks

Stage 9 — policy checks

Constraint / policy	Effect
6c — escalationRequired	false
Channels	Internal only — ops dashboard / Slack / Teams / email to paid media; no customer-facing channels
Audience	Internal — performance marketing + audience/creative owners
Timing	Monitor-and-investigate first; 24h emphasis on verifying meta_ads feed reality + May 5–6 outage impact before spend changes
Brand identity risks (blocked narratives)	No panic-discounting; no mass pause without verified root cause; no rushed creative rotation without confirming mechanism
Stage 7 (enforcer)	conservative=true · 82 allowed action types after pruning (console)

Explain

What we refuse to recommend or execute regardless of what looks optimal: customer-facing plays while diagnostic commit is broken/low-confidence; destructive spend/creative moves per 6c narrative; escalation not required to exec/board for this pattern.

Stage 10 — Option generation

Stage 10 — option generation

Candidate (title)	Intent	Tag
CreateInternalBrief (selected downstream)	Internal documentation + alignment for paid media / measurement follow-up	Exploratory / verify-first — 6b Phase 2 failed; synthetic scenario + feed gaps

Console: Stage 8 emitted 1 candidate; Stage 10 ran IntelligentPlanGenerator with model gpt-4o — full plan text is not in Anthropic llm-full JSONL for this run.

Explain

Possible moves consistent with policy—not a final customer-facing pick. With stubbed formal diagnosis, outputs should read as verify-first, not autopilot.

Stage 11 — Option scoring

Stage 11 — option scoring rubric

Rubric (plain language)	Application this run
Safety under uncertainty	Favor internal / non-destructive when 6b commit failed
Actionability without OAuth	Prefer brief / checklist over tool execution
Alignment with 6c	Must respect internal-only + investigate-first

Details

Top scored: CreateInternalBrief — best fit: captures 6b2 narrative + 6c guardrails for humans without pretending connectors executed.

Dampening: scores / decisiveness should be read as attenuated due to diagnosis commit failure + synthetic scenario + meta_ads connectivity ambiguity—scoring must not look magically decisive.

Explain

How we ordered choices given uncertainty: one candidate passed Stage 8; Stage 9 selected CreateInternalBrief as the best harm-minimizing action under conservative enforcement.

Stage 12 — Action planning

Stage 12 — action planning (safety bundle)

Safety bundle item	Content
Tradeoffs	Acting on CPMs without revenue truth → risk of false fixes; acting without feed verification → risk of wrong lever
Rejected alternative	Mass pause Meta; force creative refresh as primary lever; shift budget to TikTok “because cheaper” (6b Phase 1 explicitly blocked these patterns where data missing)
Kill-switch / recheck	Re-run diagnosis after meta_ads live reconnect + post-outage window check; if reach/freq/CPM normalize → downgrade exhaustion hypothesis
Human approval vs automation	Human: any customer-facing, budget reallocation, or attribution setting change. Safe automation (if wired): internal ticketing / brief generation only—not executed here beyond LLM plan text

Explain

The plan’s safety envelope: investigate connectivity and outage impact before spend or creative surgery; treat Meta delivery metrics as provisional until feeds are real.

Stage 13 — Execution

Stage 13 — execution

Item	Status
Customer connectors / workflows	Not executed — harness ends at plan generation; oauthCoverage=none, workflowDagEligible=false (console)
What did run	Baseline / learning repo updates in runner (“Stage 13” in console): e.g. 35 seeded, 4 updated horizon rows for KPI tracking

Explain

What actually ran in the real world: in this test harness, only offline baseline bookkeeping plus LLM stages—not production side effects on ad accounts.

Stage 14 — Outcome capture

Stage 14 — outcome capture

Signal	Status
Post-intervention KPIs	Pending verification window — no automated post-step measurement in this script path

Explain

Did the intervention move the needle yet—too early to tell; no post-step read connected in the runner.

Stage 15 — Learning updates

Stage 15 — learning updates

Update	Detail
Baselines	In-memory repo: seed/update rows for KPI horizons (console: 35 seeded, 4 updated)
Quality gate	Learning should be treated as bounded because run is synthetic_scenario + partial feed truth

Explain

What the system will do differently next time: slightly warmer baselines for repeated metrics on this tenant in-process—not magical self-rewriting of strategy.

Stage 16 — Reporting

Show — digest

Committed-first narrative (honest): best-supported story = audience exhaustion / frequency pressure (6b2 0.65); formal 6b commit missing (API tool schema error).
Ranked certainty: high for “investigate before spend”; medium for exhaustion mechanism; low for pure macro; caveats for measurement.
Missing-data caveats: meta_ads not truly verified live; Shopify / dedup absent; synthetic overrides.
Operational validation loop: (1) Feed truth (2) Outage overlap (3) Audience size vs reach (4) TikTok freq/creative slice.
Drawers: evidence → llm-full JSONL + pasteboard doc; ledger → Stage 7 table above.

Explain

Stakeholder-readable story with explicit limits: trust the evidence-retrieval synthesis for next questions; do not treat as signed-off diagnosis until 6b Phase 2 is fixed and rerun.

Appendix — pipeline ↔ code stage map (engineers)

UI stage	Code path (this run)
6	6a hypotheses + 6b Phase 1 (submit_evidence_brief)
7	6b Phase 2 (failed) + effective synthesis from 6b2 commit_evidence_findings
8	6b2 ReAct + web tools + commit
9	6c interpret_brand_constraints + Stage 7 ConstraintEnforcerService
10	Stage 8 candidates + Stage 10 orchestrator (gpt-4o plan; not in Anthropic JSONL)
11	Stage 9 ImpactScorerService
12	remediationSolutionPlan layer (from pipeline; brief in logs)
13	Runner “Stage 13” baseline writes
14–16	Honest N/A or pending for this harness

AI output (scenario test)

Run your scenario test and write results into mockAiOutput on the pack, or inject at runtime from the test harness.