SHUR Gap-Finder • Negative Space Intelligence

Adding Benevolence to Friction

YouTube generates $40 billion in ad revenue using a timer. An AI that understands story arcs could add $2 billion by moving the ad sixty seconds.

YouTube / Alphabet March 2026 Kevin Mowrer • Limore Shur • Nuri Djavit • Jonny Dubowsky

The Problem 01

YouTube’s ad insertion is a clock, not a storyteller. It counts minutes, not moments.

YouTube inserts mid-roll ads based on time elapsed, not content context. The algorithm counts down from the last ad break and drops the next one at a fixed interval — regardless of what the viewer is experiencing. The result is what Kevin Mowrer calls the “death moment”: the viewer is about to witness the big kiss, the killer reveal, the punchline — and a car insurance ad detonates in the middle of it.

This is not a minor UX complaint. Mid-roll ad completion rates drop from 82% to 57% when ads exceed 20 seconds. Sixty-eight percent of smart-TV users report annoyance at unexpected interruptions. Average video retention sits at a dismal 23.7%. Viewers do not distinguish between “bad ads” and “bad timing” — they just leave.

“They’ve launched the death moment missile. You’re about to have the big kiss — ad. And they hate it.”

Kevin Mowrer

The economics are stark. YouTube’s 2025 ad revenue was $40.37 billion. Its total revenue, including subscriptions, exceeded $60 billion — more than Netflix. If content-aware ad placement improved viewer retention by just five percent, that represents approximately $2 billion in incremental annual revenue. And the secondary effect is equally powerful: viewers who aren’t angry at the ad actually watch it, increasing time-on-ad and advertiser ROI.

The solution is not fewer ads. It’s better-timed ads. As Limore Shur articulated on the call: you’re literally adding benevolence to friction. The friction is necessary — ads fund the entire creator economy. But the timing can be kind.

Five-Lens Negative Space 02

Five analytical perspectives on what is absent, unsaid, contradictory, or avoided in the YouTube ad ecosystem.

Gap Analysis 03

Twelve gaps identified across the YouTube ad timing ecosystem — from platform blind spots to unexplored regulatory tailwinds.

Value Flow Analysis 04

How value moves — and leaks — through the current time-based ad insertion system.

Current State: The Leaking Pipeline

Creator Uploads Video

↓

YouTube Generates Transcript + Content ID

↓

Timer Inserts Ad at Fixed Interval

↓

Viewer Interrupted Mid-Climax

↓

Drops Off

Skips Ad

Pays Premium

Proposed State: Content-Aware Insertion

Creator Uploads Video

↓

Transcript + Audio + Visual Multimodal Analysis

↓

Content-Aware Engine Identifies Narrative Valleys

↓

Ad Placed at Natural Break (Scene Transition, Topic Shift, Emotional Valley)

↓

Keeps Watching

Watches Full Ad

Higher CPM

Revenue Impact Model

$40.4B	YouTube 2025 ad revenue
5%	Conservative retention lift from content-aware timing
$2.0B	Incremental ad revenue from retention improvement alone
+10%	Estimated time-on-ad increase (less viewer anger)
$2.8B	Combined incremental value (retention + engagement)

SHUR Gap-Finder • Technical Architecture

The Three-Layer Engine

Transcript, audio, and visual signals combined through genre-specific ontologies to identify narrative valleys where ads cause minimal disruption.

Multimodal Content Analysis ~200 Genre Pacing Profiles 6 Fundamental Story Arcs

Analysis Pipeline 01

Three layers of increasing depth, weighted by content category. Transcript is cheapest and fastest. Visual is richest and most expensive.

Composite Scoring

Each layer produces insertion candidates with confidence scores. The composite engine merges them, applies genre-specific weights, and enforces constraints: minimum 120 seconds between insertion points (no ad stacking), maximum 600 seconds (ensure ad frequency), and genre-specific blackout zones (never during a climax, never mid-punchline).

The key insight from Nuri: this only touches timing, not ad serving. Google’s ad-serving stack (targeting, bidding, brand safety) remains untouched. The engine outputs a ranked list of optimal timestamps. Google’s existing system selects which ad to show at each timestamp. Clean separation of concerns.

Story Arc Intelligence 02

Computational research validates six fundamental emotional arc shapes. Each has predictable valleys — the optimal ad insertion points.

Reagan et al. (2016, University of Vermont) analyzed 1,327 novels from Project Gutenberg using sentiment analysis and identified six fundamental emotional arc shapes. This wasn’t literary theory — it was computational proof that emotional trajectories are classifiable and finite.

Disney’s MediaScience lab in Austin, Texas — founded in 2008 at Disney’s request — operates 28 NeuroQube stations with EEG, heart rate, skin conductance, facial expression, and eye tracking. Their clients include Google, Netflix, and Apple. They’ve already proven that emotion is a targeting metric for advertising.

“They intentionally build in boring moments. It’s all part of the story. You can’t have highs without lows.”

Nuri Djavit — referencing Pixar/Disney story method

Pixar’s Story Spine structures content as seven beats: “Once upon a time” → “Every day” → “Until one day” → “Because of that” → “Because of that” → “Until finally” → “And ever since then.” The “Because of that” sections are rising action — do not interrupt. The transitions between beats are natural insertion points.

The Six Fundamental Arcs

Arc Shape	Pattern	Optimal Ad Zones	Death Zones
Rags to Riches	Steady rise	Early exposition, before final triumph	During the rising momentum
Tragedy	Steady fall	Between declining stages	At emotional bottom
Man in a Hole	Fall → Rise	At the bottom (viewer is processing), after recovery	During the fall, during the climb
Icarus	Rise → Fall	At the peak (natural pause), during early fall	During the ascent
Cinderella	Rise → Fall → Rise	Between each phase transition	During any steep change
Oedipus	Fall → Rise → Fall	At the recovery peak, between phases	During any steep change

Genre Ontology 03

The concept space is finite. Eight primary genres, 40–50 sub-genres, 150–200 pacing profiles. Not 6,000 possibilities — a tractable taxonomy.

The FrameBright Asset 04

An existing technology that already classifies content at the scene level, embeds metadata into video streams, and was built for 2 billion impressions per day.

FrameBright is a Canadian content classification startup whose CTO, Eric MacDougall, built classification engines at MindGeek scale — 2 billion daily impressions. Their technology already operates at arbitrary temporal granularity: scene, segment, track, and sample level. Classification metadata is embedded directly INTO the video file via a proprietary protocol that survives remuxing and streaming.

Currently positioned as a child-safety / content-moderation tool, FrameBright’s technology maps directly to the ad-timing problem. Their thesis: “Finer classification enables less restrictive access.” In ad-timing terms: finer content analysis enables less disruptive insertion. This is not a pivot — it’s a second application of the same core capability.

Technology Mapping

FrameBright Capability	Ad Timing Application
Scene-level classification	Emotional valence scoring per scene for insertion suitability
Embedded metadata protocol	Tag optimal insertion points as metadata in the video stream
Multi-dimensional classification	Score scenes across tension, humor, calm dimensions simultaneously
Real-time streaming analysis	Process live streams for real-time ad timing optimization
Scale architecture (2B/day)	Handle YouTube’s volume requirements without new infrastructure
Human-in-the-loop verification	QA workflow for validating insertion point quality at scale

SHUR Gap-Finder • Competitive Intelligence

The Field Is Open

Seven companies are working adjacent to this problem. None combines emotional arc analysis with real-time ad insertion optimization for creator platforms.

7 Known Competitors Market Gap: Emotional Arc + Creator Scale CAVVA (IEEE 2014) Validated but Unproductized

Competitive Matrix 01

Most existing solutions focus on broadcast and CTV, not YouTube creator content at scale. The intersection of narrative intelligence and dynamic ad insertion is unoccupied.

Company	Approach	Key Metric	Target Market	Gap vs. Our Thesis
Synchronized (France)	AI scene/sequence detection for ad marker placement	96% accuracy, 80% task reduction	Broadcast (TF1)	Scene-level only, no narrative arc understanding
Nomalab + AWS	Claude 3 Haiku + Titan embeddings for scene-audio alignment	~90% accuracy	Broadcast, VOD libraries	Library processing, not real-time creator content
Mirriad (UK, Public)	Virtual product placement inside content, 35 patents	85% viewer approval, 35% purchase lift	Brand entertainment	Different problem — in-content placement, not break timing
Setplex	Neural network frame-by-frame analysis	Real-time live + VOD	FAST channels, CTV	Technical detection, no narrative intelligence
Google Cloud Video AI	Shot detection, entity recognition, content classification API	Developer platform	GCP customers	Building blocks exist but not assembled for ad timing
BENlabs	AI-powered branded entertainment integration	Partnership with Mirriad	Brand deals	Branded content, not mid-roll optimization
Vibe.co	Self-serve programmatic CTV ads, AI-optimized	SMB focus	CTV, streaming	Ad buying platform, not insertion timing layer

Academic Validation 02

The concept is academically proven. Emotional arc analysis improves ad placement. But no one has productized it at scale.

Defensibility Assessment 03

The moat is not the algorithm. Google can replicate any algorithm. The moat is the proprietary ontology, validated results, and speed to market.

SHUR Gap-Finder • Go-to-Market Strategy

Show Them They Left Billions on the Table

YouTube already validated the concept in May 2025. The pitch is not “new idea” — it’s “your current approach stops at pauses. Ours understands story arcs.”

3 Phases POC → Validation → Pitch 12-Week Timeline

“It’s one thing to walk through the door with a solution. It’s another thing to walk through the door and show them you understand their problem better than anybody else.”

Kevin Mowrer

Phased Execution 01

The Pitch Narrative 02

Kevin’s question — “Why haven’t they done something?” — now has an answer. They have. But they stopped at the surface.

What YouTube did (May 2025): ML-based detection of pauses, transitions, and chapter markers for mid-roll ad placement. Hybrid manual + automatic system. Reported 5% revenue increase.

Where they stopped: The ML finds silence and scene cuts. It does not understand narrative arcs, emotional valence, genre-specific pacing, or the Pixar Story Spine. It cannot distinguish between a dramatic pause before a reveal (do not interrupt) and a natural pause between segments (safe to insert).

“Reducing content disruption without reducing ad presence — that will increase retention numbers and that turns into more ad dollars.”

Kevin Mowrer

Our pitch: We’ve built what comes after pauses. A genre-specific ontology that maps 200 content categories to pacing profiles. A multimodal engine that reads transcript, audio, and visual signals to identify narrative valleys. Empirical data showing the retention delta between timer-based and story-aware insertion. It outputs timestamps. It doesn’t touch the ad stack. It’s a timing layer, not a platform play.

The dollar case: YouTube’s own May 2025 update proved that better timing = more revenue. They got 5% from surface-level changes. We’re offering the next 5%. At $40B in ad revenue, that’s $2 billion — for moving the ad sixty seconds.

Team & Leverage Points 03

Person	Role	Leverage
Kevin Mowrer	Concept originator, media industry relationships	Narrative accessibility — translates technical depth into buyer language
Limore Shur	Activation, positioning, FrameBright connection	“Benevolence to friction” framing. Bridge to FrameBright team and Evan Shapiro
Nuri Djavit	Technical scoping, market strategy	Scoped the problem to “timing layer only” — avoids Google-competitive positioning
Jonny Dubowsky	Ontology design, AI pipeline, analysis engine	ShurAI negative space methodology, knowledge graph architecture, multimodal pipeline
Howard Postley (potential)	Academic friction research, problem framing	Academic credibility — peer-reviewed problem definition strengthens the pitch
Evan Shapiro (potential)	Media cartographer, YouTube advocate	Public platform + YouTube relationships — could amplify or introduce
Eric MacDougall / FrameBright (potential)	Scene-level classification technology, scale engineering	MindGeek-scale credibility + existing IP. Not vaporware — redirected technology

Open Questions 04

1. Why did YouTube stop at surface-level? Organizational inertia? Premium cannibalization? Technical debt in the ad-serving pipeline? Understanding the blocker shapes the pitch.

2. Patent landscape? Mirriad has 35 patents for in-content placement. CAVVA exists as academic prior art. Quick freedom-to-operate assessment needed before investing heavily.

3. Minimum viable demo? Transcript-only on 10 videos? Or do we need audio/visual for credibility? The cheapest path to a compelling heatmap.

4. FrameBright involvement? Do we bring Eric in as technical co-architect? His scale credibility is powerful but introduces coordination complexity.

5. Creator tool angle? YouTube Certified Partner integration as the entry wedge? Or pitch directly to YouTube Ads team?

6. Evan Shapiro timing? Before or after the POC? His endorsement adds weight but he needs something concrete to react to.

The ad still plays. The advertiser still gets impressions. But the system waits for the right moment.

Content-aware ad insertion is not about removing friction. It’s about making friction kind. YouTube validated the concept. We’re building the depth.

Adding Benevolence to Friction

Space Around — What Exists Adjacent

What They’re NOT Saying — YouTube’s Silence

The Silence — Industry-Wide Avoidance

Challenge the Obvious — Assumptions Worth Breaking

Contradictions — Structural Inconsistencies

Current State: The Leaking Pipeline

Proposed State: Content-Aware Insertion

Revenue Impact Model

The Three-Layer Engine

Composite Scoring

The Six Fundamental Arcs

Scripted Drama

Comedy & Stand-Up

Educational / Tutorial

Gaming Content

Music Content

Vlog / Unscripted

Technology Mapping

The Field Is Open

CAVVA: Computational Affective Video-in-Video Advertising

Reagan et al.: Six Fundamental Emotional Arc Shapes

Neuroscience-Validated Content Pacing

Show Them They Left Billions on the Table