⚡ FME Version 19 · May 2026

Forensic Media Evaluation
Methodology

Every Rhetoric Audit scan is grounded in a four-stage forensic pipeline — from span-level propaganda detection to external claim grounding. This page explains what we measure, how we measure it, and where the limits are.

Principles

Four commitments that shape every scan

Forensic, not editorial

We score what is structurally present — framing, evidence quality, omissions, emotional load — not whether we agree with the conclusion.

Multi-axis, not left/right

A single ideology label flattens complex argumentation. FME decomposes text into independent rhetorical dimensions that can be reasoned about separately.

Span-anchored evidence

Every score traces back to the exact words that produced it. "Why this 72?" is always answerable with a quoted passage and a scholarly technique name.

Reproducible & auditable

Every scan returns a prompt_hash and fme_version. Same inputs produce equivalent outputs. Bench F1 is published per release.

Capabilities

What FME V19 measures

V19 ships seven capabilities in P1, with argument mining and cognitive bias schema-reserved for P2. The table below shows the full traceability from scholarly literature to current status.

#CapabilityScholarly SourceStatus
1Span-level propaganda detectionDa San Martino et al., SemEval-2020 Task 11P1 ✓
2Aristotelian appealsAristotle's Rhetoric; LEPAnP1 ✓
3Full fallacy enumerationSemEval persuasion + classical logicP1 ✓
4Emotion model + arcPlutchik-8; VAD modelP1 ✓
5Full-document analysisEliminates V18.6 truncation biasP1 ✓
6External claim groundingGoogle Fact Check Tools, WikidataP1 ✓
7Ground-truth validation pipelineSemEval-2020 public setP1 ✓
8Argument miningStab & Gurevych (2017)P2 reserved
9Cognitive bias / nudge detectionKahneman, ThalerP2 reserved
10Cross-source framingMedia Frames CorpusP3 — CSCE
11Strategic silenceAgenda-setting theoryP3 — corpus
12Discourse structureRhetorical Structure TheoryP3 — parser
Pipeline

The four-stage forensic pipeline

V19 replaces the V18.6 monolithic single-pass LLM prompt with a deterministic four-stage architecture. The LLM observes; code scores.

Stage 0

Preprocessing

Paragraph-aware chunking into 3-paragraph windows with 1-paragraph overlap. Eliminates V18.6 truncation bias.

Deterministic · No LLM
Stage 1

Span Annotation (batched, parallel)

Each chunk is sent as one LLM call. Output per span: char offsets, technique, appeal, emotion, confidence, and rationale.

LLM · mimo-v2-flash
Stage 1.5

Claim Grounding (parallel)

Factual claim spans are canonicalized and queried against Google Fact Check Tools and Wikidata. Stage 2 is never blocked.

External APIs · 30-day cache
Stage 2

Aggregation (deterministic)

Document scores rolled up from paragraph scores using a prevalence-weighted, paragraph-length-normalized formula.

Deterministic · No LLM
Stage 3

Validation

Zod schema check, span offset sanity, and bench-score CI gate (merge blocked on >2pp macro-F1 regression).

Schema · CI gate

See FME V19 applied to a real article

Install the Chrome extension or sign up to run your first forensic analysis. Free credits included on signup.