Research Library — 4CITE.ai | Structural Integrity Research by 4 SHIELD LLC

Validation Evidence

The empirical case for structural integrity measurement — controlled studies, retroactive scoring of historical disclosures, archetype validation across domains.

Evidence

Validation Results

Cross-domain corpus scores · Mata v. Avianca · SVB · Enron · Founding documents

The full validation page: 81-point gap between genuine federal opinion and AI-fabricated brief; 71-point discrimination delta on a blinded 20-document batch; 13/13 archetype classification accuracy; the corpus scores spanning Federalist No. 51 (91) down to Enron FY2000 10-K (8).

81ptMata gap

71ptDiscrimination delta

13/13Archetype accuracy

2×Federal court win rate corr.

View Evidence →

White Papers — Theoretical Foundation

Why structural integrity measurement works, why the discrimination gap is so large, and why accuracy-based detection methods measure the wrong layer.

WP-17

Published · July 19, 2026

The Category Error of Banning Words

Why Meaning Lives in Context, and What That Makes Possible

Banning a word is a category error: meaning lives in the relations between words, not inside the token. This paper traces that error from the euphemism treadmill to the Scunthorpe problem, shows why attention-based AI reads meaning relationally — the same commitment structural integrity analysis makes when it reads the rendered whole rather than the fragment — and argues that a machine that reads context is the threshold of a translation layer between speakers, under two conditions: that it is not weaponized, and that it carries a published reference standard against drift.

12 sections Context principle · Layer 3 · Model Verification standard

Read Paper (HTML) →

WP-16

Published · April 24, 2026

The Hallucination Gap

Why Shannon's Law Explains Structural Integrity Measurement

A hallucinating AI is a low-bandwidth channel impersonating a high-bandwidth signal. This paper applies Shannon's 1948 channel-capacity framework to AI hallucination, documenting a 71-point discrimination delta between genuine and hallucinated content and explaining why accuracy-based detection measures the wrong layer. Positions structural integrity analysis as a measurement of channel capacity grounded in the same theoretical framework the AI models themselves are built on.

12 sections 71-pt delta · Shannon framework · Layer 3 positioning

Read Paper (HTML) →

WP-14

Published · April 2026

Hallucination Is Not an Accuracy Problem

Why AI Confabulation Is a Structural Integrity Event

Retrieval-augmented generation and citation verification are necessary but not sufficient for hallucination detection. A document with perfectly accurate citations can still fail structural integrity analysis if the reasoning connecting those citations is performed rather than genuine. Establishes the taxonomy of hallucination modes and maps each to the measurement layer capable of detecting it.

Hallucination taxonomy · RAG limitations · Layer 3 necessity

Read Paper (PDF) →

WP-12

Published · April 2026

The Collapse Dividend

Why AI Model Collapse Makes Structural Integrity Measurement Infrastructure

Shumailov et al. (2024) demonstrated that AI models trained on recursively generated data collapse toward a bland distributional mean. Argues that model collapse is not just a model quality problem — it is a documentary ecosystem problem. As AI-generated content replaces human-authored content in training data, the structural integrity gap between genuine and generated documents narrows from below. Measurement infrastructure becomes more critical, not less, as the surface differences diminish.

Model collapse · Shumailov 2024 · Documentary ecosystem

Read Paper (PDF) →

White Papers — Strategic & Architectural

Why structural integrity is a property of systems rather than people, why the AI accountability crisis is already behind institutions rather than ahead, and what the regulatory environment is about to require.

WP-15

Published · April 2026

The Retrospective Liability Thesis

Why the AI Accountability Crisis Is Behind Us, Not Ahead

The sanctions event horizon has already passed. Mata v. Avianca (2023) and Brigandi v. GEICO ($110,000+) established the liability pattern. FRE 707 codifies it. Argues that institutions are not preparing for future AI accountability risk — they are managing existing, undiscovered liability in documents already filed and relied upon.

FRE 707 · Retrospective scoring · Liability architecture

Read Paper (PDF) →

WP-13

Published · April 2026

The Accountability Architecture

Why Structural Integrity Is a Property of Systems, Not People

Structural integrity measurement is not a judgment about individual authors — it is a measurement of the accountability architecture in which documents are produced. Establishes the theoretical basis for why foundational accountability (the G4/G6 gate pair) is the most fundamental dimension and why no amount of surface polish can compensate for its absence.

Foundational accountability · Accountability architecture · Institutional analysis

Read Paper (PDF) →

Earlier Papers — Validation Roots

The empirical results that grounded the theoretical framework — the 81-point Mata gap and the 71-point discrimination delta. Both are summarized on the Evidence page.

WP-7

Available · 2025–2026

The 81-Point Gap

Structural Integrity Analysis of AI-Fabricated Legal Briefs: Mata v. Avianca Case Study

Detailed structural integrity scoring of the ChatGPT-fabricated brief submitted in Mata v. Avianca, Inc. (S.D.N.Y. 2023), which resulted in Rule 11 sanctions from Judge P. Kevin Castel. The fabricated brief scored 7/100 (T4 Fabricated). Authentic briefs from the same legal domain scored 72–88/100 (T1 Integrated). The 81-point gap is the largest single-document delta in the 4CITE validation corpus.

Score: 7/100 · T4 · 81pt gap · Rule 11 sanctions

View on Evidence Page →

WP-5

Available · 2025–2026

71-Point Discrimination Delta

Blinded Batch Comparison of Genuine and Hallucinated Content Across Five Professional Domains

Controlled study: 20 matched AI responses (10 genuine, 10 hallucinated) across five professional domains, scored blind on multi-dimensional structural integrity analysis. Genuine responses averaged 82.4 (T1 Integrated — all). Hallucinated responses averaged 11.4 (T4 Fabricated — all). Zero overlap in score distributions. The 71-point average discrimination delta is the primary empirical validation of the structural integrity measurement methodology.

n=20 · 71pt avg delta · Zero distribution overlap · 5 domains

View on Evidence Page →

Research Reports — R Series

R35

Available · April 3, 2026

13/13 Archetype Classification Validation

Integrity Archetype Self-Tuning Layer: Cross-Corpus Validation Study

Validation study of the 4CITE archetype classification layer across 13 documents spanning legal, corporate, and government domains. 13 of 13 documents received archetype classifications consistent with human expert review. Validates the archetype layer as a reliable subtype descriptor operating above tier designation.

13/13 accuracy · Cross-domain · Archetype layer validation

Research Areas

What the Research Covers

Shannon Framework & Channel Capacity

Applying Shannon's 1948 information-theoretic framework to document reliability. The foundational theory behind why structural integrity measurement works and why the discrimination gap is so large.

AI Hallucination Detection

Structural integrity as a hallucination detection method. Why accuracy-based tools (RAG, citation verification) measure the wrong layer, and what Layer 3 adds to the complete integrity stack.

Model Collapse & Ecosystem Effects

How recursive AI training degrades documentary ecosystems. The Shumailov et al. (2024) findings applied to the institutional document corpus and long-run measurement infrastructure requirements.

Legal Accountability Theater

Structural integrity analysis of fabricated legal briefs. The Mata v. Avianca and Brigandi case studies, Rule 11 sanctions patterns, and FRE 707 regulatory implications for AI-generated legal content.

Corporate Disclosure Integrity

Retroactive scoring of SEC filings from SVB, Enron, and the broader EDGAR corpus. Score drift as a leading indicator. The accountability theater pattern in risk disclosures that precedes institutional failure.

Founding Document Benchmarks

High-integrity calibration corpus: Federalist Papers (91), Gettysburg Address (89), Declaration of Independence. The structural standard that existed before it was measurable — now measured.

Meaning, Context & Language

Why meaning lives in the relations between words, not inside the token — from the euphemism treadmill and the Scunthorpe problem to the context principle that attention-based models implement. The philosophical foundation of reading the rendered whole (WP-17).

Research Partnerships

Academic institutions and independent researchers interested in corpus access, methodology validation, or collaborative research are invited to reach out directly.

Research Inquiry → See Pricing →