NFL KOREA
by NFL Dictionary

COACHING TREE DATA LAB

Public-facing snapshot of the new coaching-tree.app migration pipeline built on May 6, 2026. This page shows what today's development run captured, normalized, and validated before any full-scale graph import.

Sample run scope: The current report is intentionally capped at 2 directory pages and 6 coach profile pages, so unresolved references are expected until the crawl expands.
Run ID: 20260506-012700-coaching-tree-app Validation: WARN Generated: May 6, 2026

Pipeline Snapshot

The sample run wrote raw snapshots to local storage, extracted structured staging rows, normalized canonical graph files, and produced validation manifests in a reproducible repository layout.

Today's Output Layers

The migration pipeline separates raw crawl artifacts from reproducible repository data so that large HTML snapshots stay local while compact graph-ready files remain versionable.

Raw Capture

Directory pages, coach profile HTML, and team directory snapshots written outside Git.

raw/coaching-tree-app/<RUN_ID>/

Structured Staging

Directory coaches, parsed profiles, career history rows, worked-under rows, team rows, and source metadata.

data/nfl-coaching-tree/staging/

Canonical Graph Files

Normalized coaches, franchises, aliases, staff tenures, edges, claims, and source-backed graph import inputs.

data/nfl-coaching-tree/canonical/

Validation and Export

Validation summaries, unresolved lists, pipeline manifests, and optional Neo4j export bundles.

validation/, manifests/, exports/<RUN_ID>/

Sample Coach Profiles

These examples are drawn directly from today's parsed profile sample and show the kind of canonical coach records now produced by the pipeline.

Career Timeline Slice

The sample tenure table below shows how coach-season-role rows are separated from narrative profile pages into graph-friendly records.

Validation Watchlist

Current unresolved items come from the intentionally partial crawl window, not from fabricated data. They stay visible until a broader run resolves them.

Validation is currently WARN because 3 mentor-name references and 1 historical franchise reference remain unresolved in the sample snapshot.

Source Scope

Today's sample run used the public coaching-tree.app directory and a small coach profile subset.