Measured product lead custody receipt.
See What AI Models Really See
A byte-identical perception receipt for AI video pipelines · ZPE-Video · PyPI zpe-video v0.1.0 · github.com/Zer0pa/ZPE-Video
AI systems decide what gets flagged in video, who gets identified, what a generation model is trained on — and until now, no one outside the pipeline could check what the detector was actually shown.
ZPE-Video closes that gap. Two independent Python writers, built from the same spec, produce a byte-identical perception receipt for the same frames. The receipt records detector, tracker, and manifest state; it does not reconstruct pixels or audio. It is a re-derivable record, not a video codec.

An AI processes video — but no one can check what it was actually shown.
What AI saw in the video can now be checked.
Perception traces from AI video pipelines scatter across Parquet, JSON, pickle, and MCAP containers. There is no portable record format and no second writer that can independently rebuild the same bytes from the spec.
zpe-video v0.1.0 ships a zero-dependency Python library with a documented wire format, per-frame CRC32, stable receipt hashes, and SHA-256 manifest binding. Two independent writers — one built from the spec by hand — produce byte-identical output across 3 receipt-core cases. The record is re-derivable, not just inspectable.
Receipt evidence lives in bytes, hash, CRC, and manifest.
Two independent Python writers, one byte-identical receipt.
Deterministic means the same detector and tracker input plus the same wire-format spec produce a byte-identical perception receipt from two independent Python writers, with SHA-256 manifest binding verified on the same three receipt-core cases. Cross-writer SHA-stable measures the receipt only — it is not deterministic computer vision, deterministic LLM output, a legal evidence chain, full-video replay, or a competitor to AV1 or VVC. The scope is the record, and the scope is named.
The receipt carries detector and tracker state — boxes, track IDs, timestamps, CRCs, manifest binding. It does not reconstruct pixels, appearance, or audio. Cross-runtime replay is open, C2PA integration is not in scope yet, and the PyPI v0.1.0 README still carries stale private-repo and video-codec wording.
WHAT THE MODEL SAW, on the record.
The ambition is not to compete with video codecs. It is to make the question "what was this AI actually shown?" answerable by anyone who can run Python. When that record exists at intake, AI video systems stop being black boxes that decide on inputs no one else can inspect.