Measured product lead custody receipt.
Market Data That Keeps Its Shape
A compression codec for delayed-feed market archives — compact, exact, queryable · ZPE-FT · PyPI zpe-ft v0.1.1 · github.com/Zer0pa/ZPE-FT
Market archives store rows well. They rarely retain the price pattern. Finding a six-month-old chart shape today means rebuilding from scratch, not retrieving.
ZPE-FT compresses delayed and public feeds 5.9–10.9× smaller than raw, replays price fields at RMSE = 0.0, and runs OHLCV pattern queries up to 62.9× faster than Parquet+zstd through DuckDB. The public-corpus result is real. The enterprise benchmark still waits on Phase 06 inputs and FT-C004 labels.

A market archive stores what happened. it does not remember what price did.
Encode the pattern. The archive knows its shape.
Delayed market data lives in raw CSV, Parquet+zstd, or vendor stores. Cheap to write, fast to scan. But the file holds bytes, not patterns. Asking what a price did means rebuilding the answer, not retrieving it.
ZPE-FT encodes pattern structure into the archive itself. Price fields replay at RMSE 0.0. OHLCV pattern queries run up to 62.9× faster than Parquet+zstd through DuckDB. SPY 10-year: 5.94× smaller. Binance BTC aggTrades: 10.90×. Kaggle SPY full history: 7.31×. Public, delayed-feed corpora only.
Three public corpora stand. phase 06 still needs its inputs.
Price fields replay exactly; zero error decides.
The 62.9× figure is the p95 query latency win on Yahoo SPY OHLCV versus Parquet+zstd through DuckDB. BTC aggTrades is size-positive at latency parity or slower — no latency win is claimed on tick data. Price-field RMSE = 0.0 holds on reported fields across all three public corpora. Deterministic replay is declared on public inputs with committed benchmark artifacts in the repo. Anyone with the corpora can rerun the numbers and get the same bytes. Phase 06 enterprise inputs remain missing. FT-C004 retrieval truth labels remain unresolved.
Public-corpus benchmarks are not the enterprise benchmark. Phase 06 still needs 33 missing input series and unresolved FT-C004 truth labels. ZPE-FT is not a real-time feed, not a trading system, and makes no lossless volume claim. PyPI ships v0.1.1; v0.1.2 is pending.
FIVE FUTURES FROM ONE delayed-feed archive.
The bet is not to beat the warehouse. The bet is that a market archive can stay compact, exact, and queryable at the same time — and that when delayed-feed history behaves that way, the warehouse stops being the only place a fintech team is allowed to ask questions.