Measured product lead custody receipt.
Encoding Speech's Shape And Feeling
Prosodic feature-store codec — pitch, energy, duration, voiced mask · ZPE-Prosody · PyPI zpe-prosody v0.1.1 · github.com/Zer0pa/ZPE-Prosody
A voice carries more than words. Pitch rises, stress lands, and rhythm marks how speech moves through time.
ZPE-Prosody captures that shape as a deterministic ZPRS/v1 stream — F0, energy, duration, and the voiced/unvoiced mask — at 13.0× mean compression and 0.64% voiced-F0 RMSE on 100 LibriSpeech test-clean utterances. It stores acoustic prosody cues, not emotion, intent, semantic meaning, or a speaker-state diagnosis. Encoder only: retrieval misses target; transfer is paused.

Speech systems compute prosody again and again. The shape of the voice is rarely kept.
Speech carries feeling. Its shape can now be held.
Mainstream TTS and voice-analytics stacks compute pitch, energy and timing every time they need them, then throw the contours away or stash them as undocumented bytes. No published fidelity figure, no public limit, no shared archive format.
ZPE-Prosody encodes the four prosodic primitives (F0, energy, duration, voiced mask) as a deterministic ZPRS/v1 stream at 13.0× mean compression and 0.64% voiced-F0 RMSE on real LibriSpeech utterances, with a mean encode latency of 2.67 ms. Four primitive checks pass. Retrieval and transfer are deliberately excluded from the product, and their numbers are reported alongside the encoder's.
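For readers who want to sanity-check figures of this kind on their own data, here is a minimal sketch of how a compression ratio and a voiced-F0 RMSE percentage could be computed. It is not the ZPE-Prosody implementation. The function names are hypothetical, and the definitions are assumptions: compression as raw feature bytes over encoded bytes, and RMSE as relative error over voiced frames only. The project's exact definitions may differ.

```python
import math
from typing import Sequence


def compression_ratio(raw_bytes: int, encoded_bytes: int) -> float:
    """Raw feature size divided by encoded size (assumed definition)."""
    if raw_bytes < 0 or encoded_bytes <= 0:
        raise ValueError("raw_bytes must be >= 0 and encoded_bytes > 0")
    return raw_bytes / encoded_bytes


def voiced_f0_rmse_percent(
    f0_ref: Sequence[float],
    f0_dec: Sequence[float],
    voiced: Sequence[bool],
) -> float:
    """Relative F0 RMSE in percent, over voiced frames only (assumed definition)."""
    if not (len(f0_ref) == len(f0_dec) == len(voiced)):
        raise ValueError("all inputs must have the same number of frames")
    # Unvoiced frames carry no pitch, so they are skipped entirely.
    squared = [
        ((dec - ref) / ref) ** 2
        for ref, dec, is_voiced in zip(f0_ref, f0_dec, voiced)
        if is_voiced and ref > 0
    ]
    if not squared:
        raise ValueError("no voiced frames to score")
    return 100.0 * math.sqrt(sum(squared) / len(squared))


# Illustrative values only, not measurements from the project.
reference = [100.0, 200.0, 0.0, 150.0]
decoded = [101.0, 198.0, 0.0, 150.0]
mask = [True, True, False, True]
print(f"voiced-F0 RMSE: {voiced_f0_rmse_percent(reference, decoded, mask):.2f}%")
print(f"compression: {compression_ratio(1300, 100):.1f}x")
```

Restricting the error to voiced frames matters: including unvoiced frames, where F0 is conventionally zero, would either divide by zero or dilute the error.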
The encoder passes four checks. Retrieval and transfer do not.
The encoder holds speech's shape. Retrieval does not yet follow.
On 100 LibriSpeech test-clean utterances the encoder records 13.0× mean compression at 0.64% voiced-F0 RMSE with duration RMSE of 0.000 ms, across 5/5 hash-identical encoder runs. The same input bytes produce the same ZPRS/v1 stream every time, on every host. PRO-C001..C004 PASS on primitive encoder checks; they do not override the retrieval and transfer gates. Retrieval (PRO-C006) misses target at p@5 0.31 vs 0.80; OOD p@5 0.1707. Transfer (PRO-C005) is PAUSED_EXTERNAL. The page reports both, not one.
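The hash-identical claim can be checked in a few lines. The sketch below shows one way to do it, under stated assumptions: `toy_encode` is a hypothetical stand-in, not the ZPRS/v1 format or the zpe-prosody API, and `runs_are_hash_identical` is an illustrative helper. Any encoder that maps input to bytes can be passed in its place.

```python
import hashlib
import struct
from typing import Callable, Iterable, Tuple

Frame = Tuple[float, float, bool]  # (F0 in Hz, energy in 0..1, voiced flag)


def toy_encode(frames: Iterable[Frame]) -> bytes:
    """Hypothetical stand-in encoder; NOT the real ZPRS/v1 layout."""
    out = bytearray(b"TOY0")
    for f0_hz, energy, voiced in frames:
        # Quantise to integers so no float formatting leaks into the stream.
        q_f0 = max(0, min(65535, round(f0_hz * 10)))
        q_energy = max(0, min(255, round(energy * 255)))
        # "<" fixes little-endian byte order regardless of the host CPU.
        out += struct.pack("<HBB", q_f0, q_energy, 1 if voiced else 0)
    return bytes(out)


def runs_are_hash_identical(
    encode: Callable[[list], bytes], payload: list, runs: int = 5
) -> bool:
    """Encode the same payload `runs` times and compare SHA-256 digests."""
    if runs < 1:
        raise ValueError("runs must be at least 1")
    digests = {hashlib.sha256(encode(payload)).hexdigest() for _ in range(runs)}
    return len(digests) == 1


frames = [(120.0, 0.5, True), (0.0, 0.0, False), (131.5, 0.62, True)]
print("hash-identical across 5 runs:", runs_are_hash_identical(toy_encode, frames))
```

A check like this only covers repeated runs on one machine. Confirming the cross-host part of the claim would mean comparing digests produced on different hosts.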
MISS on PRO-C006 retrieval, p@5 0.31 vs 0.80; OOD p@5 0.1707. PRO-C005 transfer PAUSED_EXTERNAL; no commercial-safe substitute proven in-lane. Status packet on PR #50 branch-public; PyPI stale at v0.1.1. No transfer learning, retrieval product, or TTS-ready system is claimed.
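To make the p@5 figures concrete, here is a minimal sketch of precision at k as it is commonly defined: the fraction of the top k retrieved items that are relevant, averaged over queries. This is an assumption about the metric, not the project's evaluation code, and the function names and sample IDs are hypothetical.

```python
from typing import Iterable, Sequence, Set, Tuple


def precision_at_k(ranked_ids: Sequence[str], relevant_ids: Set[str], k: int = 5) -> float:
    """Fraction of the top-k ranked items that are relevant."""
    if k <= 0:
        raise ValueError("k must be positive")
    hits = sum(1 for item in ranked_ids[:k] if item in relevant_ids)
    # Dividing by k (not by the number returned) penalises short result lists.
    return hits / k


def mean_precision_at_k(
    queries: Iterable[Tuple[Sequence[str], Set[str]]], k: int = 5
) -> float:
    """Average precision@k over (ranking, relevant set) pairs."""
    scores = [precision_at_k(ranked, relevant, k) for ranked, relevant in queries]
    if not scores:
        raise ValueError("no queries to score")
    return sum(scores) / len(scores)


# Illustrative data only: two of the top five results are relevant.
ranking = ["utt_a", "utt_b", "utt_c", "utt_d", "utt_e"]
relevant = {"utt_a", "utt_c", "utt_z"}
print("p@5 =", precision_at_k(ranking, relevant))
```

Read this way, a p@5 of 0.31 against a 0.80 target means that on average fewer than two of the top five results are relevant, where the target requires four.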
A voice carries a fidelity receipt.
The product is a bounded ZPRS/v1 feature store for the shape of speech — F0, energy, duration, voiced mask — that a TTS team, a call-centre analytics owner or a linguistics lab can store, ship and re-read with a stated fidelity per recording. Retrieval and transfer arrive later, on their own terms.