Design rationale

Tetration combines HDF5-shaped persistence (many large arrays in one durable file) with Zarr-shaped chunking (regular grid, per-chunk compression, parallel I/O) — in a single mmap-friendly .tet file, not a directory of shard blobs.

Design goals

Goal	How `.tet` achieves it
Single-file portability	One file to copy, archive, or attach to object storage
Partial I/O by default	Chunk index → mmap only intersecting payloads
Query-friendly metadata	JSON/TOML control plane; footer carries axis names and attrs
Read-many / write-once	Immutable sealed files; unlimited concurrent readers

When to use `.tet`

Good fits:

Large multi-dimensional arrays that benefit from chunk-level access
Workflows that need a single portable artifact (not a Zarr directory tree)
JSON/TOML queries for aggregates, transforms, and QC counts without loading full tensors

Less ideal:

Tabular analytics with heavy columnar filtering → consider Parquet
Cloud-native sharded writes with many concurrent writers → consider Zarr
Simple single-array interchange → NPY may suffice
SQL-on-files or ad-hoc relational queries → not a v1 goal

See the full Format comparison guide.

Control plane vs storage

JSON/TOML is the control plane, not the storage encoding. Hosts validate input, cap size, and enforce spill path policy. The on-disk format is a fixed binary layout optimized for mmap.

Query documents are flat — e.g. "mean": [] / mean = []. Nested "operation" objects are rejected.

Non-goals (v1)

SQL-on-files
Arbitrary codec plugins in the file format
GPU codecs in the on-disk layout (optional GPU is a query-time execution concern)
Multiple concurrent writers on one file

Design rationale ​

Design goals ​

When to use .tet ​

Control plane vs storage ​

Non-goals (v1) ​

Further reading ​

Design rationale

Design goals

When to use `.tet`

Control plane vs storage

Non-goals (v1)

Further reading