# Analytics Guide
Tumult embeds DuckDB and Apache Arrow for SQL analytics over experiment journals.
## Architecture

```
TOON Journal (.toon) → Arrow RecordBatch (columnar) → DuckDB (SQL) → Parquet (export)
```
## Tables

### experiments
| Column | Type | Description |
|---|---|---|
| experiment_id | VARCHAR | UUID per run |
| title | VARCHAR | Experiment title |
| status | VARCHAR | Completed, Deviated, Aborted, Failed |
| started_at_ns | BIGINT | Start time (epoch nanoseconds) |
| ended_at_ns | BIGINT | End time |
| duration_ms | UBIGINT | Total duration |
| method_step_count | BIGINT | Method steps executed |
| rollback_count | BIGINT | Rollback steps executed |
| hypothesis_before_met | BOOLEAN | Steady state before? |
| hypothesis_after_met | BOOLEAN | Steady state after? |
| estimate_accuracy | DOUBLE | Estimate vs actual (0-1) |
| resilience_score | DOUBLE | Overall resilience (0-1) |
### activity_results
| Column | Type | Description |
|---|---|---|
| experiment_id | VARCHAR | Links to experiments |
| name | VARCHAR | Activity name |
| activity_type | VARCHAR | Action or Probe |
| status | VARCHAR | Succeeded, Failed, Timeout |
| started_at_ns | BIGINT | Start time |
| duration_ms | UBIGINT | Execution duration |
| output | VARCHAR | Activity output |
| error | VARCHAR | Error message |
| phase | VARCHAR | hypothesis_before, method, hypothesis_after, rollback |
## Example Queries

```sql
-- Summary by status
SELECT status, count(*) AS runs, avg(duration_ms) AS avg_ms
FROM experiments GROUP BY status;

-- Slowest activities
SELECT name, phase, avg(duration_ms) AS avg_ms
FROM activity_results GROUP BY name, phase ORDER BY avg_ms DESC LIMIT 10;

-- Failure rate
SELECT (count(*) FILTER (WHERE status != 'Completed'))::DOUBLE / count(*) * 100 AS failure_pct
FROM experiments;
```
## Persistent Store

Every `tumult run` automatically ingests the journal into a persistent DuckDB store at `~/.tumult/analytics.duckdb`. This enables cross-run analytics without manually specifying journal paths.
```sh
# Query the persistent store (no path needed)
tumult analyze --query "SELECT status, count(*) FROM experiments GROUP BY status"

# View store statistics
tumult store stats

# Show store path
tumult store path
```
### Backup and Restore

```sh
# Export the entire store to Parquet files
tumult store backup --output my-backup/

# Restore from backup into the persistent store
tumult import my-backup/
```
### Retention

```sh
# Purge experiments older than 90 days
tumult store purge --older-than-days 90
```
### Disabling Auto-Ingest

```sh
# Run without ingesting into the persistent store
tumult run experiment.toon --no-ingest
```
## Security

DuckDB does not encrypt data at rest by default. The persistent store at `~/.tumult/analytics.duckdb` is written to disk as plaintext. For environments where experiment data is sensitive, take the following steps:
### Filesystem-level encryption
Place the store on an encrypted volume appropriate for your OS:
| OS | Recommended approach |
|---|---|
| Linux | LUKS full-disk encryption, or directory-level encryption with fscrypt / ecryptfs |
| macOS | FileVault 2 (whole-disk), or an encrypted APFS volume |
| Windows | BitLocker, or an encrypted home directory |
### Redirect the store path

Use the `TUMULT_STORE_PATH` environment variable to point the persistent store at a location on an encrypted volume:

```sh
export TUMULT_STORE_PATH=/mnt/encrypted/tumult/analytics.duckdb
tumult run experiment.toon
```
### Directory permissions

`AnalyticsStore::open` automatically creates the store directory with mode `0700` (owner read/write/execute only). This limits filesystem access to the process owner, but it is not a substitute for encryption: a privileged user, or an attacker with direct access to the disk, can read the file regardless of filesystem permission checks.
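For defense in depth you can verify (or re-apply) the same owner-only mode from outside the tool. A stdlib Python sketch, using a temporary directory so it has no side effects (in practice the directory would be `~/.tumult`):

```python
import os
import stat
import tempfile
from pathlib import Path

store_dir = Path(tempfile.mkdtemp()) / "tumult-store"

# mkdir's mode argument is masked by the process umask, so re-apply with
# chmod to guarantee 0700 (owner read/write/execute only).
store_dir.mkdir(mode=0o700)
os.chmod(store_dir, 0o700)

mode = stat.S_IMODE(store_dir.stat().st_mode)
print(oct(mode))  # 0o700 on POSIX systems
```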
## Journal Format for LLM Consumption

Tumult journals use the TOON format, which is concise, structured, and uses deterministic key ordering via `IndexMap`. This makes journals predictable and easy for LLMs to parse.
Key properties of the journal format:
- Compact: TOON uses 40-50% fewer tokens than equivalent JSON
- Deterministic key order: keys always appear in insertion order (via `IndexMap`), so LLMs see a consistent structure across runs
- Structured phases: each journal section (`hypothesis_before`, `method`, `rollbacks`, `hypothesis_after`) is clearly delimited
Best practices for LLM consumption:
- Use `tumult export --format json` to convert journals to JSON for LLM input when broader tool compatibility is needed
- Query journals directly via the MCP tool `tumult_read_journal` for interactive LLM workflows
- For batch analysis, load journals into DuckDB and export query results as CSV
## CLI Usage

```sh
# Default summary (from persistent store)
tumult analyze

# Load from journal files
tumult analyze journals/

# Custom SQL
tumult analyze journals/ --query "SELECT * FROM experiments WHERE status = 'Deviated'"

# Single journal
tumult analyze journal.toon

# Export to Parquet
tumult export journal.toon --format parquet

# Export to CSV
tumult export journal.toon --format csv

# Trend analysis
tumult trend journals/ --metric resilience_score --last 30d
```