Token Efficiency

Tumult uses TOON as its primary serialization format. One key advantage over JSON is significantly lower token consumption when journals are fed to LLMs.

TOON vs JSON Token Comparison

TOON achieves 40-50% fewer tokens than equivalent JSON for typical experiment journals. The savings come from:

No curly braces or square brackets for structure
No quoted keys
Indentation-based nesting instead of delimiters
No commas between values

Example

The same experiment steady-state hypothesis in both formats:

JSON (approx. 85 tokens)

{
  "steady-state-hypothesis": {
    "title": "API responds within SLA",
    "probes": [
      {
        "type": "probe",
        "name": "api-latency-check",
        "provider": {
          "type": "http",
          "url": "https://api.example.com/health",
          "timeout": 5
        },
        "tolerance": {
          "type": "range",
          "range": [0, 300]
        }
      }
    ]
  }
}

TOON (approx. 45 tokens)

steady-state-hypothesis
  title = "API responds within SLA"
  probes
    - type = "probe"
      name = "api-latency-check"
      provider
        type = "http"
        url = "https://api.example.com/health"
        timeout = 5
      tolerance
        type = "range"
        range = [0, 300]

Rough Numbers

Format	Avg Tokens (typical journal)	Reduction
JSON	~1,200	baseline
TOON	~650	~46%

These numbers vary by journal complexity. Larger experiments with many activities see greater savings because TOON eliminates more structural overhead per entry.

Recommendations

Store and transmit journals in TOON format to minimize token usage
Use tumult export --format json only when downstream tools require JSON
When feeding journals to an LLM, prefer TOON or use tumult_read_journal via MCP

Sparse Journal Serialization

Tumult journals omit optional fields whose value is None (or equivalent zero-value) from serialized output. This is enforced via #[serde(skip_serializing_if = "Option::is_none")] on all optional fields in the journal types:

Type	Optional fields omitted when absent
`Journal`	`baseline_result`, `during_result`, `post_result`, `analysis`
`ActivityResult`	`output`, `error`, `trace_id`, `span_id`
`BaselineResult`	`anomaly_reason`
`DuringResult`	`recovery_at`
`PostResult`	`mttr_s`
`ProbeDuring`	`output`, `error`, `trace_id`, `span_id`
`AnalysisResult`	`accuracy`, `note`

Effect on token counts

A minimal successful experiment journal that completes all phases without deviations omits all error and anomaly fields. A partial run (e.g. baseline-only mode) omits the during_result and post_result sections entirely, reducing output by roughly 30% compared to a journal that serialized every None as null.

Recommendations

Do not add null fields back for completeness — absent fields are semantically identical to null
Downstream code should use Option unwrapping with a default (e.g. result.mttr_s.unwrap_or(0.0))
When writing custom analytics queries against exported JSON, treat missing keys as null