TOON: Token-Efficient API Output for LLM Workflows

Mon, 15 Jun 2026 00:00:00 +0000

When a person reads an API response, formatting is free. When an LLM agent reads one, every character is metered. The response lands in a context window, the context window is billed by the token, and the format you chose decides how many tokens the same data costs.

JSON spends a lot of tokens on that job. A list of one hundred records repeats every key one hundred times, wraps every string in quotes, and spends tokens on braces and brackets that a model does not need to understand a table of data.

Llm on Restish

TOON: Token-Efficient API Output for LLM Workflows