
PtcRunner


Build LLM agents that write and execute programs. SubAgents combine the reasoning power of LLMs with the computational precision of a sandboxed interpreter.

Using an MCP client (Claude Desktop, Cursor, Cline, Claude Code)? See mcp_server/ for ptc_runner_mcp, an MCP server that exposes the PTC-Lisp sandbox over stdio JSON-RPC. The README there has ready-to-paste claude_desktop_config.json, cline_mcp_settings.json, and Cursor mcp.json snippets. For the conceptual overview (when to use it, comparison with Python / JS execution servers, security model), see docs/mcp-server.md.

Quick Start

# Runnable doctest — uses a mock LLM so it works without API access.
# In production, swap `mock_llm` for `PtcRunner.LLM.callback("haiku")`.
iex> mock_llm = fn _request ->
...>   {:ok, "(->> (tool/get_orders) (filter #(> % data/threshold)) (reduce +))"}
...> end
iex> {:ok, step} = PtcRunner.SubAgent.run(
...>   "Total value of orders over ${{threshold}}",
...>   tools: %{"get_orders" => fn _ -> [1500.0, 950.0, 50.0] end},
...>   context: %{threshold: 100},
...>   llm: mock_llm,
...>   max_turns: 1
...> )
iex> step.return
2450.0

The llm: option accepts any 1-arity function — for tests, pass an inline lambda like mock_llm above. There is no separate stub/mock helper. See the Testing guide for scripted callbacks and integration patterns.
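
For multi-turn tests, a scripted callback returns a different canned response on each LLM call. A minimal sketch using a plain Elixir Agent to hold the script — the {:ok, source} return shape matches mock_llm above, and the two PTC-Lisp programs are illustrative placeholders, not library idioms:

# Hypothetical scripted mock: pops one canned response per LLM call
{:ok, script} =
  Agent.start_link(fn ->
    [
      {:ok, "(def orders (tool/get_orders))"},
      {:ok, "(return (reduce + orders))"}
    ]
  end)

scripted_llm = fn _request ->
  Agent.get_and_update(script, fn [next | rest] -> {next, rest} end)
end

# Pass as `llm: scripted_llm` (with `max_turns: 2` for this two-step script)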

Try it yourself: The Getting Started guide includes fully runnable examples you can copy-paste.

The SubAgent doesn't answer directly - it writes a program that computes the answer:

(->> (tool/get_orders)
     (filter #(> (:amount %) 100))
     (sum-by :amount))

This is Programmatic Tool Calling: instead of the LLM being the computer, it programs the computer.

Why PtcRunner?

LLMs as programmers, not computers. Most agent frameworks treat LLMs as the runtime. PtcRunner inverts this: LLMs generate programs that execute deterministically in a sandbox. Tool results stay in memory — the LLM explores data through code, exposing only relevant findings. This scales to thousands of items without context limits and eliminates hallucinated counts.

Best suited for: Document analysis (agentic RAG), log analysis, data aggregation, multi-source joins — any task where raw data volume would overwhelm an LLM's context window.

Key Features

  • Two execution modes: PTC-Lisp for multi-turn agentic workflows with tools, or text mode for direct LLM responses with optional native tool calling
  • Signatures: Type contracts ({sentiment :string, score :float}) that validate outputs and drive auto-retry on mismatch
  • Context firewall: _ prefixed fields stay in BEAM memory, hidden from LLM prompts
  • Transactional memory: def persists data across turns without bloating context (see the sketch after this list)
  • Composable SubAgents: Nest agents as tools with isolated state and turn budgets
  • Recursive agents (RLM): Agents call themselves via :self tools to subdivide large inputs
  • Ad-hoc LLM queries: llm-query calls an LLM from within PTC-Lisp with signature-validated responses
  • Observable: Telemetry spans for every turn, LLM call, and tool call with parent-child correlation. JSONL trace logs with Chrome DevTools flame chart export for debugging multi-agent flows (interactive Livebook)
  • Context compaction: Pressure-triggered trimming for long-running multi-turn agents — opt in with compaction: true to drop older turns once a turn or token threshold is hit
  • BEAM-native: Parallel tool calling (pmap/pcalls), process isolation with timeout and heap limits, fault tolerance
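
For example, transactional memory keeps fetched data out of the prompt between turns. A rough PTC-Lisp sketch (illustrative only; see the guides for the exact def semantics):

;; Turn 1 — def persists the result in agent memory, not in the prompt
(def orders (tool/get_orders))

;; A later turn reuses it directly — no re-fetch, no context bloat
(count orders)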

Examples

Parallel tool calling - fetch data concurrently:

;; LLM generates this - executes in parallel automatically
(let [[user orders stats] (pcalls #(tool/get_user {:id data/user_id})
                                   #(tool/get_orders {:id data/user_id})
                                   #(tool/get_stats {:id data/user_id}))]
  {:user user :order_count (count orders) :stats stats})

Context firewall - keep large data out of LLM prompts:

# The LLM sees: %{summary: "Found 3 urgent emails"}
# Elixir gets: %{summary: "...", _email_ids: [101, 102, 103]}
signature: "{summary :string, _email_ids [:int]}"

Ad-hoc LLM judgment from code - the LLM writes programs that call other LLMs, with typed responses and parallel execution:

;; LLM generates this - each llm-query runs in parallel via pmap
(pmap (fn [item]
        (tool/llm-query {:prompt "Rate urgency: {{desc}}"
                         :signature "{urgent :bool, reason :string}"
                         :desc (:description item)}))
      data/items)

The agent decides what to ask and how to structure the response — at runtime, from within the generated program. Enable with llm_query: true. See the LLM Agent Livebook for a full example.

Compile SubAgents - LLM writes the orchestration logic once, execute deterministically:

# Orchestrator with SubAgentTools + pure Elixir functions
{:ok, compiled} = SubAgent.compile(orchestrator, llm: my_llm)

# LLM generated: (loop [joke initial, i 1] (if (tool/check ...) (return ...) (recur ...)))

# Execute with zero orchestration cost - only child SubAgents call the LLM
compiled.execute.(%{topic: "cats"}, llm: my_llm)

See the Joke Workflow Livebook for a complete example.

Text Mode

Not every task needs PTC-Lisp. Text mode (output: :text) uses the LLM provider's native tool calling API — ideal for smaller models or straightforward tasks:

# Plain text — no signature, raw string response
{:ok, step} = SubAgent.run(
  "Summarize this article: {{text}}",
  context: %{text: article},
  output: :text,
  llm: my_llm
)
step.return  #=> "The article discusses..."

# Structured JSON — signature validates the response
{:ok, step} = SubAgent.run(
  "Classify the sentiment of: {{text}}",
  context: %{text: "I love this product!"},
  output: :text,
  signature: "() -> {sentiment :string, score :float}",
  llm: my_llm
)
step.return  #=> %{"sentiment" => "positive", "score" => 0.95}

Text mode also supports tools. Define tools as arity-1 functions that receive a map of arguments:

defmodule Calculator do
  @doc "Add two numbers"
  @spec add(%{a: integer(), b: integer()}) :: integer()
  def add(%{"a" => a, "b" => b}), do: a + b

  @doc "Multiply two numbers"
  @spec multiply(%{a: integer(), b: integer()}) :: integer()
  def multiply(%{"a" => a, "b" => b}), do: a * b
end

PtcRunner auto-extracts the @doc and @spec into tool descriptions and JSON Schema for the LLM provider's native tool calling API — just pass bare function references:

{:ok, step} = SubAgent.run(
  "What is (3 + 4) * 5?",
  output: :text,
  signature: "() -> {result :int}",
  tools: %{
    "add" => &Calculator.add/1,
    "multiply" => &Calculator.multiply/1
  },
  llm: my_llm
)
step.return["result"]  #=> 35

For full control (or anonymous functions), pass an explicit signature string instead. See the Text Mode guide for all four variants (plain text, JSON, tool+text, tool+JSON).
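
With anonymous functions there is no @doc or @spec to extract, so the tuple form supplies the contract explicitly. A sketch reusing the add tool (the tuple syntax matches the example in the Signatures section below):

tools: %{
  # Explicit signature stands in for @doc/@spec extraction
  "add" => {fn %{"a" => a, "b" => b} -> a + b end,
            signature: "(a :int, b :int) -> :int"}
}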

PTC-Lisp Transport (ptc_transport)

For output: :ptc_lisp agents, ptc_transport controls how the LLM ships its program. :content (default) parses a markdown-fenced PTC-Lisp block from the assistant message — one program, one deterministic orchestration, lower latency and cost in a single LLM turn. :tool_call (opt-in) exposes a single internal ptc_lisp_execute tool to the provider's native tool-calling API; the model can call it zero or more times before returning a final answer directly. App tools stay inside PTC-Lisp in both transports — only ptc_lisp_execute is exposed natively.

  • :content (default) — use when one PTC-Lisp program is enough. Lowest latency and cost.
  • :tool_call (opt-in) — use when native tool calling is materially more reliable than fenced-code parsing on your provider/model, or the workload genuinely needs iterative refinement across multiple program executions.

:tool_call turns one program into a ReAct-style loop: that's a tradeoff, not an upgrade. Pay for it deliberately. Models without native tool calling cannot use :tool_call — those runs surface as :llm_error, with no fallback. See the PTC-Lisp Transport guide for the full decision criteria and a runnable walkthrough.
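
Opting in is a per-agent setting. A sketch, assuming ptc_transport is accepted as a SubAgent.run option (the prompt and context here are placeholders):

# Default is ptc_transport: :content — fenced PTC-Lisp in the message body
{:ok, step} = SubAgent.run(
  "Summarize the failures in {{log}}",
  context: %{log: log},
  ptc_transport: :tool_call,  # opt in to the native tool-calling transport
  llm: my_llm
)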

Signatures and JSON Schema

Signatures are compact type contracts that validate SubAgent inputs and outputs:

"(query :string, limit :int) -> {total :float, items [{id :int, name :string}]}"

Under the hood, PtcRunner converts signatures to JSON Schema in two places:

  • Tool definitions (text mode with tools): tool signatures → JSON Schema parameters sent to the LLM provider's native tool calling API
  • Structured output (text mode with a complex return type): return signature → JSON Schema passed to the LLM callback for provider-specific structured output (e.g., OpenAI response_format)
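
As a rough illustration (not the library's exact output), the parameter side of a tool signature like (query :string, limit :int) becomes a JSON Schema object along these lines, shown here as the equivalent Elixir map:

%{
  "type" => "object",
  "properties" => %{
    "query" => %{"type" => "string"},
    "limit" => %{"type" => "integer"}
  },
  "required" => ["query", "limit"]
}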

In PTC-Lisp mode, signatures stay in their compact form — the LLM sees them in the prompt and PtcRunner validates the result directly. JSON Schema is only generated when interfacing with LLM provider APIs that require it.

Auto-extraction from @spec means you can define tools as regular Elixir functions and skip writing signatures by hand. For full control, pass an explicit signature string:

"search" => {&MyApp.search/2, signature: "(query :string, limit :int) -> [{id :int}]"}

See Signature Syntax for the full type reference.

Installation

def deps do
  [
    {:ptc_runner, "~> 0.10.1"},
    {:req_llm, "~> 1.8"}  # optional — enables built-in LLM adapter
  ]
end

With req_llm installed, create LLM callbacks with zero configuration:

llm = PtcRunner.LLM.callback("openrouter:anthropic/claude-haiku-4.5")

# or with prompt caching
llm = PtcRunner.LLM.callback("bedrock:haiku", cache: true)

PtcRunner.LLM.callback/2 routes by model prefix (openrouter:, bedrock:, anthropic:, ollama:, etc.) and handles structured output, tool calling, and prompt caching. See the LLM Setup guide for all providers, streaming, custom adapters, and framework integration.

Documentation

  • Guides
  • Reference
  • Interactive Livebooks

Trace Viewer

A built-in web UI for browsing execution traces with turn-by-turn drill-down:

mix ptc.viewer --trace-dir traces

See Observability Guide for details.

Low-Level API

For direct program execution without the agentic loop:

{:ok, step} = PtcRunner.Lisp.run(
  "(->> data/items (filter :active) (count))",
  context: %{items: items}
)
step.return  #=> 3

Programs run in isolated BEAM processes with resource limits (1s timeout, 10MB heap).

See PtcRunner.Lisp module docs for options.

License

MIT
