xFlow vs LangGraph
What LangGraph defines as a workflow runtime and an action (tool) runtime, what its execution and persistence models actually buy you, and where xFlow's data-as-definition model goes structurally beyond it. Includes the optimization and provability frame xFlow unlocks once flows stop being opaque code and start being typed IR.
Setup
The question this page answers.
LangGraph is the most popular agent-graph framework in TypeScript and Python. It owns the agent-loop slot in the LangChain ecosystem. xFlow is a different shape: graph-as-data, statechart-shaped, multi-substrate, signed. This page strips LangGraph down to its primitives, names where the model bottoms out, and shows what xFlow adds. If LangGraph alone fits your situation, use LangGraph alone. If the gaps below describe your reality, xFlow is justified.
What is LangGraph?
A typed-state graph builder, a Pregel-like superstep runtime, a tool-call action runtime, and a checkpointer interface. Production-grade in its center.
What's outside its model?
Definition distribution, statechart richness, multi-substrate runtime, multi-writer claims, browser placement, signing, optimization, and provability.
When is LangGraph alone enough?
Single-process agent loops, LangChain-native tooling, linear-ish flows, trusted authors, server-only execution.
Baseline
A minimal LangGraph agent loop.
The canonical shape: a state schema with a messages channel, a model node, a ToolNode, a conditional edge that decides 'tools or done?', and a checkpointer. This is the default ReAct agent pattern.
// LangGraph.js: minimal agent loop.
import { StateGraph, START, END, Annotation, MemorySaver } from "@langchain/langgraph"
import { ChatOpenAI } from "@langchain/openai"
import { ToolNode } from "@langchain/langgraph/prebuilt"
import { tool } from "@langchain/core/tools"
import type { AIMessage, BaseMessage } from "@langchain/core/messages"
import { z } from "zod"

const State = Annotation.Root({
  messages: Annotation<BaseMessage[]>({
    reducer: (left, right) => left.concat(right), // ← channel reducer
    default: () => [],
  }),
})

const search = tool(async ({ q }) => fetchSearch(q), { // fetchSearch: your search backend
  name: "search",
  description: "search the web",
  schema: z.object({ q: z.string() }),
})

const model = new ChatOpenAI({ model: "gpt-4o" }).bindTools([search])
const checkpointer = new MemorySaver()

const graph = new StateGraph(State)
  .addNode("agent", async (state) => ({
    messages: [await model.invoke(state.messages)],
  }))
  .addNode("tools", new ToolNode([search]))
  .addEdge(START, "agent")
  .addConditionalEdges("agent", (state) => {
    const last = state.messages[state.messages.length - 1] as AIMessage
    return last.tool_calls?.length ? "tools" : END
  })
  .addEdge("tools", "agent")
  .compile({ checkpointer }) // ← persistence

await graph.invoke({ messages: [...] }, { configurable: { thread_id: "u1" } })

Workflow runtime
Pregel-like supersteps over a typed state.
LangGraph's execution model is BSP (bulk synchronous parallel). Each superstep schedules a wave of nodes whose preconditions are met, runs them in parallel, merges their partial state into channels via reducers, persists a checkpoint, and computes the next wave. recursion_limit caps the supersteps.
// LangGraph executes in supersteps (Pregel-like BSP model).
superstep 0          superstep 1          superstep 2
┌──────────┐         ┌──────────┐         ┌──────────┐
│  agent   │ ──────▶ │  tools   │ ──────▶ │  agent   │ ──▶ END
└──────────┘         └──────────┘         └──────────┘
                     (parallel
                      if many calls)

// Inside each superstep:
// 1. Schedule nodes whose preconditions are met.
// 2. Execute scheduled nodes IN PARALLEL.
// 3. Merge each node's returned partial state into channels (via reducers).
// 4. Persist the new state via the checkpointer.
// 5. Evaluate edges to schedule the next superstep.

// Configuration knobs:
// recursion_limit → max supersteps (default 25)
// thread_id       → keys persistence + interrupts
// stream_mode     → values | updates | messages | debug | custom

State + channels
State is a typed object (Annotation in TS, TypedDict in Python). Each field is a channel with a reducer: the default is "last value wins"; messages uses an append reducer.
Nodes
async (state, config) => Partial<State>. Returned partials merge into channels at the end of the superstep.
Edges
addEdge for unconditional, addConditionalEdges for branches. Multiple outgoing edges from one node = parallel fanout.
Streaming
.stream() with stream_mode: values · updates · messages · debug · custom. Token streaming is supported per node via LangChain runnables.
Subgraphs
A compiled graph can be added as a node in another graph. State is propagated by key intersection or via a wrapper.
Recursion limit
Default cap of 25 supersteps prevents runaway agent loops. Configurable per invocation.
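The superstep loop described above can be sketched as a runnable miniature in plain TypeScript. This is an illustration of the BSP model, not LangGraph's actual scheduler; names like `NodeFn`, `merge`, and `invoke` are made up for the sketch, and the state is simplified to a single append-reduced channel.

```typescript
type State = { messages: string[] }
type NodeFn = (s: State) => Partial<State>
type EdgeFn = (s: State) => string[] // names of next nodes; [] means END

interface Graph {
  nodes: Record<string, NodeFn>
  edges: Record<string, EdgeFn>
}

// Channel reducers: `messages` appends; other channels would be last-value-wins.
function merge(state: State, partials: Partial<State>[]): State {
  const next = { ...state }
  for (const p of partials) {
    if (p.messages) next.messages = next.messages.concat(p.messages)
  }
  return next
}

function invoke(graph: Graph, state: State, start: string, recursionLimit = 25): State {
  let wave = [start]
  for (let step = 0; step < recursionLimit && wave.length > 0; step++) {
    // 1-2. Run every scheduled node against the same pre-step state.
    const partials = wave.map((name) => graph.nodes[name](state))
    // 3. Merge partial states into channels via reducers.
    state = merge(state, partials)
    // 4. A real runtime would persist a checkpoint here.
    // 5. Evaluate edges to schedule the next superstep.
    wave = wave.flatMap((name) => graph.edges[name]?.(state) ?? [])
  }
  return state
}

// A two-node loop: agent alternates with tools until a tool result exists.
const loop: Graph = {
  nodes: {
    agent: (s) => ({ messages: [`ai:${s.messages.length}`] }),
    tools: () => ({ messages: ["tool:result"] }),
  },
  edges: {
    agent: (s) => (s.messages.some((m) => m.startsWith("tool:")) ? [] : ["tools"]),
    tools: () => ["agent"],
  },
}
const out = invoke(loop, { messages: [] }, "agent")
// out.messages: ["ai:0", "tool:result", "ai:2"]
```

Note that all nodes in a wave read the same pre-step state; their partials only become visible at the superstep boundary, which is exactly what makes the parallel fanout well-defined.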
Action runtime
ToolNode + a tool registry, all in-process.
The 'action runtime' in LangGraph is the slot where tools execute. ToolNode is the prebuilt: it reads the last AIMessage from state.messages, runs each tool_call's matching tool, and appends ToolMessages back. Tools are LangChain Tool objects (or zod-defined tool() functions). Everything runs in the same Node.js / Python process as the graph.
// The "action runtime" in LangGraph is ToolNode + a tool registry.
const tools = [search, fetchUser, sendEmail]
const toolNode = new ToolNode(tools)

// ToolNode internals (paraphrased):
// 1. Read the last AIMessage from state.messages.
// 2. For each tool_call in that message:
//    a. Look up the tool by name in the registry.
//    b. Run tool.invoke(args, config) → in-process, same Node.js runtime.
//    c. Wrap result/error as a ToolMessage.
// 3. Return { messages: [...toolMessages] } → channel reducer appends.

// Action surface ≡ functions in this Node.js process.
// To run a tool elsewhere (different machine / browser / sandbox / signed)
// you write a custom node that proxies. There is no built-in placement
// or claim model: the graph runtime owns one execution scope.

Where the action runtime stops
ToolNode owns the orchestration of tool calls. It does not own, and LangGraph does not provide, placement (where a tool runs), claim mode (who's authorized to run it), cross-repo resolution (id@version), signing, sandboxing, or remote execution. Any of those needs a custom node that proxies. Every team builds this themselves.
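A minimal sketch of the proxying custom node such a team would write, with the transport injected so the same node can target any remote executor. Every name here, including the endpoint URL, is hypothetical; this is the shape of the workaround, not a real API.

```typescript
type ToolCall = { name: string; args: unknown }
type ToolMessage = { tool: string; content: string }
type Transport = (call: ToolCall) => Promise<ToolMessage>

// The returned function has the shape of a graph node: state in, partial state out.
function makeRemoteToolNode(transport: Transport) {
  return async (state: { toolCalls: ToolCall[] }) => {
    // Each call goes through the proxy instead of a local in-process registry.
    const messages = await Promise.all(state.toolCalls.map(transport))
    return { messages }
  }
}

// Example transport: POST each call to a remote executor (hypothetical URL).
const httpTransport: Transport = async (call) => {
  const res = await fetch("https://executor.internal/run-tool", {
    method: "POST",
    headers: { "content-type": "application/json" },
    body: JSON.stringify(call),
  })
  return (await res.json()) as ToolMessage
}
```

Everything the remote side needs (authorization, sandboxing, retries, idempotency) lives outside the graph runtime and has to be rebuilt per team, which is the point of this row.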
Persistence
The Checkpointer interface: one snapshot per superstep.
A checkpointer persists a serialized state snapshot at each superstep boundary, keyed by thread_id. This is what gives LangGraph its durability story, time-travel debugging, and HITL pause/resume. Storage backends: in-memory, SQLite, Postgres, Redis.
// Persistence is a single interface: the Checkpointer.
// MemorySaver    → in-process, lost on restart
// PostgresSaver  → pg-backed
// SqliteSaver    → sqlite-backed
// RedisSaver     → community
// Custom         → implement put / get / list / putWrites

interface Checkpointer {
  put(config, checkpoint, metadata, newVersions): Promise<RunnableConfig>
  get(config): Promise<Checkpoint | undefined>
  list(config, options?): AsyncIterableIterator<CheckpointTuple>
  putWrites(config, writes, taskId, taskPath?): Promise<void>
}

// One checkpoint per superstep boundary.
// One thread_id = one logically continuous run; supports time-travel
// (load any historical checkpoint and branch from it).

What this gives you, what it doesn't
You get: durability across process restarts, ability to load any historical checkpoint, branch from a past state, and resume from a paused step. You don't get: an event log per se (it's snapshot-shaped), signed history, replay onto a different substrate from individual events, or multi-writer participation on the same thread.
Human-in-the-loop
interrupt() + Command for pauses and resumes.
LangGraph's HITL primitive is interrupt() inside a node: it pauses the graph and surfaces the value. The caller resumes with new Command({ resume: ... }) (or update / goto). Static interrupts via interrupt_before / interrupt_after on compile are also supported.
// HITL via the dynamic interrupt() primitive.
import { interrupt, Command } from "@langchain/langgraph"

const graph = new StateGraph(State)
  .addNode("approve", async (state) => {
    const decision = interrupt({ // ← pauses the graph here
      ask: "approve charge?",
      amount: state.amount,
    })
    if (decision !== "yes") return { aborted: true }
    return { approved: true }
  })
// ...

// Resume with Command:
await graph.invoke(new Command({ resume: "yes" }),
  { configurable: { thread_id: "u1" } })

Gap analysis
Where LangGraph's model bottoms out: nine axes.
Each row names what LangGraph does (or doesn't do), the consequence of using only LangGraph, and what xFlow adds. xFlow's additions are designed to coexist with LangGraph: a LangGraph node can wrap an xFlow runtime call, and an xFlow definition can be hand-translated into a LangGraph graph for runtimes that only speak LangGraph.
Axis
Graph-is-code vs graph-is-data
LangGraph: A LangGraph graph is built imperatively in TypeScript / Python. Nodes are arbitrary host-language functions. The graph is materialized at process start and lives in memory. There is no canonical, language-independent serialization.
Consequence: You can't inspect, sign, diff, version, content-address, or distribute a LangGraph graph as data. Sharing a graph means sharing source code. Cross-language consumption is impossible. Static analysis or optimization passes have to be re-implemented in each language SDK.
What xFlow adds
Graph-is-code vs graph-is-data
Flow definitions are canonical JSON (RFC 8785) and SCXML. Each registry entry has a sha256 content hash and an ed25519 signature. The graph can be inspected, diffed, validated, and authorized without executing any of its bodies. Any HTTP/S3 client can read an xFlow registry โ the consumer doesn't have to be JavaScript.
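A sketch of how content-addressing falls out of canonical JSON. The real scheme named above is RFC 8785 (JCS); the sorted-key stringify below is a simplified stand-in that only handles plain JSON values, shown just to make the property concrete.

```typescript
import { createHash } from "node:crypto"

// Simplified canonicalization: recursively sort object keys.
// (RFC 8785 additionally specifies number and string serialization rules.)
function canonicalize(value: unknown): string {
  if (Array.isArray(value)) return `[${value.map(canonicalize).join(",")}]`
  if (value !== null && typeof value === "object") {
    const obj = value as Record<string, unknown>
    const entries = Object.keys(obj)
      .sort()
      .map((k) => `${JSON.stringify(k)}:${canonicalize(obj[k])}`)
    return `{${entries.join(",")}}`
  }
  return JSON.stringify(value)
}

// The content hash is the registry address: same definition, same address.
function contentHash(def: unknown): string {
  return createHash("sha256").update(canonicalize(def)).digest("hex")
}

const a = contentHash({ id: "agent.search-loop", version: "1.0.0" })
const b = contentHash({ version: "1.0.0", id: "agent.search-loop" })
// a === b: key order no longer affects the address
```

Once the serialization is canonical, two writers producing the same definition produce the same hash, which is what makes diffing, deduplication, and signature verification well-defined.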
Axis
Statechart richness
LangGraph: LangGraph is DAG + cycles + conditional edges. There are no parallel-region states with onDone semantics, no hierarchical state nesting beyond compiled subgraphs, and no history states for resumption. The execution model is a Pregel-like superstep loop over a typed-state object.
Consequence: Patterns that need 'fork these branches and join on all-done' have to be hand-coded with channel reducers and barrier nodes. Hierarchical 'macro-state with sub-flow that knows its parent' patterns are mostly emulated via subgraphs, which lose static analyzability. Resume-where-you-were after a transition needs custom state.
What xFlow adds
Statechart richness
xState as the canonical language. Statecharts give you parallel, hierarchical, and history states; guards and actions; invoked actors; SCXML serialization for W3C interop. The xState semantics are formal and mature; that's why xFlow uses them as the substrate language rather than re-inventing one.
states: {
  review: {
    type: "parallel",
    states: {
      humanApproval: { /* ... */ },
      fraudCheck: { /* ... */ },
    },
    onDone: "charge", // joins on all-done
  },
}

Axis
Action runtime model
LangGraph: Tool/action nodes are functions in the same Node.js process as the graph. ToolNode looks up tools in a registry passed at compile time. There is no built-in concept of where an action runs, who's authorized to run it, or how to resolve actions across repositories.
Consequence: Running a tool on a different machine, in a sandbox, in a browser tab, or remotely-but-signed requires a custom node that proxies. Every team builds this themselves; nothing is portable. Tenant-supplied tools require a separate sandboxing product.
What xFlow adds
Action runtime model
Actions are first-class registry entries (`action:<id>@<version>`). They resolve via a content-addressed registry, can declare placement (`server` / `browser` / `gpu`), claim mode (authority / lease / deterministic-election / optimistic-idempotent), retries, idempotency keys, and side-effect kind. The runtime resolves actions at run-time: they don't have to live in the same process as the flow.
defineAction({
  id: "search",
  version: "1.0.0",
  placement: { kind: "server", capabilities: ["network.public"] },
  claim: { mode: "lease", ttlMs: 30_000 },
  sideEffects: { kind: "pure", idempotencyRequired: false },
  retry: { maxAttempts: 3, backoffMs: 2_000 },
})

Axis
Persistence model
LangGraph: A pluggable Checkpointer interface saves a serialized state snapshot at each superstep boundary. Storage is keyed by thread_id; time-travel works by loading any historical checkpoint. The state is opaque to the persistence layer: it's a serialized blob plus channel versions.
Consequence: There's no event log per se; the persistence is checkpoint-shaped. You can't replay a run from individual events on a new substrate, only resume from a checkpoint. There's no signing of history; if you want a tamper-evident audit log, that's separate plumbing.
What xFlow adds
Persistence model
An xSync actor log per run: each lifecycle transition is a signed (ed25519) event with causal predecessors. Current state is a deterministic reducer over the log. Storage is pluggable down to plain S3WORM or IndexedDB. The log itself is the audit artifact.
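The 'deterministic reducer over the log' idea can be sketched in a few lines. The event shapes below are illustrative, not xSync's actual schema, and real events would also carry signatures and causal predecessors.

```typescript
// Illustrative event shapes: a lifecycle transition and a step output.
type Event =
  | { kind: "transition"; to: string; seq: number }
  | { kind: "output"; key: string; value: unknown; seq: number }

interface RunState {
  state: string
  outputs: Record<string, unknown>
}

// Current state is a pure fold over the ordered log: any peer replaying the
// same events derives the same state, regardless of arrival order.
function reduceLog(log: Event[]): RunState {
  const ordered = [...log].sort((a, b) => a.seq - b.seq)
  const run: RunState = { state: "initial", outputs: {} }
  for (const e of ordered) {
    if (e.kind === "transition") run.state = e.to
    else run.outputs[e.key] = e.value
  }
  return run
}
```

Because the fold is deterministic, "resume" is just "replay", and the log itself, rather than any derived snapshot, is the artifact worth signing and auditing.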
Axis
Single executor vs multi-writer
LangGraph: One process owns a thread_id at a time. Concurrent writes to the same thread are serialized by the checkpointer. The execution model assumes a single host driving the graph forward.
Consequence: Browsers, edge workers, and long-running backend jobs can't be peer participants in the same run. Human-approval steps need a separate channel. Multi-region failover means a separate hot-standby story.
What xFlow adds
Single executor vs multi-writer
Multiple peers can be writers on the same run. Claim modes describe who's allowed to advance which step: `authority` (single designated actor), `lease` (TTL-bounded ownership), `deterministic-election` (peers derive the same winner), `optimistic-idempotent` (first valid output wins).
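The core property of the `deterministic-election` mode can be shown directly: every peer runs the same pure function over shared inputs and derives the same winner with no coordination round. The hash-based scoring below is an assumed scheme for illustration, not xFlow's specified algorithm.

```typescript
import { createHash } from "node:crypto"

// Every peer can compute this locally; same inputs, same winner.
function electWinner(runId: string, stepId: string, peers: string[]): string {
  const score = (peer: string) =>
    createHash("sha256").update(`${runId}:${stepId}:${peer}`).digest("hex")
  // Highest hash wins; distinct peer ids cannot tie under sha256.
  return [...peers].sort((a, b) => (score(a) < score(b) ? 1 : -1))[0]
}

const peers = ["alpha", "beta", "gamma"]
const winner = electWinner("run-1", "charge.step", peers)
```

Mixing `stepId` into the score spreads work across peers over a run instead of always electing the same node, which is one reason to hash per step rather than per run.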
Axis
Browser participation
LangGraph: LangGraph.js technically runs in browsers, but the design center is server-side execution. Node functions run wherever the runtime is hosted; the browser is typically a client of an HTTP API, not a peer.
Consequence: Steps that should run client-side (local file IO, OS keychain access, offline-first capture, human approval in a tab) need a separate orchestration mechanism. There's no single audit trail across server and client work.
What xFlow adds
Browser participation
A flow can have steps with `placement: { kind: 'browser' }` that execute in a tab as typed peers, append events to the same substrate, and join the same run. xSync transports (BroadcastChannel, WebSocket, SSE) make this real today.
Axis
Signing and trust
LangGraph: Graphs are JS code. Tools are JS functions. There's no signing of definitions or events. Trust = 'whoever can deploy the code.'
Consequence: Loading a community-contributed graph or tool means executing untrusted code. Multi-tenant platforms need a sandboxed compute layer. There's no tamper-evident history of what a run actually did.
What xFlow adds
Signing and trust
Flow defs, action defs, and event log entries are ed25519-signed. A trust list scopes which signers can run inside a substrate. Runtime can require signature verification before resolving an `id@version`.
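The signing primitive itself is standard. A sketch using Node's built-in ed25519 support; the payload shape is illustrative, not xFlow's actual signing envelope.

```typescript
import { generateKeyPairSync, sign, verify } from "node:crypto"

// A signer's keypair; in practice the public key would be on the trust list.
const { publicKey, privateKey } = generateKeyPairSync("ed25519")

// Sign the canonical bytes of a definition (payload shape is illustrative).
const defBytes = Buffer.from(JSON.stringify({ id: "agent.search-loop", version: "1.0.0" }))

// ed25519 is a pure signature scheme: pass null for the digest algorithm.
const signature = sign(null, defBytes, privateKey)
const ok = verify(null, defBytes, publicKey, signature)

// Any tampering with the payload invalidates the signature.
const tampered = Buffer.from(JSON.stringify({ id: "agent.search-loop", version: "9.9.9" }))
const bad = verify(null, tampered, publicKey, signature)
```

The runtime-side check is then cheap: before resolving an `id@version`, verify the entry's signature against the substrate's trust list and refuse to execute on failure.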
Axis
Deployment model
LangGraph: LangGraph Server (open-source) provides an HTTP API with persistence, queues, and auth. LangGraph Cloud / Platform is the managed offering. LangGraph Studio is the visual debugger. The SDK runs anywhere Node.js / Python runs.
Consequence: To get production-grade durability + queueing + UI, you adopt LangGraph Server (self-host or managed). The runtime is the deployable unit. The graph itself is bundled into that deployment.
What xFlow adds
Deployment model
A flow definition is JSON in a bucket. A Next route, a Fly Docker worker, a Bitlaunch worker, and a browser tab all resolve the same `id@version` and execute the same definition. The deployment unit is the substrate + worker pool, not the graph.
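A sketch of what `id@version` resolution against a plain-JSON registry snapshot could look like. The registry shape and the range handling (exact versions plus a caret major match) are assumptions for illustration, not xFlow's actual resolver.

```typescript
// Hypothetical registry snapshot: id → version → content hash.
type Registry = Record<string, Record<string, { hash: string }>>

function resolveRef(registry: Registry, ref: string): { version: string; hash: string } {
  const [id, range] = ref.split("@")
  const versions = registry[id]
  if (!versions || !range) throw new Error(`unresolvable ref: ${ref}`)

  // Exact version: direct lookup.
  if (!range.startsWith("^")) {
    const entry = versions[range]
    if (!entry) throw new Error(`unknown version: ${ref}`)
    return { version: range, hash: entry.hash }
  }

  // ^N: newest published version whose major component is N.
  const major = range.slice(1).split(".")[0]
  const match = Object.keys(versions)
    .filter((v) => v.split(".")[0] === major)
    .sort((a, b) => b.localeCompare(a, undefined, { numeric: true }))[0]
  if (!match) throw new Error(`no version matches ${ref}`)
  return { version: match, hash: versions[match].hash }
}
```

Since the snapshot is plain JSON, the same function works in a Next route, a Docker worker, or a browser tab; only the fetch layer differs per substrate.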
Axis
Optimization and provability
LangGraph: Because nodes are opaque host-language functions, the graph is not amenable to whole-program analysis or transformation. There is no static IR for the flow. You can hand-tune prompts and node order; LangSmith provides traces, but optimization stops at human-in-the-loop tuning.
Consequence: There's no equivalent of DSPy for LangGraph: no compiler that takes a flow + a metric and emits an optimized variant. There's no path to formal verification, ZK-provable execution, or cross-substrate compilation, because the input language is 'arbitrary JavaScript.'
What xFlow adds
Optimization and provability
A flow def is a typed, statechart-shaped IR: guards, actions, and invoked actors are symbolic references, not opaque functions. That makes whole-flow optimization tractable: prompt-rewriting passes (DSPy-style), branch reordering, dead-state elimination, action substitution, parallel-region inference, target-substrate compilation. It also opens the door to tamper-evident or zero-knowledge execution: a flow's structure is provable-as-data, and the action layer is the only place untrusted code runs.
// Optimization pass: DSPy-style prompt search bound to a metric.
const optimized = await xflow.optimize(def, {
  metric: "agent.task-success",
  bench: "datasets/triage/v3",
  passes: ["prompt-search", "branch-reorder", "action-substitution"],
})

// Compilation pass: emit a target-specific executor.
const wasm = await xflow.compile(def, { target: "wasm-component" })
const wdk = await xflow.compile(def, { target: "wdk-flat" })

// Verification pass: produce a circuit/witness for ZK execution.
const proof = await xflow.prove(def, runId, { backend: "halo2" })

Forward direction
Flows-as-IR: the optimization and provability story.
The deepest structural difference between LangGraph and xFlow isn't 'parallel regions' or 'multi-writer.' It's that xFlow's definition is a typed IR (statechart with symbolic action references), while LangGraph's definition is host-language code with opaque function nodes. That single distinction unlocks an entire research direction LangGraph cannot reach: whole-flow optimization, target-substrate compilation, and verifiable execution.
DSPy of flows
DSPy compiles prompt-and-module programs against a metric and a dataset. xFlow can do the same for entire flows (search over prompts, branch ordering, action choice, guard predicates) because the flow shape is data the optimizer can manipulate.
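As a toy illustration of why graph-is-data matters here, a prompt-search pass is just a pure function from definition to definition. All shapes below are hypothetical, and a real metric would run benchmarks rather than score strings.

```typescript
// Hypothetical, minimal flow-def shape: states that may carry a prompt.
interface FlowDef {
  states: Record<string, { prompt?: string }>
}
type Metric = (def: FlowDef) => number

// Enumerate prompt variants for one state, score each candidate definition,
// and return the best-scoring rewrite. The input def is never mutated.
function promptSearch(def: FlowDef, state: string, variants: string[], metric: Metric): FlowDef {
  let best = def
  let bestScore = metric(def)
  for (const prompt of variants) {
    const candidate: FlowDef = {
      states: { ...def.states, [state]: { ...def.states[state], prompt } },
    }
    const score = metric(candidate)
    if (score > bestScore) {
      best = candidate
      bestScore = score
    }
  }
  return best
}
```

The same skeleton generalizes to the other passes named above (branch reordering, action substitution): each is a def-to-def function plus a metric, which is only possible because the optimizer can read and rewrite the flow as data.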
ASIC metaphor
A flow IR is portable across execution targets. The same definition compiles to a memory-substrate runner, a Postgres-backed worker, a WASM component, a flat-DAG WDK workflow, or a future hardware-accelerated executor, without re-authoring.
Provable computation
Because the flow's structure is signed JSON and the action layer is the only place untrusted code runs, the path to tamper-evident, zero-knowledge, or otherwise verifiable execution is concrete: prove the trace through the IR; sandbox or verify the actions individually.
Why LangGraph cannot reach this
Once a node is "an arbitrary JS / Python function," whole-flow optimization is stuck at human-in-the-loop tuning. There's no IR to manipulate, no static graph to analyze, no symbolic action layer to substitute, no portable shape to compile to a different target. LangSmith adds traces and evals on top, but the graph itself stays opaque. xFlow inverts that: the graph is the artifact, the actions are the only opaque part, and they're addressable, signable, and replaceable by id@version.
// xFlow: the same shape, but the graph is data.
defineFlow({
  id: "agent.search-loop",
  version: "1.0.0",
  initial: "agent",
  states: {
    agent: {
      invoke: { src: "action:llm.chat@^2", input: "$.messages" },
      on: {
        TOOL_CALLS: "tools",
        DONE: "done",
      },
    },
    tools: {
      type: "parallel",      // ← LangGraph can't express
      states: {              //   parallel-region semantics
        search: { invoke: { src: "action:search@^1" } },
        fetch: { invoke: { src: "action:fetch@^1" } },
      },
      onDone: "agent",
    },
    done: { type: "final" },
  },
})

// What you get on top of LangGraph:
// • action:* refs resolve from a content-addressed registry (id@version)
// • parallel + history + hierarchical state semantics (xState)
// • multi-substrate (memory · sqlite · postgres · xSync · S3WORM)
// • signed event log; multi-writer claims; browser placement
// • the graph is JSON: analyzable, optimizable, signable, diff-able

Honest accounting
When LangGraph alone is sufficient.
If all five conditions hold, xFlow is overkill. LangGraph + LangGraph Server is a great place to live. The value xFlow adds is real, but it's not free: there's a registry, a substrate model, and a flow-IR mindset to take on. Don't take that on if you don't need it.
Single-process agent loop
Your agent runs in one Node.js or Python process. Tools are local functions. You're happy deploying a LangGraph Server when you need durability.
LangChain-native tooling
Your tools, retrievers, and memory all already live in LangChain abstractions. The integration cost of LangGraph is roughly zero.
Linear-ish agent flows
Your flow is model → tools → model. You don't need parallel regions, hierarchical states, or multi-writer participation: a Pregel DAG with cycles fits.
Trust model = your team
Tools and graphs are written by your team and deployed by your team. You don't need signed definitions or tenant-supplied flows.
Server-only execution
Nothing needs to run in a browser tab as a peer. Human-approval steps go through your existing UI hitting an API.
Why this tranche
When xFlow is justified.
The decision isn't 'LangGraph or xFlow' in the abstract. It's whether the gaps in the previous section describe your situation. For this ecosystem tranche (multi-product family, cross-substrate, optimization-and-provability-curious, agent-integrated) they do.
Multi-product family
xFlow + Switchboard.WTF + xCoder + xAgent.WTF all need to read the same flow definitions. A registry is mandatory.
Cross-substrate flows
The same definition needs to run in a Next route, a Docker worker, a CLI, and a browser tab. LangGraph's single-host execution model can't carry this.
Tenant-supplied flows
Customers upload flow definitions. They must be signed, validated, and policy-checked before running โ and the action surface must be sandboxable.
Statechart semantics actually used
Parallel regions with onDone, hierarchical macro-states with sub-flows, history states for resume. If your flows really are flat DAGs forever, this isn't a reason; but the agent loops we care about aren't.
Optimization and provability
You want flows that can be optimized by a compiler (DSPy-style for prompts, branch reordering, action substitution), compiled to alternative targets (WASM components, WDK flat), or executed with verifiable / zero-knowledge guarantees. That requires a typed IR โ not arbitrary JS.
Signed audit trail
Compliance or trust requires a tamper-evident log of every transition. xSync's signed event log gives you this for free; LangGraph's checkpointer doesn't.
Recap
One sentence.
LangGraph is a typed-state graph builder, a Pregel-like superstep runtime, and a ToolNode-based action runtime, all centered on a single-process executor with a checkpointer for durability.
xFlow trades that single-host execution model for a graph-is-data model: statechart-shaped flow definitions resolved by id@version from a content-addressed registry, signed actions with placement and claim semantics, multi-substrate runtime, and a typed IR that makes optimization, target compilation, and verifiable execution tractable in a way arbitrary-JS-node graphs cannot reach.
For projects that fit LangGraph's center (single-process agent loops, server-only, LangChain-native) LangGraph is the right call. For projects in the xFlow tranche (cross-substrate, multi-writer, signed-and-portable, optimization-curious) the additions pay for themselves.