Middleware

Lifecycle hooks that let you intercept and modify every step of the agent loop. Define them inline in config or as TypeScript files.

yaml

# ra.config.yml
middleware:
  beforeModelCall:
    - "(ctx) => { console.log('Calling model...'); }"
  afterToolExecution:
    - "./middleware/log-tools.ts"

Each middleware is an async (ctx) => void function. Every context object has stop() and signal:

ctx.stop()          // halt the agent loop
ctx.signal.aborted  // check if already stopped

Hooks

Hook	When	Context type
`beforeLoopBegin`	Once at loop start	`LoopContext`
`beforeModelCall`	Before each LLM call	`ModelCallContext`
`onStreamChunk`	Per streaming token	`StreamChunkContext`
`afterModelResponse`	After model finishes	`ModelCallContext`
`beforeToolExecution`	Before each tool call	`ToolExecutionContext`
`afterToolExecution`	After each tool returns	`ToolResultContext`
`afterLoopIteration`	After each full iteration	`LoopContext`
`afterLoopComplete`	After the final iteration	`LoopContext`
`onError`	On exceptions	`ErrorContext`

Middleware runs in array order. Multiple hooks of the same type are executed sequentially.

Context types

LoopContext

Available on all hooks via ctx.loop (or directly for loop-level hooks like beforeLoopBegin, afterLoopIteration, afterLoopComplete).

{
  messages: IMessage[]     // full conversation history
  iteration: number        // current loop iteration (0-indexed)
  maxIterations: number
  sessionId: string
  usage: {
    inputTokens: number
    outputTokens: number
  }
  stop(): void
  signal: AbortSignal
}

ModelCallContext

Used by beforeModelCall and afterModelResponse. You can inspect or modify the request before it's sent.

{
  request: {
    model: string
    messages: IMessage[]
    tools?: ITool[]
    thinking?: 'low' | 'medium' | 'high'
  }
  loop: LoopContext
}

StreamChunkContext

Used by onStreamChunk. Fires for every chunk the model streams back.

{
  chunk:
    | { type: 'text'; delta: string }
    | { type: 'thinking'; delta: string }
    | { type: 'tool_call_start'; id: string; name: string }
    | { type: 'tool_call_delta'; id: string; argsDelta: string }
    | { type: 'tool_call_end'; id: string }
    | { type: 'done'; usage?: { inputTokens: number; outputTokens: number } }
  loop: LoopContext
}

ToolExecutionContext

Used by beforeToolExecution. Fires before each tool is invoked.

{
  toolCall: { id: string; name: string; arguments: string }
  loop: LoopContext
}

ToolResultContext

Used by afterToolExecution. Fires after each tool returns.

{
  toolCall: { id: string; name: string; arguments: string }
  result: { toolCallId: string; content: string; isError?: boolean }
  loop: LoopContext
}

ErrorContext

Used by onError. Fires when an exception occurs during the loop.

{
  error: Error
  phase: 'model_call' | 'tool_execution' | 'stream'
  loop: LoopContext
}

File-based middleware

Export a default async function from a .ts or .js file:

// middleware/audit-log.ts
export default async (ctx) => {
  await Bun.file('audit.jsonl').writer().write(JSON.stringify({
    tool: ctx.toolCall.name,
    args: ctx.toolCall.arguments,
    result: ctx.result.content,
    timestamp: Date.now()
  }) + '\n')
}

Reference it by path in config:

yaml

middleware:
  afterToolExecution:
    - "./middleware/audit-log.ts"

Paths can be relative (to project root), absolute, or use ~ for home directory. Both .ts and .js files are supported — TypeScript is transpiled automatically by Bun.

Inline middleware

Inline expressions are TypeScript strings in your config. They're transpiled at load time. Best for simple, single-expression hooks.

yaml

middleware:
  beforeModelCall:
    - "(ctx) => { console.log('Messages:', ctx.request.messages.length); }"
  onStreamChunk:
    - "(ctx) => { process.stdout.write(ctx.chunk.type === 'text' ? ctx.chunk.delta : '') }"
  onError:
    - "(ctx) => { console.error(`[${ctx.phase}]`, ctx.error.message); }"

Stopping the loop

Any middleware can call ctx.stop() to halt the agent loop early:

// middleware/token-budget.ts — stop if we've used too many tokens
export default async (ctx) => {
  if (ctx.loop.usage.inputTokens + ctx.loop.usage.outputTokens > 100_000) {
    ctx.stop()
  }
}

Timeout

All hooks support a configurable timeout via toolTimeout (default: 30 seconds). If a middleware function exceeds the timeout, the loop continues without waiting.

Middleware ​

Hooks ​

Context types ​

LoopContext ​

ModelCallContext ​

StreamChunkContext ​

ToolExecutionContext ​

ToolResultContext ​

ErrorContext ​

File-based middleware ​

Inline middleware ​

Stopping the loop ​

Timeout ​

See also ​

Middleware

Hooks

Context types

LoopContext

ModelCallContext

StreamChunkContext

ToolExecutionContext

ToolResultContext

ErrorContext

File-based middleware

Inline middleware

Stopping the loop

Timeout

See also