Performance Evaluation

Overview

PerformanceEval measures runtime performance metrics against configurable limits.

Quick Start

import { PerformanceEval } from "@radaros/eval";
import { Agent, openai } from "@radaros/core";

const agent = new Agent({ name: "fast-bot", model: openai("gpt-4o-mini") });

const eval = new PerformanceEval({
  name: "latency-test",
  agent,
  maxDurationMs: 5000,
  maxTimeToFirstTokenMs: 1000,
  maxTokens: 500,
  cases: [
    { name: "greeting", input: "Hello!" },
    { name: "complex", input: "Explain quantum computing" },
  ],
});

const result = await eval.run();

Metrics Tracked

Metric	Config	Description
Duration	`maxDurationMs`	Total wall-clock time
TTFT	`maxTimeToFirstTokenMs`	Time to first token
Tokens	`maxTokens`	Total token consumption
Memory	—	Heap memory delta (always tracked)

Getting Started

Agents

Memory

Skills

Handoff

Cost Tracking

Semantic Cache

Eval Framework

Compliance & Audit

Culture System

Webhooks

Capacity Planning

Observability

Voice Agents

Browser Agents

Models

Teams

Workflows

Storage

Knowledge & RAG

Toolkits

MCP (Model Context Protocol)

A2A (Agent-to-Agent)

Edge & IoT

Transport

Queue

Scheduling

Advanced Features

Performance Evaluation

Overview

Quick Start

Metrics Tracked

Getting Started

Agents

Memory

Skills

Handoff

Cost Tracking

Semantic Cache

Eval Framework

Compliance & Audit

Culture System

Webhooks

Capacity Planning

Observability

Voice Agents

Browser Agents

Models

Teams

Workflows

Storage

Knowledge & RAG

Toolkits

MCP (Model Context Protocol)

A2A (Agent-to-Agent)

Edge & IoT

Transport

Queue

Scheduling

Advanced Features

​Overview

​Quick Start

​Metrics Tracked

Overview

Quick Start

Metrics Tracked