PDF

Extract text, metadata, and page content from PDF files. Accepts file paths, URLs, and base64-encoded data.

Requires the pdf-parse peer dependency.

Quick Start

import { Agent, openai, PdfToolkit } from "@radaros/core";

const pdf = new PdfToolkit();

const agent = new Agent({
  name: "document-analyst",
  model: openai("gpt-4o"),
  instructions: "Extract and summarize content from PDF documents.",
  tools: [...pdf.getTools()],
});

const result = await agent.run("Extract the text from ./report.pdf and summarize the key findings.");

Config

maxLength

number

default:"50000"

Maximum characters to return per extraction. Longer text is truncated.

Tools

Tool	Description
`pdf_extract_text`	Extract all text from a PDF. Accepts a file path, URL, or base64 data.
`pdf_get_metadata`	Get PDF metadata — title, author, page count, creation date.
`pdf_extract_pages`	Extract text from specific pages (1-indexed).

Peer Dependency

npm install pdf-parse

Input Sources

The source parameter accepts three formats:

File path: /path/to/document.pdf
URL: https://example.com/report.pdf
Base64: Raw base64-encoded PDF data

// From file
await agent.run("Extract text from /tmp/invoice.pdf");

// From URL
await agent.run("Extract text from https://example.com/whitepaper.pdf");

SQL Database Git

Getting Started

Agents

Memory

Skills

Handoff

Cost Tracking

Semantic Cache

Eval Framework

Compliance & Audit

Culture System

Webhooks

Capacity Planning

Observability

Voice Agents

Browser Agents

Models

Teams

Workflows

Storage

Knowledge & RAG

Toolkits

MCP (Model Context Protocol)

A2A (Agent-to-Agent)

Edge & IoT

Transport

Queue

Scheduling

Advanced Features

PDF

PDF

Quick Start

Config

Tools

Peer Dependency

Input Sources

Getting Started

Agents

Memory

Skills

Handoff

Cost Tracking

Semantic Cache

Eval Framework

Compliance & Audit

Culture System

Webhooks

Capacity Planning

Observability

Voice Agents

Browser Agents

Models

Teams

Workflows

Storage

Knowledge & RAG

Toolkits

MCP (Model Context Protocol)

A2A (Agent-to-Agent)

Edge & IoT

Transport

Queue

Scheduling

Advanced Features

​PDF

​Quick Start

​Config

​Tools

​Peer Dependency

​Input Sources

PDF

Quick Start

Config

Tools

Peer Dependency

Input Sources