How to Instrument Your AI Agent in Minutes Using TraceAI: Open-Source LLM Observability for Production in 2026

Learn how to instrument AI agents with TraceAI in 2026. Covers OpenTelemetry setup, auto-instrumentation for OpenAI and LangChain, manual span decoration, and span kinds.


Why Observability Is the Real Fix for AI Agent Failures, Not Better Prompts or New Frameworks

Every day, developers stare at a web of prompts, retrievers, and evaluators, trying to guess why an agent failed. Pipelines call half a dozen models and tools trigger other tools, yet no one can see what actually happens within each step, from input to chain to output.

Developers still think the “fix” is better prompts or new frameworks: LangChain today, CrewAI tomorrow, another agent added to the loop. But the core problem isn’t orchestration, it’s observability. You can’t improve what you can’t observe.

Understanding an agent means tracing every step of its reasoning: how it interpreted a user query, what context it fetched, which tools it called, how long it spent thinking, and what it cost. That is exactly what Future AGI’s TraceAI makes visible.

Instead of adding another abstraction layer, it exposes what your stack is already doing, surfacing every LLM call, every retriever hit, and every tool invocation as standardized, OpenTelemetry-compatible spans.

With TraceAI, debugging stops being a post-mortem. You see your agent’s reasoning trail unfold in real time, complete with context, cost, and sequence, so you can fix what matters instead of guessing.

Why Instrumentation Matters: How Tracing Reveals What Logs Cannot in Multi-Step AI Agent Pipelines

Agentic AI systems aren’t static. They evolve, adapt, and depend on multiple moving parts: models, retrievers, prompts, memory storage, orchestration, and external tools.

Without tracing:

  • You can’t debug failures: why did the agent pick the wrong answer?
  • You can’t measure latency or token usage across steps.
  • You can’t visualize the workflow across multiple frameworks (LangChain, CrewAI, DSPy, etc.).

Traditional logs tell you what happened.
Tracing tells you how it happened.

TraceAI standardizes this process so that every prompt, embedding, retrieval, and response is automatically captured, versioned, and observable.

What Is TraceAI: How This Open-Source OpenTelemetry Package Standardizes AI Agent Tracing

TraceAI is an open-source (OSS) package that enables standardized tracing of AI applications and frameworks.

It extends OpenTelemetry, bridging the gap between infrastructure-level traces (API calls, latency, errors) and AI-specific signals like:

  • Prompts and responses
  • Model name and parameters
  • Token usage and latency
  • Tool invocations
  • Guardrail or evaluator outcomes
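
Conceptually, each of these signals lands on a span as structured attributes. The exact keys are defined by TraceAI’s semantic conventions; the dictionary below only illustrates the shape of that data, not the real attribute names:

# Illustrative only: actual attribute keys come from TraceAI's
# semantic conventions, not from this sketch.
llm_span_attributes = {
    "llm.model_name": "gpt-4o",
    "llm.prompt": "Tell me a quick AI joke.",
    "llm.completion": "Why did the neural net cross the road? ...",
    "llm.token_count.prompt": 21,
    "llm.token_count.completion": 14,
    "llm.latency_ms": 412,
}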

TraceAI works out of the box with orchestration frameworks and SDKs such as LangChain, CrewAI, and OpenAI, and can export to any OpenTelemetry-compatible backend or directly into Future AGI’s observability platform.

TraceAI Features at a Glance: Standardized Tracing, Framework-Agnostic Support, and Future AGI Native Integration

Standardized Tracing: Maps AI workflows to consistent trace attributes and spans.
Framework-Agnostic: Works with OpenAI, LangChain, Anthropic, CrewAI, DSPy, and many others.
Extensible Plugins: Easily add instrumentation for unsupported frameworks.
Future AGI Native Support: Optimized for full-stack observability with trace-to-eval linkage.

Supported Frameworks: How TraceAI Instruments OpenAI, LangChain, CrewAI, DSPy, Anthropic, and More

| Framework | Package | Description |
| --- | --- | --- |
| OpenAI | traceAI-openai | Traces OpenAI completions, chat, image, and audio APIs |
| LangChain | traceAI-langchain | Instruments chains, tools, retrievers |
| CrewAI | traceAI-crewai | Observes agentic multi-actor pipelines |
| DSPy | traceAI-dspy | Traces declarative program steps |
| Anthropic, Mistral, Vertex AI, Groq, Haystack, Bedrock, etc. | - | Full cross-vendor support |

(All available on PyPI under traceAI-* packages.)
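
You install only the integrations you use. For example, to add the LangChain, CrewAI, and DSPy instrumentors listed in the table above:

pip install traceAI-langchain traceAI-crewai traceAI-dspy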

Quickstart: How to Create Your First AI Agent Trace with TraceAI in Five Steps

Step 1: How to Install the TraceAI Package for Your AI Framework

pip install traceAI-openai

Step 2: How to Set Environment Variables for Future AGI and OpenAI API Keys

import os

os.environ["FI_API_KEY"] = "<YOUR_FI_API_KEY>"
os.environ["FI_SECRET_KEY"] = "<YOUR_FI_SECRET_KEY>"
os.environ["OPENAI_API_KEY"] = "<YOUR_OPENAI_API_KEY>"

Step 3: How to Register the Tracer Provider and Connect to Future AGI Observability Pipeline

This connects your app to Future AGI’s observability pipeline (or any OTEL backend):

from fi_instrumentation import register
from fi_instrumentation.fi_types import ProjectType

trace_provider = register(
    project_type=ProjectType.OBSERVE,
    project_name="openai_app"
)
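
If you’d rather send spans to a generic OpenTelemetry backend, a stock SDK provider with an OTLP exporter works too. A minimal sketch, assuming the instrumentor in Step 4 accepts any OpenTelemetry-compatible provider (the endpoint is a placeholder for your own collector):

from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import BatchSpanProcessor
from opentelemetry.exporter.otlp.proto.http.trace_exporter import OTLPSpanExporter

# Standard OTel setup: batch spans and ship them over OTLP/HTTP
# to whatever collector you run (Jaeger, Grafana, Datadog, etc.).
trace_provider = TracerProvider()
trace_provider.add_span_processor(
    BatchSpanProcessor(OTLPSpanExporter(endpoint="http://localhost:4318/v1/traces"))
)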

Step 4: How to Instrument Your Framework with OpenAIInstrumentor for Automatic Span Capture

from traceai_openai import OpenAIInstrumentor

OpenAIInstrumentor().instrument(tracer_provider=trace_provider)

Now, every OpenAI call automatically emits spans with attributes like model name, latency, prompt length, and cost.

Step 5: How to Interact with Your Framework and Verify Your First Observable AI Trace

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Tell me a quick AI joke."}
    ]
)

print(response.choices[0].message.content.strip())

You’ve just created your first observable AI call.
Each run appears as a trace, with spans representing prompt input, LLM execution, and response generation.

Image 1: TraceAI trace visualization dashboard, showing AI agent trace spans for SQL query generation with execution times and workflow steps

Manual Instrumentation with TraceAI Helpers: Fine-Grained Control for Functions, Chains, and Tools

While framework-level auto-instrumentation works for most cases, sometimes you want fine-grained control over what gets traced.

That’s where TraceAI helpers come in. These are lightweight decorators and context managers that let you manually trace functions, chains, and tools.

Setup: How to Install fi-instrumentation-otel and Initialize FITracer for Manual Tracing

pip install fi-instrumentation-otel

from fi_instrumentation import register, FITracer
from fi_instrumentation.fi_types import ProjectType

trace_provider = register(
    project_type=ProjectType.EXPERIMENT,
    project_name="FUTURE_AGI",
    project_version_name="openai-exp",
)

tracer = FITracer(trace_provider.get_tracer(__name__))

Function Decoration: How the Tracer Chain Decorator Automatically Captures Input, Output, and Span Status

@tracer.chain
def my_func(input: str) -> str:
    return "output"

✅ Captures input/output automatically
✅ Auto-assigns span status
✅ Ideal for wrapping full functions or chain steps
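
Decorated functions also compose. Since spans follow standard OpenTelemetry context propagation, calling one @tracer.chain function from another should nest the child span under the parent; a small sketch with illustrative names:

@tracer.chain
def summarize(text: str) -> str:
    # Child span: nests under the caller's span via OTel context.
    return text[:100]

@tracer.chain
def pipeline(doc: str) -> str:
    # Parent span: summarize() appears beneath it in the trace view.
    return summarize(doc)

pipeline("A long document about observability ...")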

Code Block Tracing: How Context Managers Let You Manually Set Span Attributes, Inputs, and Outputs

from opentelemetry.trace.status import Status, StatusCode

with tracer.start_as_current_span(
    "my-span-name",
    fi_span_kind="chain",
) as span:
    span.set_input("input")
    span.set_output("output")
    span.set_status(Status(StatusCode.OK))

✅ Perfect for tracing specific blocks
✅ Lets you set attributes manually (e.g., tool parameters, results)
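
Context managers are also the natural place to record failures. A sketch of marking a span as errored, assuming FITracer spans expose the standard OpenTelemetry status and exception APIs (call_external_tool is a stand-in for your own code):

with tracer.start_as_current_span(
    "risky-step",
    fi_span_kind="tool",
) as span:
    try:
        result = call_external_tool()  # hypothetical external call
        span.set_output(result)
        span.set_status(Status(StatusCode.OK))
    except Exception as exc:
        span.record_exception(exc)  # standard OTel span API
        span.set_status(Status(StatusCode.ERROR, str(exc)))
        raise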

Span Kinds: How CHAIN, LLM, TOOL, RETRIEVER, AGENT, and EVALUATOR Spans Make Traces Semantically Rich

TraceAI defines semantic span kinds for different components:

| Span Kind | Use Case |
| --- | --- |
| CHAIN | General logic or function |
| LLM | Model calls |
| TOOL | Tool usage |
| RETRIEVER | Document retrieval |
| EMBEDDING | Embedding generation |
| AGENT | Agent invocation |
| RERANKER | Context reranking |
| GUARDRAIL | Compliance checks |
| EVALUATOR | Evaluation spans |

These kinds make your traces semantically rich and visually distinct in dashboards.
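
The examples below cover the AGENT and TOOL kinds via decorators; for the others, the context-manager pattern from the previous section works the same way. A sketch of a RETRIEVER span, where my_vector_store stands in for your own retriever:

with tracer.start_as_current_span(
    "retrieve-context",
    fi_span_kind="retriever",
) as span:
    span.set_input("What is TraceAI?")
    docs = my_vector_store.search("What is TraceAI?")  # hypothetical retriever
    span.set_output(docs)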

Example Agent Span: How to Decorate Agent Functions for Full Decision Context Visibility

@tracer.agent
def run_agent(input: str) -> str:
    return "processed output"

run_agent("input data")

Example Tool Span: How to Decorate Tool Functions with Name, Description, Parameters, and Execution Status

@tracer.tool(
    name="text-cleaner",
    description="Cleans raw text for downstream tasks",
    parameters={"input": "dirty text"},
)
def clean_text(input: str) -> str:
    return input.strip()

clean_text("  hello world  ")

Every decorator automatically generates spans that include:

  • Input and output data
  • Tool metadata
  • Execution status

All visible in your Future AGI trace view.

Image 2: Future AGI LLM tracing dashboard, showing SQL query traces with latency metrics, input/output data, and execution timeline graphs

Visualizing TraceAI Traces: How to See LLM Calls, Tool Executions, and Agent Spans as Nested Timelines

Connect TraceAI to Future AGI’s Observe platform to visualize spans as nested timelines.

Each node represents a span:

  • LLM call → model name, latency, tokens
  • Tool execution → input/output details
  • Agent span → full decision context

This makes debugging, optimization, and auditing effortless.

Image 3: TraceAI system metrics dashboard, charting latency, token usage, traffic spans, and cost over a 30-day window

Best Practices for TraceAI Instrumentation: Decorators, Naming, Evaluation Correlation, and Early Setup

✅ Use decorators for simplicity; context managers for precision.
✅ Name spans meaningfully (fi_span_kind="agent", name="retriever-step").
✅ Combine with Future AGI Evaluate to correlate trace performance with eval metrics.
✅ Use environment-based project versions to separate dev, staging, and prod (see the sketch below).
✅ Add TraceAI early; retrofitting instrumentation later is harder.
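
For the environment-based versioning tip, a minimal sketch; the APP_ENV variable is an assumption, and register()’s parameters are the ones shown in the setup sections above:

import os
from fi_instrumentation import register
from fi_instrumentation.fi_types import ProjectType

env = os.environ.get("APP_ENV", "dev")  # assumed convention: dev / staging / prod

trace_provider = register(
    project_type=ProjectType.OBSERVE,
    project_name="openai_app",
    project_version_name=f"openai-app-{env}",
)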

Troubleshooting TraceAI: How to Fix Missing Spans, OTEL Backend Issues, and Sensitive Data Exposure

  • Missing spans? Verify your trace_provider registration.
  • OTEL backend not visible? Ensure your collector endpoint is running.
  • Sensitive data exposure? Redact inputs and outputs before setting them on spans, as sketched below.
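
If you don’t have a redaction utility handy, a hand-rolled helper is enough to start; redact() below is illustrative, not a TraceAI API, and user_message and answer are placeholders for your own values:

import re

def redact(text: str) -> str:
    # Illustrative only: mask email addresses and long digit runs
    # before attaching text to a span.
    text = re.sub(r"[\w.+-]+@[\w-]+\.[\w.]+", "[EMAIL]", text)
    return re.sub(r"\d{6,}", "[NUMBER]", text)

with tracer.start_as_current_span("support-step", fi_span_kind="chain") as span:
    span.set_input(redact(user_message))  # user_message: your raw input
    span.set_output(redact(answer))       # answer: your model's reply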

How to Contribute to TraceAI: Open-Source Community Guidelines and GitHub Contribution Steps

TraceAI is open-source and community-driven.

To contribute:

git clone https://github.com/future-agi/traceAI
cd traceAI
git checkout -b feature/<your-feature>

Then submit a PR!
Join the Future AGI community on LinkedIn, Twitter, or Reddit.

Observability is no longer a hurdle.

The best AI systems don’t just generate answers, they explain how they got there.
TraceAI makes that explanation visible, measurable, and improvable.

Ready to see your agents come alive with full trace visibility?
👉 Get started now: github.com/future-agi/traceAI

Frequently Asked Questions About TraceAI and AI Agent Observability

What is TraceAI in simple terms and how does it differ from traditional logging tools?

TraceAI is an open-source AI tracing and observability framework built on OpenTelemetry. It provides drop-in instrumentation for LLMs, agents, and AI frameworks so you can capture traces, spans, tokens, prompts, completions, and tool calls from your AI workflows with minimal code changes.

How is TraceAI different from traditional APM or logging tools for AI agent debugging?

Traditional APM tools are optimized for HTTP services and databases, not LLM calls, agents, and retrieval chains. TraceAI applies GenAI-specific semantic conventions on top of OpenTelemetry, mapping prompts, completions, tools, RAG steps, and agent actions into structured spans so you can actually debug and optimize AI behavior, not just infrastructure latency.

Which AI frameworks and LLM providers does TraceAI support with out-of-the-box instrumentation?

TraceAI ships with 20+ integrations across Python and TypeScript, including OpenAI, Anthropic, AWS Bedrock, Google Vertex AI, Google Generative AI, Mistral, Groq, LangChain, LlamaIndex, CrewAI, AutoGen, SmolAgents, DSPy, Guardrails, Haystack, Vercel AI SDK, Mastra, MCP, and more. You can see the full compatibility matrix in the README and add your own integrations via OpenTelemetry.

How does TraceAI use OpenTelemetry for AI observability and which backends does it support?

TraceAI is OpenTelemetry-native. It plugs into your existing OTel setup by registering a tracer provider and exporting spans through standard OTLP exporters (HTTP/gRPC). All LLM, agent, and tool spans follow OpenTelemetry’s GenAI semantic conventions, so you can send the data to any OTel-compatible backend like Jaeger, Datadog, Grafana, or the Future AGI platform.
