What is path traversal in AI?

Path traversal in AI is a security vulnerability where an LLM app or agent lets user-controlled text choose file paths outside an allowed directory. It can expose prompts, logs, secrets, tenant data, or source code.

How is path traversal different from prompt injection?

Prompt injection tries to change model instructions. Path traversal abuses file path handling so a tool reads or writes outside an allowed root, often after a prompt injection supplies the malicious path.

How do you measure path traversal?

Run FutureAGI's PathTraversalDetector on generated file paths, file tool inputs, archive entries, and retrieval artifact names. Track guardrail block rate, eval-fail-rate-by-route, and traces with suspicious normalized paths.

What Is Path Traversal (AI)? FutureAGI Guide (2026)

What Is Path Traversal (AI)?

Path traversal in AI is a security vulnerability where an LLM app or agent lets user-controlled text select a file path outside an approved directory. It appears in eval pipelines, production traces, and gateway guardrails when agents read files, retrieve documents, upload artifacts, or call filesystem-backed tools with paths such as ../, encoded separators, or symlink targets. FutureAGI uses PathTraversalDetector to flag the risky path and preserve trace evidence before secrets, prompts, tenant files, or source code leak.

Why it matters in production LLM/agent systems

Path traversal converts a natural-language request into unauthorized file access. A support agent with a file-reading tool may receive “summarize ../../billing/keys.env”; a RAG ingestion worker may accept a zip with ../tenant-b/report.pdf; a coding assistant may obey a prompt to inspect /etc/passwd instead of the project directory. The model output can look harmless, but the tool boundary turns it into a filesystem operation.

Ignoring it leads to two failures: secret disclosure and cross-tenant document access. Developers debug confusing “file not found” or “permission denied” errors, then discover successful reads in adjacent directories. SREs see spikes in blocked file tool calls, unusual artifact paths, high retry counts, or traces where a retrieval step references files outside the workspace. Security and compliance teams feel the largest pain because the incident needs path, tenant, source prompt, tool name, and user identity to determine scope.

Agentic pipelines make path traversal harder than classic web handlers. A 2026 workflow may unpack uploads, rewrite paths, summarize file lists, call a code interpreter, and store generated paths in memory. Each step can normalize or obscure the original attack string. The right control watches every file-boundary span, not only the first HTTP request.

How FutureAGI handles path traversal

FutureAGI handles path traversal as a filesystem-boundary eval, not a content-quality score. The anchor is eval:PathTraversalDetector; the PathTraversalDetector security detector is the exact FutureAGI class for CWE-22-style path traversal findings.

A real workflow: a LangChain support agent can read uploaded policy PDFs from a tenant-scoped object store. FutureAGI instrumentation with traceAI-langchain records the user request, retrieval chunk, agent.trajectory.step, tool.name, tool.input.path, tenant id, and route name such as support-file-reader. Before the file tool executes, an Agent Command Center pre-guardrail sends the proposed path to PathTraversalDetector. If the detector flags traversal indicators such as parent-directory segments, encoded separators, absolute paths, or symlink escape attempts, the route blocks the tool call, attaches the evaluator result to the trace, and returns a fallback response that asks for review. A post-guardrail can also inspect returned artifact paths before the answer streams.

FutureAGI’s approach is to keep the detector result, raw path, normalized path, route, source span, and guardrail action in the same trace the engineer debugs. Compared with Semgrep or CodeQL static scans, this catches risky paths generated at runtime by a model, retriever, archive extractor, or agent planner. The next action is concrete: narrow the file tool schema, deny raw paths where handles are safer, quarantine the source document, add the trace to a regression dataset, and fail release checks when PathTraversalDetector findings exceed the approved threshold.

How to measure or detect it

Measure path traversal where text crosses into filesystem access, not only where the user prompt enters the model:

PathTraversalDetector. detects CWE-22 path traversal vulnerabilities in generated paths, file-tool inputs, archive entries, retrieval artifacts, and code-assistant outputs.
Trace fields. inspect tool.input.path, normalized path, workspace root, tenant id, source URL, retrieval document id, prompt version, and agent.trajectory.step.
Dashboard signals. track path-traversal-fail-rate-by-route, guardrail block rate, file-tool deny count, eval-fail-rate-by-cohort, p99 latency after guardrails, and escalation-rate.
User-feedback proxy. watch reports of missing files, summaries from the wrong tenant, unexplained access denial, or agents reading files outside the requested workspace.

from fi.evals import PathTraversalDetector

candidate_path = "../tenant-b/secrets.env"
result = PathTraversalDetector().evaluate(input=candidate_path)
if result:
    print("block", result)

Alert on the canonical path after decoding and symlink resolution. A literal ../ check is only a first filter; the measurable control is whether the final path stays inside the allowed root for the tenant, route, and tool.

Common mistakes

Most path traversal bugs come from treating paths as harmless strings instead of authorization-sensitive tool inputs.

Checking only ../. Encoded separators, absolute paths, UNC paths, symlinks, and archive entries can escape the approved root.
Normalizing after authorization. Authorize the canonical path after resolving symlinks, not the raw string the model proposed.
Letting agents browse arbitrary workspaces. File tools should receive scoped IDs or handles, not open-ended paths from natural language.
Treating read-only as harmless. Reading prompts, logs, embeddings, or tenant files can be enough for data exposure.
Dropping blocked payloads. Without trace evidence, security teams cannot distinguish a harmless typo from repeated probing.