AI Agents

Integrations

Conversational AI Meets Evaluation Power: Introducing the Future AGI MCP Server

Q: What tools can connect to the Future AGI MCP server?

Currently, tools like Cursor, Claude Desktop, and any MCP-compatible client can connect. Simply configure them with the MCP server path and environment variables.

Q: What do I need to run the MCP server?

You’ll need: - A Future AGI account with API and Secret Keys - uvx installed in your system - An MCP Client Application like Claude Desktop or Cursor

Q: What’s coming next to Future AGI MCP?

Future AGI’s MCP capabilities are growing fast. Soon, you’ll be able to manage prompts more efficiently—create, update, and refine them with AI suggestions tailored to your use case. Enhanced dataset operations will make data handling smoother and more flexible. We’re also bringing advanced synthetic data generation powered by knowledge bases, making your AI workflows more intelligent and seamless than ever.

Last Updated

Jun 14, 2025

Rishav Hada

Time to read

9 mins

Conversational AI Meets Evaluation Power: Introducing the Future AGI MCP Server

Explore Future AGI

What is MCP?

Model Context Protocol (MCP) is emerging as the industry standard for LLM interactions with external tools and data sources. Just as LLM APIs converged around the OpenAI specification, we're seeing a similar consolidation around MCP as the unified protocol for LLM applications. You can read more about the MCP here

At Future AGI, we're thrilled about the possibilities that MCP brings to our ecosystem. It opens up powerful new ways for our customers to interact with our platform and create more sophisticated AI applications.

Why Future AGI MCP Matters?

LLMs are powerful—but to truly make them useful in real-world workflows, they need access to tools, data, and evaluations. That’s what the Model Context Protocol (MCP) enables.

With Future AGI’s MCP Server, you can do the following using natural language:

Run automatic evaluations — Evaluate batch and single inputs on various evaluation metrics present in Future AGI both on local points and large datasets
Prototype and Observe your Agents — You can add observability and evaluations while both prototyping and deploying your agents into production using natural language
Manage datasets — Upload, evaluate, download datasets and find insights with natural language
Add Protection Rules— Apply toxicity detection, prompt injection protection, and other guardrails to your applications automatically using chat
Synthetic Data Generation — Generate Synthetic Data by describing about the dataset and objective

Future AGI MCP use cases including evaluations, dataset management, synthetic data, code instrumentation, and natural language observability.

Image 1: Future AGI' MCP Server's features

How to Set Up Future AGI MCP?

Create an account at http://app.futureagi.com/ and obtain your API key and Secret key from the dashboard. These credentials are required for the Future AGI MCP Server to authenticate with our API.

To run the server locally:

git clone <https://github.com/future-agi/futureagi-mcp-server.git>
cd futureagi-mcp-server

# Install dependencies
brew install uv
uv sync

# Set environment variables
export FI_API_KEY="your_api_key"
export FI_SECRET_KEY="your_secret_key"

# Run server
python main.py

Configure your MCP clients (Cursor/Claude Desktop) with

{

 "mcpServers": {

   "FutureAGI-MCP": {

     "command": "uvx",

     "args": [

      "futureagi-mcp-server"

     ],

     "env": {

       "FI_SECRET_KEY": "your_api_key",

       "FI_API_KEY": "your_secret_key",      

     }

   }

 }

}

You can also add the Future AGI docs MCP to your clients by running the below command in your terminal. It will prompt you to choose the mcp clients present on your local system. You can choose all, which will add configuration for all the mcp clients

npx @mintlify/mcp@latest add futureagi

MCP server setup showing Future AGI MCP with tools like evaluate, protect, upload_dataset, and command configurations in Cursor IDE.

Image 2: Future AGI MCP with tools like evaluate, protect, upload_dataset, and command configurations in Cursor IDE.

Exploring the Future AGI Features Using MCP

4.1 Evaluating single and batch inputs

You can provide a prompt like "Evaluate the following inputs for tone, toxicity" along with your input. The AI assistant will automatically fetch all available evaluators from the Future AGI Platform. It will then run evaluation on the data and provides the output in structured format

MCP tool evaluation detecting toxicity in text, showing failure with explanation of divisive ideology suggesting racial superiority.

Image 3: MCP tool evaluation detecting toxicity in text

MCP deterministic evaluation validating image of Asian and Indian men playing badminton with matching description and result.

Image 4: MCP deterministic evaluation validating image of Asian and Indian men

4.2 A Powerful Use Case: Conversing with Your Data Using Cursor and Future AGI

Let’s explore an exciting example of how you can leverage Cursor as a natural interface to interact with your organization's data—powered by Future AGI.

Imagine being able to evaluate your data through a simple conversation, without ever touching a graphical UI. Thanks to our Future AGI MCP server, this is now a reality.

Suppose you ask Cursor:

"Can you find rag_chat.csv, upload it to Future AGI, suggest three evaluations, and add them to the dataset?"

Here's what happens behind the scenes:

File Discovery & Upload: Cursor searches for rag_chat.csv locally and uploads it to the Future AGI platform.
Evaluator Selection: It automatically fetches all available evaluators and selects three appropriate ones.
Evaluation Assignment: These selected evaluations are then attached to the dataset.

It also tries to correct itself based on the error thrown by the tool, as shown below

Cursor interface using MCP to upload rag_chat.csv, select three Future AGI evaluations, and apply them to the dataset.

Image 5: Cursor interface using MCP to upload rag_chat.csv

Future AGI MCP adding three LLM evaluations—context relevance, factual accuracy, and groundedness—to a RAG dataset using natural language.

Image 6: Future AGI MCP adding three LLM evaluations—context relevance, factual accuracy, and groundedness

Insight Delivery: Once evaluations are complete, you can ask Cursor to download the evaluated dataset and present the insights—again, all through natural language.

Evaluation insights from Future AGI MCP showing context relevance, factual accuracy, and groundedness scores for a RAG dataset.

Image 7: Evaluation insights from Future AGI MCP showing context relevance, factual accuracy, and groundedness scores

4.3 Synthetic Data Generation

You can now ask your assistant to generate Synthetic Data for a specific use case. It will decide on the dataset columns and their types and send it to Future AGI. Data generation then starts in the background; wait for some time and ask it to download the data. It will be served to your local folder. Be specific about dataset for best results

Cursor interface showing synthetic dataset generation and insights for e-commerce customer support queries using Future AGI MCP.

Image 8: Cursor interface showing synthetic dataset generation and insights for e-commerce customer support

4.4 Seamlessly Add Code Observability and Prototype Using Natural Language

A simple prompt like:

“Can you search for crew_ai instrumentation in the Future AGI docs and suggest the code changes using the custom trace_provider?”

…is all it takes.

Cursor will take care of the rest—searching the documentation, understanding the relevant instrumentation steps, and generating the necessary code changes. No need to manually comb through pages of docs. Just ask, and it delivers.

Cursor editor displaying Crew AI instrumentation setup using Future AGI’s trace_provider integration with step-by-step code suggestions.

Image 9: Cursor editor displaying Crew AI instrumentation setup using Future AGI’s trace_provider integration

Through natural conversations with tools like Claude, team members can prototype ideas, run evaluations, and analyze performance metrics or edge cases—right from their seats.

For the more technical members of your team, modern AI-powered IDEs like Cursor seamlessly integrate into the workflow, accelerating development, instrumentation, and iteration with minimal friction.

Conclusion

The Future AGI MCP Server isn’t just a tool—it’s a developer-first platform that redefines how you work with LLMs. By bringing together the power of Model Context Protocol, LLM evaluation, and natural language interfaces, it allows any team to build, test, and monitor AI applications with unmatched simplicity and speed.

Whether you're evaluating batch data, generating synthetic samples, or hardening your model against security risks, the MCP Server makes it easier than ever to move from idea to deployment.

Ready to experience the future of conversational AI and evaluation? Start here.

FAQs

What tools can connect to the Future AGI MCP server?

What do I need to run the MCP server?

What’s coming next to Future AGI MCP?

What tools can connect to the Future AGI MCP server?

What do I need to run the MCP server?

What’s coming next to Future AGI MCP?

What tools can connect to the Future AGI MCP server?

What do I need to run the MCP server?

What’s coming next to Future AGI MCP?

What tools can connect to the Future AGI MCP server?

What do I need to run the MCP server?

What’s coming next to Future AGI MCP?

What tools can connect to the Future AGI MCP server?

What do I need to run the MCP server?

What’s coming next to Future AGI MCP?

What tools can connect to the Future AGI MCP server?

What do I need to run the MCP server?

What’s coming next to Future AGI MCP?

What tools can connect to the Future AGI MCP server?

What do I need to run the MCP server?

What’s coming next to Future AGI MCP?

What tools can connect to the Future AGI MCP server?

What do I need to run the MCP server?

What’s coming next to Future AGI MCP?

Prompt Injection in LLMs: Attack Vectors & Insights

Indirect Verbal Prompts: Improve AI Conversations Naturally

API vs MCP: What's the difference?

Future AGI June Roundup

Revolutionizing Document Management: The Impact of Document Summarization Using LLM

Prompt Injection in LLMs: Attack Vectors & Insights

Indirect Verbal Prompts: Improve AI Conversations Naturally

API vs MCP: What's the difference?

Prompt Injection in LLMs: Attack Vectors & Insights

Indirect Verbal Prompts: Improve AI Conversations Naturally

API vs MCP: What's the difference?

Prompt Injection in LLMs: Attack Vectors & Insights

Indirect Verbal Prompts: Improve AI Conversations Naturally

API vs MCP: What's the difference?

Rishav Hada

Senior Applied Scientist

Rishav Hada is an Applied Scientist at Future AGI, specializing in AI evaluation and observability. Previously at Microsoft Research, he built frameworks for generative AI evaluation and multilingual language technologies. His research, funded by Twitter and Meta, has been published in top AI conferences and earned the Best Paper Award at FAccT’24.

Sahil N

Jul 1, 2025

API vs MCP: What's the difference?

Explore API vs MCP differences: how Model Context Protocol transforms AI integration with two-way context streaming, tool discovery, and reduced boilerplate.

AI Agents

Integrations

Rishav Hada

May 15, 2025

Conversational AI Meets Evaluation Power: Introducing the Future AGI MCP Server

Future AGI’s MCP Server connects with LLM agents like Claude and Cursor to run evaluations, manage data, apply safety checks, and generate synthetic datasets.

AI Agents

Integrations

Rishav Hada

Jul 1, 2025

MarTech 2.0: The GenAI Revolution

Discover GenAI in MarTech 2.0: predictive marketing, data intelligence layers, and secure Generative AI frameworks for scalable, trustworthy marketing tech.

Webinars

AI Agents

Sahil N

Jul 1, 2025

Prompt Injection in LLMs: Attack Vectors & Insights

Explore prompt injection examples in AI, learn how attackers exploit LLMs, and discover effective detection and prevention strategies against injection attacks.

AI Evaluations

LLMs

NVJK Kartik

Jul 1, 2025

Indirect Verbal Prompts: Improve AI Conversations Naturally

Discover how indirect verbal prompts in AI prompting enhance empathy, context understanding, and drive creative, human-like interactions across applications.

AI Evaluations

Data Quality

Sahil N

Jul 1, 2025

API vs MCP: What's the difference?

Explore API vs MCP differences: how Model Context Protocol transforms AI integration with two-way context streaming, tool discovery, and reduced boilerplate.

AI Agents

Integrations

Rishav Hada

Jul 1, 2025

MarTech 2.0: The GenAI Revolution

Discover GenAI in MarTech 2.0: predictive marketing, data intelligence layers, and secure Generative AI frameworks for scalable, trustworthy marketing tech.

Webinars

Podcasts

Products

AI Agents

Sahil N

Jul 1, 2025

Prompt Injection in LLMs: Attack Vectors & Insights

Explore prompt injection examples in AI, learn how attackers exploit LLMs, and discover effective detection and prevention strategies against injection attacks.

AI Evaluations

LLMs

Podcasts

Products

NVJK Kartik

Jul 1, 2025

Indirect Verbal Prompts: Improve AI Conversations Naturally

Discover how indirect verbal prompts in AI prompting enhance empathy, context understanding, and drive creative, human-like interactions across applications.

AI Evaluations

Podcasts

Products

Data Quality

Sahil N

Jul 1, 2025

API vs MCP: What's the difference?

Explore API vs MCP differences: how Model Context Protocol transforms AI integration with two-way context streaming, tool discovery, and reduced boilerplate.

Podcasts

Products

AI Agents

Integrations

Rishav Hada

Jul 1, 2025

MarTech 2.0: The GenAI Revolution

Discover GenAI in MarTech 2.0: predictive marketing, data intelligence layers, and secure Generative AI frameworks for scalable, trustworthy marketing tech.

Webinars

AI Agents

Sahil N

Jul 1, 2025

Prompt Injection in LLMs: Attack Vectors & Insights

Explore prompt injection examples in AI, learn how attackers exploit LLMs, and discover effective detection and prevention strategies against injection attacks.

AI Evaluations

LLMs

NVJK Kartik

Jul 1, 2025

Indirect Verbal Prompts: Improve AI Conversations Naturally

Discover how indirect verbal prompts in AI prompting enhance empathy, context understanding, and drive creative, human-like interactions across applications.

AI Evaluations

Data Quality

Sahil N

Jul 1, 2025

API vs MCP: What's the difference?

Explore API vs MCP differences: how Model Context Protocol transforms AI integration with two-way context streaming, tool discovery, and reduced boilerplate.

AI Agents

Integrations

Rishav Hada

Jul 1, 2025

MarTech 2.0: The GenAI Revolution

Discover GenAI in MarTech 2.0: predictive marketing, data intelligence layers, and secure Generative AI frameworks for scalable, trustworthy marketing tech.

Webinars

Podcasts

Products

AI Agents

Sahil N

Jul 1, 2025

Prompt Injection in LLMs: Attack Vectors & Insights

Explore prompt injection examples in AI, learn how attackers exploit LLMs, and discover effective detection and prevention strategies against injection attacks.

AI Evaluations

LLMs

Podcasts

Products

NVJK Kartik

Jul 1, 2025

Indirect Verbal Prompts: Improve AI Conversations Naturally

Discover how indirect verbal prompts in AI prompting enhance empathy, context understanding, and drive creative, human-like interactions across applications.

AI Evaluations

Podcasts

Products

Data Quality

Sahil N

Jul 1, 2025

API vs MCP: What's the difference?

Explore API vs MCP differences: how Model Context Protocol transforms AI integration with two-way context streaming, tool discovery, and reduced boilerplate.

Podcasts

Products

AI Agents

Integrations

Rishav Hada

Jul 1, 2025

MarTech 2.0: The GenAI Revolution

Discover GenAI in MarTech 2.0: predictive marketing, data intelligence layers, and secure Generative AI frameworks for scalable, trustworthy marketing tech.

Webinars

Podcasts

Products

AI Agents

Sahil N

Jul 1, 2025

Prompt Injection in LLMs: Attack Vectors & Insights

Explore prompt injection examples in AI, learn how attackers exploit LLMs, and discover effective detection and prevention strategies against injection attacks.

AI Evaluations

LLMs

Podcasts

Products

NVJK Kartik

Jul 1, 2025

Indirect Verbal Prompts: Improve AI Conversations Naturally

Discover how indirect verbal prompts in AI prompting enhance empathy, context understanding, and drive creative, human-like interactions across applications.

AI Evaluations

Podcasts

Products

Data Quality

Sahil N

Jul 1, 2025

API vs MCP: What's the difference?

Explore API vs MCP differences: how Model Context Protocol transforms AI integration with two-way context streaming, tool discovery, and reduced boilerplate.

Podcasts

Products

AI Agents

Integrations

Sahil N

Jul 1, 2025

Prompt Injection in LLMs: Attack Vectors & Insights

Explore prompt injection examples in AI to see how attackers exploit LLMs and learn proven detection and prevention strategies against injection attacks.

Sahil N

Jul 1, 2025

Prompt Injection in LLMs: Attack Vectors & Insights

Explore prompt injection examples in AI to see how attackers exploit LLMs and learn proven detection and prevention strategies against injection attacks.

Sahil N

Jul 1, 2025

Prompt Injection in LLMs: Attack Vectors & Insights

Explore prompt injection examples in AI to see how attackers exploit LLMs and learn proven detection and prevention strategies against injection attacks.

Sahil N

Jul 1, 2025

Prompt Injection in LLMs: Attack Vectors & Insights

Explore prompt injection examples in AI to see how attackers exploit LLMs and learn proven detection and prevention strategies against injection attacks.

Sahil N

Jul 1, 2025

Prompt Injection in LLMs: Attack Vectors & Insights

Explore prompt injection examples in AI to see how attackers exploit LLMs and learn proven detection and prevention strategies against injection attacks.

Sahil N

Jul 1, 2025

Prompt Injection in LLMs: Attack Vectors & Insights

Explore prompt injection examples in AI to see how attackers exploit LLMs and learn proven detection and prevention strategies against injection attacks.

NVJK Kartik

Jul 1, 2025

Indirect Verbal Prompts: Improve AI Conversations Naturally

Learn to apply indirect verbal prompts in AI prompting to boost user experience, contextual understanding, empathy, and creativity in NLP-driven applications.

NVJK Kartik

Jul 1, 2025

Indirect Verbal Prompts: Improve AI Conversations Naturally

Learn to apply indirect verbal prompts in AI prompting to boost user experience, contextual understanding, empathy, and creativity in NLP-driven applications.

NVJK Kartik

Jul 1, 2025

Indirect Verbal Prompts: Improve AI Conversations Naturally

Learn to apply indirect verbal prompts in AI prompting to boost user experience, contextual understanding, empathy, and creativity in NLP-driven applications.

NVJK Kartik

Jul 1, 2025

Indirect Verbal Prompts: Improve AI Conversations Naturally

Learn to apply indirect verbal prompts in AI prompting to boost user experience, contextual understanding, empathy, and creativity in NLP-driven applications.

NVJK Kartik

Jul 1, 2025

Indirect Verbal Prompts: Improve AI Conversations Naturally

Learn to apply indirect verbal prompts in AI prompting to boost user experience, contextual understanding, empathy, and creativity in NLP-driven applications.

NVJK Kartik

Jul 1, 2025

Indirect Verbal Prompts: Improve AI Conversations Naturally

Learn to apply indirect verbal prompts in AI prompting to boost user experience, contextual understanding, empathy, and creativity in NLP-driven applications.

Sahil N

Jul 1, 2025

API vs MCP: What's the difference?

Discover how API vs MCP compares: Model Context Protocol enables context-aware integration, continuous context streaming, enhanced developer productivity.

Sahil N

Jul 1, 2025

API vs MCP: What's the difference?

Discover how API vs MCP compares: Model Context Protocol enables context-aware integration, continuous context streaming, enhanced developer productivity.

Sahil N

Jul 1, 2025

API vs MCP: What's the difference?

Discover how API vs MCP compares: Model Context Protocol enables context-aware integration, continuous context streaming, enhanced developer productivity.

Sahil N

Jul 1, 2025

API vs MCP: What's the difference?

Discover how API vs MCP compares: Model Context Protocol enables context-aware integration, continuous context streaming, enhanced developer productivity.

Sahil N

Jul 1, 2025

API vs MCP: What's the difference?

Discover how API vs MCP compares: Model Context Protocol enables context-aware integration, continuous context streaming, enhanced developer productivity.

Sahil N

Jul 1, 2025

API vs MCP: What's the difference?

Discover how API vs MCP compares: Model Context Protocol enables context-aware integration, continuous context streaming, enhanced developer productivity.

Rishav Hada

Jun 30, 2025

Future AGI June Roundup

Future AGI’s June 2025 roundup features Inline Evaluations, Audio QA tools, ADK integrations, MCP insights, and event highlights from SuperAI.

Rishav Hada

Jun 30, 2025

Future AGI June Roundup

Future AGI’s June 2025 roundup features Inline Evaluations, Audio QA tools, ADK integrations, MCP insights, and event highlights from SuperAI.

Rishav Hada

Jun 30, 2025

Future AGI June Roundup

Future AGI’s June 2025 roundup features Inline Evaluations, Audio QA tools, ADK integrations, MCP insights, and event highlights from SuperAI.

Rishav Hada

Jun 30, 2025

Future AGI June Roundup

Future AGI’s June 2025 roundup features Inline Evaluations, Audio QA tools, ADK integrations, MCP insights, and event highlights from SuperAI.

Rishav Hada

Jun 30, 2025

Future AGI June Roundup

Future AGI’s June 2025 roundup features Inline Evaluations, Audio QA tools, ADK integrations, MCP insights, and event highlights from SuperAI.

Rishav Hada

Jun 30, 2025

Future AGI June Roundup

Future AGI’s June 2025 roundup features Inline Evaluations, Audio QA tools, ADK integrations, MCP insights, and event highlights from SuperAI.

FutureAGI for Startups: Get 6 months of Pro access free plus $5,000 in credits. Apply now!

Products

Research

Customers

Company

Resources

Docs

Pricing

Book a Demo

FutureAGI for Startups: Get 6 months of Pro access free plus $5,000 in credits. Apply now!

Conversational AI Meets Evaluation Power: Introducing the Future AGI MCP Server