Skip to content
@promptfoo

promptfoo

Test your LLM apps
Promptfoo - AI security testing platform with detective red panda logo

Ship agents, not vulnerabilities

Website • Docs • Blog • Discord

GitHub stars npm downloads License: MIT

AI security testing for LLMs, agents, and RAG systems

Trusted by 25% of the Fortune 500 and 300K+ developers


🚀 Quick Start

npx promptfoo@latest init
npx promptfoo@latest eval
npx promptfoo@latest view

Get Started → · Enterprise →


�️ What We Do

Security Testing

  • Red Teaming — Automated vulnerability discovery with 100+ attack plugins
  • Code Scanning — Detect LLM security risks in your IDE and CI/CD

Evaluations


đź”’ Security & Privacy

What we detect:

  • Prompt injections and jailbreaks
  • PII and sensitive data leaks
  • Hallucinations and policy violations
  • Tool misuse and adversarial attacks

Compliance: SOC 2 Type II · ISO 27001 · HIPAA

Data model:

  • Evals — 100% local, API keys never leave your machine
  • Red teaming — Your target runs locally; attack generation via our API or bring your own keys

📦 Projects

Repository Description
promptfoo Test prompts, agents, and RAGs. Red teaming and vulnerability scanning for LLMs.
promptfoo-action GitHub Action for CI/CD security testing
evil-mcp-server Red team testing for Model Context Protocol servers
modelaudit Static scanner that detects malicious code, backdoors, and vulnerabilities in ML model files
promptfoo-python Python wrapper for promptfoo
js-rouge JavaScript ROUGE metrics for summarization evaluation

👥 Community

Connect: Discord · X/Twitter · Bluesky · LinkedIn

Contribute: Contributing Guide · Good First Issues · Report Issues

Learn: LLM Vulnerability Database · Security Research Blog

Popular repositories Loading

  1. promptfoo promptfoo Public

    Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with co…

    TypeScript 10.8k 961

  2. promptfoo-action promptfoo-action Public

    The GitHub Action for Promptfoo. Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. S…

    TypeScript 43 15

  3. modelaudit modelaudit Public

    Security scanner for AI/ML model files. Detects malicious code, backdoors, and vulnerabilities before deployment

    Python 24 1

  4. evil-mcp-server evil-mcp-server Public

    An evil MCP server used for redteam testing

    TypeScript 24 4

  5. crabcode crabcode Public

    Generic tmux-based workspace manager for multi-repo development. Lightning-fast dev productivity tool.

    Shell 9

  6. mini-foo mini-foo Public

    Mini promptfoo used for interviews

    TypeScript 3 2

Repositories

Showing 10 of 20 repositories
  • promptfoo-action Public

    The GitHub Action for Promptfoo. Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

    promptfoo/promptfoo-action’s past year of commit activity
    TypeScript 43 MIT 15 2 2 Updated Mar 6, 2026
  • promptfoo Public

    Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

    promptfoo/promptfoo’s past year of commit activity
    TypeScript 10,824 MIT 961 88 162 Updated Mar 6, 2026
  • modelaudit Public

    Security scanner for AI/ML model files. Detects malicious code, backdoors, and vulnerabilities before deployment

    promptfoo/modelaudit’s past year of commit activity
    Python 24 MIT 1 1 (1 issue needs help) 8 Updated Mar 6, 2026
  • .github Public
    promptfoo/.github’s past year of commit activity
    2 MIT 0 0 0 Updated Mar 6, 2026
  • example-app Public

    Consolidated example application with options for multiple industries, models, and security levels

    promptfoo/example-app’s past year of commit activity
    TypeScript 1 0 0 8 Updated Mar 5, 2026
  • mcp-agent-provider Public

    A promptfoo custom provider to test MCP servers with our evil mcp server

    promptfoo/mcp-agent-provider’s past year of commit activity
    JavaScript 2 MIT 0 0 0 Updated Mar 5, 2026
  • crabcode Public

    Generic tmux-based workspace manager for multi-repo development. Lightning-fast dev productivity tool.

    promptfoo/crabcode’s past year of commit activity
    Shell 9 0 1 3 Updated Mar 5, 2026
  • evil-mcp-server Public

    An evil MCP server used for redteam testing

    promptfoo/evil-mcp-server’s past year of commit activity
    TypeScript 24 MIT 4 0 0 Updated Mar 4, 2026
  • mini-foo Public

    Mini promptfoo used for interviews

    promptfoo/mini-foo’s past year of commit activity
    TypeScript 3 2 0 0 Updated Mar 2, 2026
  • redscan-lite Public

    Promptfoo Interview Exercise

    promptfoo/redscan-lite’s past year of commit activity
    JavaScript 3 MIT 0 0 0 Updated Mar 2, 2026