"If modern LLMs can handle 200k tokens, why not just send the diff with relevant context and let the model figure it out? What's the point of all this agent complexity?"
A prompt can only see what you send it. For meaningful code review, you need context from across your entire codebase — imports, dependencies, related files, tests, conventions.
Research shows that dumping more context into an LLM can actively harm performance, a problem often called "context dilution."
10-20%: performance drop from too many documents
U-curve: info in the middle of the context gets "lost"
60-80%: false positive rate in context-dump tools
Agents don't just "read prompts better." They actively investigate your codebase, as in the sketch after this list:
Fetch only relevant files on-demand, not dump everything upfront
"I suspect a type mismatch" → search callers → confirm with static analysis
Follow leads across files, dig deeper when something looks suspicious
Run linters, type checkers, and analyzers to verify findings with real data
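Here is a minimal sketch of that kind of targeted check. It is not diffray's internal code: the grep pattern, the getUser symbol, and the src/ layout are assumptions, but the shape is "find the callers, then let the compiler confirm":

```typescript
// Illustrative only: symbol name, paths, and commands are assumptions.
import { execSync } from "node:child_process";

function findCallers(symbol: string): string[] {
  // Locate call sites with grep instead of loading the whole repo into the prompt.
  const out = execSync(`grep -rn "${symbol}(" src/ || true`).toString();
  return out.split("\n").filter(Boolean);
}

function confirmWithTypeChecker(): string[] {
  // Run tsc and keep only type errors: real evidence rather than speculation.
  const out = execSync("npx tsc --noEmit --pretty false || true").toString();
  return out.split("\n").filter((line) => line.includes("error TS"));
}

// 1. Hypothesis: callers of getUser() may break after the signature change.
const callers = findCallers("getUser");
// 2. Verification: the compiler confirms or refutes the hypothesis.
const errors = confirmWithTypeChecker();
console.log({ suspectedCallSites: callers.length, confirmedErrors: errors.length });
```

Two cheap, verifiable steps replace one expensive, noisy dump of the whole repository.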
A prompt sees what you give it.
An agent finds what it needs.
The difference between useful review and noise isn't how much context you have — it's having the right context
Before review starts, we build a map of how files connect — imports, exports, type definitions, and call chains
Each agent receives only the context relevant to its task — security agent gets auth flows, not UI styling
Agents fetch additional context only when needed — following leads without upfront overload
Core context (diff, types) stays resident; surrounding context (callers, tests) loaded as needed
200k tokens of everything — diff, full files, random dependencies...
Focused chunks — diff + direct dependencies + relevant patterns (sketched below)
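As a rough illustration of core-plus-surrounding context, here is a naive sketch. The regex-based import scan and the directory layout are placeholder assumptions, not how diffray builds its map:

```typescript
// Minimal sketch: build a naive import map, then pick only the changed file
// plus its direct dependencies as review context.
import { readFileSync, readdirSync } from "node:fs";
import { join } from "node:path";

type ImportGraph = Map<string, string[]>;

function buildImportGraph(dir: string): ImportGraph {
  const graph: ImportGraph = new Map();
  for (const file of readdirSync(dir, { recursive: true })) {
    if (!file.endsWith(".ts")) continue;
    const source = readFileSync(join(dir, file), "utf8");
    // Very rough: capture the module specifier of each import statement.
    const imports = [...source.matchAll(/from\s+["'](.+?)["']/g)].map((m) => m[1]);
    graph.set(file, imports);
  }
  return graph;
}

function focusedContext(graph: ImportGraph, changedFile: string): string[] {
  // Core context: the changed file. Surrounding context: its direct imports.
  return [changedFile, ...(graph.get(changedFile) ?? [])];
}
```

Instead of 200k tokens of everything, the prompt gets the changed file and the handful of modules it actually touches.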
A single LLM call reviewing code has fundamental limitations
Limited to the diff you provide
No iteration or verification
Blind to dependencies and context
No way to validate claims
Attention spread thin across all concerns
Typical output: "Make sure callers are updated"
An agent, by contrast:
Navigates your entire project
Follows leads, digs deeper
Understands imports and dependencies
Runs static analyzers to confirm
Each agent specializes in one area
Typical output: "3 call sites have type mismatches at lines 45, 89, 112"
The difference is between speculation and investigation.
An agent is an AI system that can think, act, and verify
Read files, search code, run static analyzers
Choose what to investigate based on findings
Follow leads, verify hypotheses, dig deeper
Validate reasoning against real data (a schematic loop is sketched below)
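In code, that loop is roughly the following. It is a schematic, not diffray's implementation: the tool set, the step limit, and the Model type stand in for whatever model and tools are actually wired up:

```typescript
// Schematic think/act/verify loop. All names here are placeholders.
type AgentStep = { tool?: string; input?: string; finding?: string };
type Model = (history: string[]) => Promise<AgentStep>;
type Tool = (input: string) => Promise<string>;

// Placeholder tools: real ones would read files, grep the repo, and run analyzers.
const tools: Record<string, Tool> = {
  readFile: async (path) => `contents of ${path}`,
  searchCode: async (query) => `matches for ${query}`,
  runAnalyzer: async (cmd) => `output of ${cmd}`,
};

async function reviewLoop(diff: string, model: Model): Promise<string> {
  const history = [`Review this diff:\n${diff}`];
  for (let step = 0; step < 10; step++) {
    const next = await model(history);        // think: decide the next action
    if (next.finding) return next.finding;    // stop once a finding is verified
    const tool = tools[next.tool ?? ""];
    if (!tool) break;
    const observation = await tool(next.input ?? ""); // act: gather evidence
    history.push(`Observation: ${observation}`);      // verify: feed real data back in
  }
  return "No verified findings.";
}
```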
When diffray reviews your PR, agents don't just "look at the diff". They:
Follow imports to understand how changed code affects the entire system
Examine tests, configs, and documentation for context
Run static analysis to confirm suspected issues actually exist
Look up type definitions, API contracts, and conventions
Consider a function signature change in a PR (sketched below):
"This changes the return type, make sure callers are updated"
Generic advice. No specifics.
→ "Found 3 breaking changes: src/api/users.ts:45, src/hooks/useAuth.ts:89, src/utils/validate.ts:112"
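To make that concrete, here is a hypothetical version of such a change. The types are invented for illustration; only the idea carries over: a return type changes, and an un-updated caller stops compiling.

```typescript
// Hypothetical before/after; not the actual code behind the finding above.
type User = { id: string; name: string };
type Result<T> = { ok: true; value: T } | { ok: false; error: string };

// After the PR: getUser now wraps its result instead of returning User directly.
async function getUser(id: string): Promise<Result<User>> {
  return { ok: true, value: { id, name: "Ada" } };
}

// A caller that still assumes the old shape (think src/hooks/useAuth.ts:89)
// no longer type-checks, because `name` does not exist on Result<User>.
async function greet(id: string): Promise<string> {
  const user = await getUser(id);
  // @ts-expect-error caller not yet updated to unwrap the Result
  return `Hello, ${user.name}`;
}
```

Running the type checker across callers is exactly the kind of evidence behind a finding like the one above.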
To truly understand changes, you need to see how they fit into the entire codebase
New function formatUserName() added
Looks syntactically correct
No obvious bugs in these 20 lines
Verdict: "LGTM" — but completely missing the bigger picture
This function duplicates utils/names.ts:formatName()
Existing function handles edge cases this one misses
3 other files already use the existing utility
This breaks the naming convention in /docs/CONVENTIONS.md
Verdict: "Consider using existing formatName() from utils/names.ts"
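A minimal reconstruction of that scenario (the actual code in the PR and in utils/names.ts is assumed, not quoted):

```typescript
// Existing utility in utils/names.ts, already used elsewhere in the codebase.
export function formatName(first: string, last?: string): string {
  // Handles the edge cases the new copy misses: missing last name, stray whitespace.
  const full = [first, last].filter(Boolean).join(" ").trim();
  return full.length > 0 ? full : "Unknown";
}

// New function added in the PR: syntactically fine, but a narrower duplicate.
export function formatUserName(first: string, last: string): string {
  return `${first} ${last}`; // breaks on a missing last name or empty strings
}
```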
Is the developer reinventing the wheel? Does a similar solution already exist in the codebase?
Do these changes follow established patterns? Or introduce a conflicting approach?
How do these changes affect the rest of the system? What depends on the modified code?
Are team conventions and documented standards being followed?
A diff shows you what changed. Full codebase context shows you whether it should have.
Powerful foundations enabling true multi-agent collaboration
Every review goes through a multi-phase pipeline, each phase optimized for its purpose (a schematic runner is sketched after the phase list)
Clone: Fetch repo & checkout PR
Data Prep: Build dependency graph
Summarize: LLM summarizes changes
Triage: Route files to agents
Rules: Load & filter rules
Review: Parallel agent analysis
Dedupe: Merge & rescore
Validation: Verify & rescore
Report: Generate PR comments
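A schematic runner for those phases might look like the sketch below. Every name in it is a placeholder; only the phase order comes from the pipeline above.

```typescript
// Schematic only: phase implementations are omitted, and the context fields
// are assumptions that mirror the phase list above.
type Finding = { file: string; line: number; message: string; score: number };

interface ReviewContext {
  prUrl: string;
  repoPath?: string;                         // set by Clone
  dependencyGraph?: Map<string, string[]>;   // built during Data Prep
  summary?: string;                          // produced by Summarize
  assignments?: Map<string, string[]>;       // Triage: agent name -> files
  findings: Finding[];                       // added by Review, trimmed by Dedupe/Validation
}

type Phase = { name: string; run(ctx: ReviewContext): Promise<ReviewContext> };

async function runPipeline(prUrl: string, phases: Phase[]): Promise<Finding[]> {
  let ctx: ReviewContext = { prUrl, findings: [] };
  for (const phase of phases) {
    ctx = await phase.run(ctx); // Clone -> Data Prep -> ... -> Report, in order
  }
  return ctx.findings;
}
```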
The Result
A multi-agent system that combines AI reasoning with concrete code analysis — delivering accurate, verified findings instead of speculation.
See how investigation beats speculation. Try diffray free on your next PR.