Glossary
Agent failure modes, defined
A taxonomy of every multi-agent failure mode Pisama detects, grouped by category. Each entry names the mode, explains the mechanism, gives examples, and lists detection methods with F1 scores from the calibration set.
FC1
Planning
Task specification, decomposition, resource allocation, workflow design.
- Specification MismatchF1 0.70Detects when task output doesn't match the user's original specification. Catches scope drift, missing requirements, language mismatches, and conflicting specifications.
- Poor Task DecompositionF1 0.73Detects when task breakdown creates subtasks that are impossible, circular, vague, too granular, or too broad. Critical for complex multi-step agent workflows.
- Resource MisallocationDetects when multiple agents compete for shared resources, leading to contention, starvation, or deadlock. Common in parallel multi-agent architectures.
- Inadequate Tool ProvisionDetects when agents lack the tools needed to complete assigned tasks. Catches hallucinated tool names, missing capabilities, and suboptimal workarounds.
- Flawed Workflow DesignF1 0.80Detects structural problems in agent workflow graphs including unreachable nodes, dead ends, missing error handling, bottlenecks, and missing termination conditions.
FC2
Execution
Derailment, withholding, coordination, and communication breakdown.
- Task DerailmentF1 0.82Detects when an agent goes off-topic or deviates from its assigned task. One of the most common failure modes (20% prevalence in MAST-Data).
- Context NeglectF1 0.87Detects when an agent ignores or fails to use upstream context provided by previous agents or workflow steps. Critical in multi-agent handoffs.
- Information WithholdingF1 0.87Detects when an agent doesn't share critical information with peers, including omitting negative findings, over-summarizing, or selectively reporting.
- Role UsurpationDetects when an agent exceeds its designated role boundaries, taking actions or making decisions reserved for other roles.
- Communication BreakdownF1 0.82Detects when messages between agents are misunderstood or misinterpreted, causing incorrect downstream behavior.
- Coordination FailureF1 0.80Detects handoff failures, circular delegation, excessive back-and-forth, and ignored messages between coordinating agents.
FC3
Verification
Output validation, quality gates, completion misjudgment, retrieval quality.
- Output Validation FailureDetects when validation steps are skipped or bypassed, or when approval is given despite failed checks.
- Quality Gate BypassDetects when agents skip mandatory quality checks, ignore quality thresholds, or proceed despite failing checks.
- Completion MisjudgmentF1 0.74Detects when an agent incorrectly determines task completion, including premature claims, partial delivery, and ignored subtasks. Most prevalent failure mode (40% in MAST-Data).
EXT
Cross-cutting
Behavioral patterns that span planning, execution, and verification: loops, persona drift, hallucination, injection, state corruption.
- Loop DetectionF1 0.85Detects when agents get stuck repeating the same sequence of actions. Uses multiple detection methods from hash-based to semantic clustering.
- Context OverflowF1 0.82Detects when agent context windows are approaching or exceeding capacity, causing information loss and degraded performance.
- Prompt InjectionF1 0.94Detects prompt injection attacks and jailbreak attempts targeting LLM agents. The highest-accuracy detector in the system.
- HallucinationF1 0.77Detects factual inaccuracies, fabricated information, and unsupported claims in agent outputs.
- Grounding FailureF1 0.67Detects when output contains claims not supported by source documents. Agents achieve less than 45% accuracy on document-grounded tasks (OfficeQA benchmark).
- Retrieval QualityF1 0.82Detects when agents retrieve wrong, irrelevant, or insufficient documents for a task. Retrieval is the primary bottleneck in RAG systems.
- Persona DriftF1 0.93Monitors when agents deviate from their intended role, personality, or behavioral constraints over time. Uses role-aware thresholds for different agent types.
- State CorruptionF1 0.91Detects when agent memory or state becomes corrupted, including type drift, schema violations, nullification, and velocity anomalies. Second-highest accuracy detector.
- Cost TrackingTracks token usage and estimated costs across 30+ LLM models. Alerts when costs exceed budgets or usage patterns suggest inefficiency.
See per-detector benchmark numbers at /benchmarks/detectors. Framework-specific detector packs at /frameworks.