Name: Pisama
Author: Pisama

Question 1

What is specification mismatch in AI agent systems?

Accepted Answer

Specification mismatch occurs when an agent's task output doesn't match the user's original specification. The detector catches scope drift, missing requirements, language mismatches, and conflicting specifications.

Question 2

How does Pisama detect specification mismatch?

Accepted Answer

Semantic Coverage: Measures how well the output covers each requirement, using embeddings.
Keyword Matching: Checks for the presence of required elements, topics, and constraints.
Code Quality Checks: Validates language match and flags deprecated syntax and stub implementations.
Numeric Tolerance: Handles approximate constraints such as word counts (within 20%).
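
Two of the checks listed above, keyword matching and numeric tolerance, are simple enough to sketch in a few lines. The following is a minimal illustration only, not Pisama's implementation: the function names (missing_keywords, word_count, within_tolerance) are hypothetical, and the 20% default is taken from the word-count example in the answer.

```python
import re


def missing_keywords(output: str, required: list[str]) -> list[str]:
    """Return the required keywords that do not appear in the output.

    Assumption: a plain case-insensitive substring test stands in for
    whatever matching the real detector performs.
    """
    lowered = output.lower()
    return [kw for kw in required if kw.lower() not in lowered]


def word_count(text: str) -> int:
    """Count words as runs of word characters."""
    return len(re.findall(r"\b\w+\b", text))


def within_tolerance(actual: int, target: int, tolerance: float = 0.20) -> bool:
    """True when actual lies within +/- tolerance (a fraction) of target."""
    return abs(actual - target) <= tolerance * target


# Hypothetical spec: "about 100 words, covering pricing and shipping".
draft = "The report covers pricing and refunds."
print(missing_keywords(draft, ["pricing", "shipping"]))
print(within_tolerance(85, 100))   # 85 is 15% under the target
print(within_tolerance(130, 100))  # 130 is 30% over the target
```

Under these assumptions, a draft would be flagged when missing_keywords returns a non-empty list or when within_tolerance returns False for its word count.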

Question 3

How accurate is the specification mismatch detector?

Accepted Answer

On the Pisama calibration set, the detector reaches an F1 of 0.703, with precision 0.592 and recall 0.866.
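
The three figures are consistent with each other, since F1 is the harmonic mean of precision and recall. As a quick check, the sketch below recomputes F1 from the reported precision and recall; the helper name f1_score is introduced here for illustration and is not part of any Pisama API.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Reported values from the calibration set.
print(round(f1_score(0.592, 0.866), 3))  # 0.703
```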

Specification Mismatch
