Pisama
Publications

Peer-reviewable scholarly output.

DOI-anchored preprints with full code and data release.

  • 2026-05-08

    Do Human Cognitive Failures Map onto AI Agent Failures? A Structured Literature Review

    PRISMA 2020 structured review of 45 human cognitive failure categories mapped onto AI agent research. Headline: 6 / 24 / 12 / 0 / 3 across Substantial / Partial / Nascent / Absent / Substrate-absent. Appendix C contains a complete crosswalk between MAST’s 14 multi-agent failure modes and the 45-category human cognitive failure taxonomy.

  • 2026-05-08

    Tiered Detection of Multi-Agent LLM Failures: An Empirical Calibration on TRAIL and Who&When

    Empirical companion. 18-detector tiered pipeline, calibrated on 8,338 trace entries; ‘in-distribution’ TRAIL evaluation, held-out Who&When attribution, four-substrate Cohen’s κ analysis against MAST-released labels. Concept DOI 10.5281/zenodo.20027840 covers v1–v3.

Research notes

Empirical work on agent failure detection.

Reproducible experiments. Code and data linked from each report.