Publications

Peer-reviewable scholarly output.

Name: Pisama
Author: Pisama

DOI-anchored preprints with full code and data release.

2026-05-08
Do Human Cognitive Failures Map onto AI Agent Failures? A Structured Literature Review
PRISMA 2020 structured review mapping 45 human cognitive failure categories onto AI agent research. Headline counts: 6 Substantial, 24 Partial, 12 Nascent, 0 Absent, 3 Substrate-absent. Appendix C contains a complete crosswalk between MAST’s 14 multi-agent failure modes and the 45-category human cognitive failure taxonomy.
OSF · 10.17605/OSF.IO/XT4WP | Zenodo mirror · 10.5281/zenodo.20091430 | PDF

2026-05-08
Tiered Detection of Multi-Agent LLM Failures: An Empirical Calibration on TRAIL and Who&When
Empirical companion. 18-detector tiered pipeline, calibrated on 8,338 trace entries; ‘in-distribution’ TRAIL evaluation, held-out Who&When attribution, and four-substrate Cohen’s κ analysis against MAST-released labels. Concept DOI 10.5281/zenodo.20027840 covers v1–v3.
Zenodo v3 · 10.5281/zenodo.20091432 | PDF

Research notes

Empirical work on agent failure detection.

Reproducible experiments. Code and data linked from each report.