Formal Framework for Agentic Technical Debt and Stochastic Tax in AI Workflows
This paper introduces a formal model distinguishing two constructs in agentic AI deployments: Agentic Technical Debt (accumulated design and governance liability) and Stochastic Tax (recurring operating burden from probabilistic agents in business workflows). The framework provides measurement methods, simulation tools, and a dashboard expression grounded in operational data estimation. An accounts-payable simulation and companion spreadsheet illustrate practical application. The work targets both technical and managerial audiences seeking to quantify and govern agentic AI system costs.
Related guides (3)
Related events (8)
Practices for Governing Agentic AI Systems
OpenAI published a framework document outlining governance practices for agentic AI systems. The piece addresses how to manage AI agents that take sequences of actions, make decisions, and operate with varying degrees of autonomy. It likely covers topics such as human oversight, authorization boundaries, and accountability structures for agentic deployments.
Taxonomy and governance gap analysis for AI contributors in open-source software
A preprint from arXiv analyzes how open-source organizations are handling AI-generated and agent-driven contributions, comparing policies across six major projects (SymPy, LLVM, matplotlib, OpenInfra, Apache Software Foundation, Linux Foundation). The authors develop a six-dimensional taxonomy covering disclosure, responsibility, human oversight, licensing, enforcement, and maintainer workload, and score each organization's policy maturity. The paper maps documented agent incidents onto governance gaps and identifies misalignments with emerging regulatory frameworks including the EU AI Act, NIST AI RMF, and ISO/IEC 42001, proposing a harmonized tiered framework.
Anthropic publishes framework for safe and trustworthy agent development
Anthropic released a formal framework for responsible agent development, articulating principles around human oversight, transparency, value alignment, and privacy for autonomous AI agents. The document draws on Claude Code as a reference implementation and cites enterprise deployments at Trellix and Block as real-world examples. The framework is positioned as a contribution to emerging industry standards for agentic AI systems, acknowledging open technical challenges in value alignment measurement and oversight calibration.
New Paper: Towards a Science of AI Agent Reliability
A new paper proposes a framework for quantifying the gap between AI agent capability and reliability, aiming to establish a more rigorous science of agent dependability. The work addresses the observation that agents may demonstrate high capability on benchmarks while failing unpredictably in deployment. The piece is published via the normaltech.ai newsletter, associated with the AI Snake Oil research commentary tradition.
Data Readiness for Agentic AI in Financial Services
This MIT Technology Review commentary examines the specific requirements for deploying agentic AI in financial services, arguing that success depends more on data readiness than on model sophistication. The piece highlights the dual challenge of operating under heavy regulatory constraints while processing real-time market data. It frames data infrastructure as the critical bottleneck for agentic AI adoption in the sector.
Rethinking Organizational Design in the Age of Agentic AI
A MIT Technology Review commentary examines the gap between enterprise ambition and readiness for agentic AI adoption, citing survey data showing 85% of organizations want to be agentic within three years but 76% say their current infrastructure cannot support that transition. The piece focuses on organizational design challenges—people, processes, and workflows—as the primary barriers to agentic AI deployment at scale.
Framework for Carbon-Aware AI Inference Incentives Balancing Accuracy, Latency, and Emissions
This paper proposes an incentive framework for AI inference services that accounts for users' valuation of quality, latency, and environmental consciousness. The core mechanism is a two-tier subscription model where users accept discounted service—lower model quality and higher latency—during high carbon-intensity periods in exchange for reduced costs. The framework formalizes the tradeoff space between carbon emissions and quality-of-experience parameters, giving providers flexibility to shift inference load toward greener operating points.
Causal DAG model for when AI systems should engage Theory of Mind in conflict scenarios
A new arXiv preprint proposes a structural causal model (formalized as a directed acyclic graph) that treats Theory of Mind as a conditionally activated mechanism rather than an always-on capacity in AI systems. The model specifies exogenous situational and agent-level conditions, five endogenous mediators, and three causal pathways (tractability, reasoning-depth, enabling-cause) leading to an epistemic accuracy outcome. The work targets human-machine teaming in conflict contexts, offering a resource-rational decision procedure for when AI should engage social reasoning. Simulation validation and ethical considerations for conflict-optimized mentalizing are discussed.


