5arXiv cs.AI (Artificial Intelligence)·24d ago

Formal Framework for Agentic Technical Debt and Stochastic Tax in AI Workflows

This paper introduces a formal model distinguishing two constructs in agentic AI deployments: Agentic Technical Debt (accumulated design and governance liability) and Stochastic Tax (recurring operating burden from probabilistic agents in business workflows). The framework provides measurement methods, simulation tools, and a dashboard expression grounded in operational data estimation. An accounts-payable simulation and companion spreadsheet illustrate practical application. The work targets both technical and managerial audiences seeking to quantify and govern agentic AI system costs.

Inference Economics Enterprise Deployment Patterns Agent and Tool Ecosystem Agentic AI Systems Stochastic Tax Agentic Technical Debt

Related guides (3)

Enterprise Deployment PatternsTopic guide

Enterprise Deployment Patterns: From AI Demo to Production Reality

Read asBeginner In-depth

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How AI Is Learning to Act, Not Just Answer

Read asBeginner In-depth

Inference EconomicsTopic guide

Inference Economics: The Cost of Running AI in Production

Read asBeginner In-depth

Related events (8)

6Openai Blog·1mo ago·source ↗

Practices for Governing Agentic AI Systems

OpenAI published a framework document outlining governance practices for agentic AI systems. The piece addresses how to manage AI agents that take sequences of actions, make decisions, and operate with varying degrees of autonomy. It likely covers topics such as human oversight, authorization boundaries, and accountability structures for agentic deployments.

AI Safety Research Enterprise Deployment Patterns OpenAI +1 more

5arXiv · cs.AI·5d ago·source ↗

Taxonomy and governance gap analysis for AI contributors in open-source software

A preprint from arXiv analyzes how open-source organizations are handling AI-generated and agent-driven contributions, comparing policies across six major projects (SymPy, LLVM, matplotlib, OpenInfra, Apache Software Foundation, Linux Foundation). The authors develop a six-dimensional taxonomy covering disclosure, responsibility, human oversight, licensing, enforcement, and maintainer workload, and score each organization's policy maturity. The paper maps documented agent incidents onto governance gaps and identifies misalignments with emerging regulatory frameworks including the EU AI Act, NIST AI RMF, and ISO/IEC 42001, proposing a harmonized tiered framework.

AI Safety Research Regulatory Developments LLVM Linux Foundation NIST AI RMF +6 more

7Anthropic News·18d ago·source ↗

Anthropic publishes framework for safe and trustworthy agent development

Anthropic released a formal framework for responsible agent development, articulating principles around human oversight, transparency, value alignment, and privacy for autonomous AI agents. The document draws on Claude Code as a reference implementation and cites enterprise deployments at Trellix and Block as real-world examples. The framework is positioned as a contribution to emerging industry standards for agentic AI systems, acknowledging open technical challenges in value alignment measurement and oversight calibration.

AI Safety Research Regulatory Developments Block Claude Code Trellix +2 more

5Ai Snake Oil·1mo ago·source ↗

New Paper: Towards a Science of AI Agent Reliability

A new paper proposes a framework for quantifying the gap between AI agent capability and reliability, aiming to establish a more rigorous science of agent dependability. The work addresses the observation that agents may demonstrate high capability on benchmarks while failing unpredictably in deployment. The piece is published via the normaltech.ai newsletter, associated with the AI Snake Oil research commentary tradition.

Evaluation and Benchmarking AI Safety Research Towards a Science of AI Agent Reliability normaltech.ai AI Snake Oil +2 more

4Mit Technology Review — Ai·1mo ago·source ↗

Data Readiness for Agentic AI in Financial Services

This MIT Technology Review commentary examines the specific requirements for deploying agentic AI in financial services, arguing that success depends more on data readiness than on model sophistication. The piece highlights the dual challenge of operating under heavy regulatory constraints while processing real-time market data. It frames data infrastructure as the critical bottleneck for agentic AI adoption in the sector.

Enterprise Deployment Patterns Regulatory Developments financial services agentic AI MIT Technology Review +1 more

4Mit Technology Review — Ai·25d ago·source ↗

Rethinking Organizational Design in the Age of Agentic AI

A MIT Technology Review commentary examines the gap between enterprise ambition and readiness for agentic AI adoption, citing survey data showing 85% of organizations want to be agentic within three years but 76% say their current infrastructure cannot support that transition. The piece focuses on organizational design challenges—people, processes, and workflows—as the primary barriers to agentic AI deployment at scale.

Enterprise Deployment Patterns Agent and Tool Ecosystem agentic AI MIT Technology Review

4arXiv · cs.LG·24d ago·source ↗

Framework for Carbon-Aware AI Inference Incentives Balancing Accuracy, Latency, and Emissions

This paper proposes an incentive framework for AI inference services that accounts for users' valuation of quality, latency, and environmental consciousness. The core mechanism is a two-tier subscription model where users accept discounted service—lower model quality and higher latency—during high carbon-intensity periods in exchange for reduced costs. The framework formalizes the tradeoff space between carbon emissions and quality-of-experience parameters, giving providers flexibility to shift inference load toward greener operating points.

Training Infrastructure Inference Economics carbon intensity AI inference carbon emissions two-tier service subscription model +1 more

4arXiv · cs.AI·4d ago·source ↗

Causal DAG model for when AI systems should engage Theory of Mind in conflict scenarios

A new arXiv preprint proposes a structural causal model (formalized as a directed acyclic graph) that treats Theory of Mind as a conditionally activated mechanism rather than an always-on capacity in AI systems. The model specifies exogenous situational and agent-level conditions, five endogenous mediators, and three causal pathways (tractability, reasoning-depth, enabling-cause) leading to an epistemic accuracy outcome. The work targets human-machine teaming in conflict contexts, offering a resource-rational decision procedure for when AI should engage social reasoning. Simulation validation and ethical considerations for conflict-optimized mentalizing are discussed.

AI Safety Research Agent and Tool Ecosystem A Causal Model of Theory of Mind in Conflict for Artificial Intelligence