Almanac
← Events
5arXiv cs.AI (Artificial Intelligence)·24d ago

Formal Framework for Agentic Technical Debt and Stochastic Tax in AI Workflows

This paper introduces a formal model distinguishing two constructs in agentic AI deployments: Agentic Technical Debt (accumulated design and governance liability) and Stochastic Tax (recurring operating burden from probabilistic agents in business workflows). The framework provides measurement methods, simulation tools, and a dashboard expression grounded in operational data estimation. An accounts-payable simulation and companion spreadsheet illustrate practical application. The work targets both technical and managerial audiences seeking to quantify and govern agentic AI system costs.

Related guides (3)

Related events (8)

6Openai Blog·1mo ago·source ↗

Practices for Governing Agentic AI Systems

OpenAI published a framework document outlining governance practices for agentic AI systems. The piece addresses how to manage AI agents that take sequences of actions, make decisions, and operate with varying degrees of autonomy. It likely covers topics such as human oversight, authorization boundaries, and accountability structures for agentic deployments.

5arXiv · cs.AI·5d ago·source ↗

Taxonomy and governance gap analysis for AI contributors in open-source software

A preprint from arXiv analyzes how open-source organizations are handling AI-generated and agent-driven contributions, comparing policies across six major projects (SymPy, LLVM, matplotlib, OpenInfra, Apache Software Foundation, Linux Foundation). The authors develop a six-dimensional taxonomy covering disclosure, responsibility, human oversight, licensing, enforcement, and maintainer workload, and score each organization's policy maturity. The paper maps documented agent incidents onto governance gaps and identifies misalignments with emerging regulatory frameworks including the EU AI Act, NIST AI RMF, and ISO/IEC 42001, proposing a harmonized tiered framework.

7Anthropic News·18d ago·source ↗

Anthropic publishes framework for safe and trustworthy agent development

Anthropic released a formal framework for responsible agent development, articulating principles around human oversight, transparency, value alignment, and privacy for autonomous AI agents. The document draws on Claude Code as a reference implementation and cites enterprise deployments at Trellix and Block as real-world examples. The framework is positioned as a contribution to emerging industry standards for agentic AI systems, acknowledging open technical challenges in value alignment measurement and oversight calibration.

5Ai Snake Oil·1mo ago·source ↗

New Paper: Towards a Science of AI Agent Reliability

A new paper proposes a framework for quantifying the gap between AI agent capability and reliability, aiming to establish a more rigorous science of agent dependability. The work addresses the observation that agents may demonstrate high capability on benchmarks while failing unpredictably in deployment. The piece is published via the normaltech.ai newsletter, associated with the AI Snake Oil research commentary tradition.

4Mit Technology Review — Ai·1mo ago·source ↗

Data Readiness for Agentic AI in Financial Services

This MIT Technology Review commentary examines the specific requirements for deploying agentic AI in financial services, arguing that success depends more on data readiness than on model sophistication. The piece highlights the dual challenge of operating under heavy regulatory constraints while processing real-time market data. It frames data infrastructure as the critical bottleneck for agentic AI adoption in the sector.

4Mit Technology Review — Ai·25d ago·source ↗

Rethinking Organizational Design in the Age of Agentic AI

A MIT Technology Review commentary examines the gap between enterprise ambition and readiness for agentic AI adoption, citing survey data showing 85% of organizations want to be agentic within three years but 76% say their current infrastructure cannot support that transition. The piece focuses on organizational design challenges—people, processes, and workflows—as the primary barriers to agentic AI deployment at scale.

4arXiv · cs.LG·24d ago·source ↗

Framework for Carbon-Aware AI Inference Incentives Balancing Accuracy, Latency, and Emissions

This paper proposes an incentive framework for AI inference services that accounts for users' valuation of quality, latency, and environmental consciousness. The core mechanism is a two-tier subscription model where users accept discounted service—lower model quality and higher latency—during high carbon-intensity periods in exchange for reduced costs. The framework formalizes the tradeoff space between carbon emissions and quality-of-experience parameters, giving providers flexibility to shift inference load toward greener operating points.

4arXiv · cs.AI·4d ago·source ↗

Causal DAG model for when AI systems should engage Theory of Mind in conflict scenarios

A new arXiv preprint proposes a structural causal model (formalized as a directed acyclic graph) that treats Theory of Mind as a conditionally activated mechanism rather than an always-on capacity in AI systems. The model specifies exogenous situational and agent-level conditions, five endogenous mediators, and three causal pathways (tractability, reasoning-depth, enabling-cause) leading to an epistemic accuracy outcome. The work targets human-machine teaming in conflict contexts, offering a resource-rational decision procedure for when AI should engage social reasoning. Simulation validation and ethical considerations for conflict-optimized mentalizing are discussed.