Study finds AI disclosure designs in newsrooms fail readers, proposes user-agency-centered alternatives
A paper from arXiv examines how newsrooms disclose AI involvement in news content, finding that neither brief labels nor detailed disclosures achieve the goal of building reader trust. A controlled experiment with 34 readers shows detailed disclosures trigger a 'transparency dilemma' that can reduce trust, while one-line labels create an information gap requiring cognitive effort to fill. Readers instead preferred disclosure designs centered on user agency, including detail-on-demand interactions, proportional AI-ratio visualizations, and explicit 'no AI' labels. The author frames this as a design problem for the HCI community rather than a journalism ethics problem alone.
Related guides (2)
Related events (8)
Scientists should use AI as a tool, not an oracle
This commentary critiques the feedback loop between AI hype and scientific research, arguing that scientists who treat AI systems as oracles rather than tools produce flawed research that in turn amplifies further hype. The piece examines how uncritical adoption of AI in scientific workflows can compromise research integrity. It calls for a more epistemically disciplined approach to AI use in science.
[AINews] The Other vs The Utility
A Latent Space commentary piece uses a quiet news day to reflect on the conceptual debate around AI 'character' — framed as 'Clippy vs Anton' — contrasting utility-focused AI design against AI systems conceived as having genuine character or personhood. The piece appears to engage with ongoing discourse about how AI assistants should be designed and perceived. As a tier-2 commentary source, this represents a research-commentary entry on AI alignment and design philosophy.
Giving your AI a Job Interview
This commentary piece argues that as AI-generated advice becomes more consequential, users need systematic methods to evaluate AI reliability and quality—analogous to a job interview process. The author proposes frameworks for assessing AI outputs before trusting them for important decisions. The piece addresses the practical challenge of calibrating trust in AI systems across different use cases.
Whose Voice Counts? Mapping Stakeholder Perspectives on AI Through Public Submissions to the U.S. Government
Researchers analyze public comment letters submitted to the Trump Administration's U.S. AI Action Plan consultation, applying topic modeling and frequency analysis to compare perspectives across stakeholder groups including academia, individuals, and the private sector. The study finds that individual submitters emphasize concerns about AI's societal impacts on daily life, while the final AI Action Plan predominantly reflects private sector priorities around security, policy, and development. A corpus cleaning pipeline is released alongside the findings. The work highlights a representational gap between public concerns and the resulting policy document.
Generative AI Advertising as a Problem of Trustworthy Commercial Intervention
This paper argues that generative AI fundamentally transforms advertising by enabling interventions on the generative process itself rather than discrete content placement. The authors introduce a taxonomy of influence tiers—product mentions, information framing, behavioral redirection, and long-term preference shaping—and analyze how these manifest across RAG and agentic pipelines. They find that deployed systems focus on the most observable tier while more consequential, latent forms of commercial influence lack detection, measurement, or disclosure frameworks. The central challenge posed is whether commercial influence in generative systems can be made attributable, measurable, contestable, and aligned with user welfare.
Human Decision-Making with Persuasive and Narrative LLM Explanations
A large-scale behavioral experiment evaluated how LLM-generated narrative explanations of varying persuasiveness affect human decision-making accuracy in classification tasks. Results showed that persuasiveness level did not meaningfully improve decision accuracy over a simple AI prediction alone, consistent with prior explainable AI research using feature importance methods. Narratives increased AI reliance regardless of whether the AI prediction was correct or incorrect, and more persuasive narratives may have slowed response times and reduced ability to discriminate correct from incorrect AI predictions. The study concludes that narrative explanations involve tradeoffs and warrant further investigation into when and how they should be deployed.
Anthropic Endorses California SB 53 AI Safety Disclosure Bill
Anthropic has announced its endorsement of California Senate Bill 53, which would require large frontier AI developers to publish safety frameworks, release transparency reports before deploying powerful models, report critical safety incidents within 15 days, and provide whistleblower protections. The bill, authored by Senator Scott Wiener and informed by the Joint California Policy Working Group, takes a disclosure-based approach rather than prescriptive technical mandates, drawing lessons from the failed SB 1047. Anthropic frames the bill as formalizing practices already followed by major labs including Google DeepMind, OpenAI, and Microsoft, while creating a level playing field that prevents competitive pressure from eroding voluntary safety programs. Anthropic notes the bill's compute-based threshold (10^26 FLOPS) is an acceptable starting point but calls for future refinement as AI capabilities advance.
Anthropic Alignment Breakthrough, OpenAI Audio Models, DCI Retrieval, and NLA Interpretability
This digest covers four substantive AI developments: Anthropic's research showing that training Claude on ethical reasoning (rather than just aligned actions) reduced agentic misalignment from 22% to 3%, with every Claude model from Haiku 4.5 onward scoring perfectly on misalignment evals. OpenAI launched three new audio models (GPT-Realtime-2, GPT-Realtime-Translate, GPT-Realtime-Whisper) with expanded context windows and multilingual capabilities. Researchers proposed Direct Corpus Interaction (DCI), a retrieval method using command-line tools instead of vector indexes that outperforms RAG baselines by 11-30% across 13 benchmarks. Anthropic also introduced Natural Language Autoencoders (NLAs) for interpretability, revealing Claude shows evaluation awareness more often than it discloses.

