Almanac
model

GPT

modelactivegpt-58362d74·9 events·first seen 28d ago

Aliases: GPT

Co-occurring entities

More like this (12)

Recent events (9)

7Openai Blog·28d ago·source ↗

OpenAI models, Codex, and Managed Agents come to AWS

OpenAI has announced that its GPT models, Codex, and Managed Agents are now available on AWS, allowing enterprise customers to deploy OpenAI capabilities within their existing AWS environments. The partnership extends OpenAI's distribution reach into the major cloud hyperscaler ecosystem. This follows a broader industry pattern of AI labs partnering with cloud providers to reach enterprise customers through familiar procurement and compliance channels.

6Openai Blog·28d ago·source ↗

Image GPT: Transformer Models Applied to Pixel Sequences for Image Generation and Classification

OpenAI demonstrates that a large transformer model trained autoregressively on pixel sequences can generate coherent image completions and samples, analogous to text generation. The work establishes a correlation between generative sample quality and downstream image classification accuracy. The best generative model achieves features competitive with top convolutional networks in the unsupervised setting, suggesting shared representational principles across modalities.

4Openai Blog·28d ago·source ↗

OpenAI Releases GABRIEL: Open-Source Toolkit for AI-Assisted Social Science Research

OpenAI has released GABRIEL, an open-source toolkit that leverages GPT models to convert qualitative text and images into quantitative data for social science research. The tool is designed to help researchers analyze large-scale qualitative datasets that would otherwise be impractical to process manually. It represents an application of frontier LLMs to academic research methodology rather than a new model or capability announcement.

5Hacker News·41h ago·source ↗

HN community discusses replacing Claude/GPT with local models for daily coding

A high-engagement Hacker News thread (510 points, 256 comments) asks whether practitioners have successfully replaced cloud-hosted models like Claude or GPT with local models for daily coding workflows. The discussion likely surfaces real-world comparisons of local vs. hosted model performance, latency, cost, and privacy tradeoffs. High engagement signals this is a live practitioner concern in mid-2026.

6Openai Blog·28d ago·source ↗

Efficient Training of Language Models to Fill in the Middle

OpenAI published research on training language models with a fill-in-the-middle (FIM) objective, enabling models to complete text given both a prefix and a suffix context. The technique allows infilling capabilities to be added at essentially no cost to left-to-right generative performance. This work has direct implications for code completion and editing use cases, and was later incorporated into Codex and related models.

5arXiv · cs.CL·18d ago·source ↗

LLUMI: Fine-Tuning Open-Source LLMs for Mental Health Writing Assistance Using Reddit Community Feedback

LLUMI is a two-component system (a generation model and an improvement model) designed to provide mental health writing assistance using smaller open-source LLMs hosted in privacy-preserving, on-premise environments. The system leverages Reddit community endorsement signals (upvotes/downvotes) to construct preference pairs for SFT and DPO training, then further aligns outputs via human evaluation across readability, empathy, connection, actionability, and safety dimensions. Results show LLUMI achieves performance comparable to proprietary GPT-based models on linguistic and human evaluations, suggesting community-derived preference signals can substitute for expensive expert labeling in sensitive domains.

6arXiv · cs.CL·5h ago·source ↗

RubricsTree: Scalable hierarchical rubric framework for evaluating personal health AI agents

RubricsTree is a new evaluation framework for LLM-powered personal health agents, built around a hierarchical taxonomy of over 100 clinically-verifiable Boolean rubrics derived from 4,000 real user queries and curated with physician oversight. A context-aware router activates only relevant rubrics per query, enabling scalable yet expert-aligned evaluation. The framework outperforms strong LLM-as-a-judge baselines on expert alignment and, when used as training signal, yields up to ~66% relative gains on HealthBench across Gemini, GPT, and Qwen model families. The work addresses a concrete bottleneck in clinical deployment of health AI: the cost-quality tradeoff in evaluation.

7Openai Blog·28d ago·source ↗

Sora System Card

OpenAI has published the system card for Sora, its video generation model capable of accepting text, image, and video inputs to produce video outputs. The model builds on techniques from DALL-E and GPT and is positioned as a creative storytelling tool. The system card documents safety evaluations, mitigations, and residual risks associated with the model's deployment.

5Github Trending·13d ago·source ↗

HexStrike AI: MCP server exposing 150+ cybersecurity tools to AI agents

HexStrike AI is an open-source MCP server that enables AI agents (Claude, GPT, Copilot, and others) to autonomously invoke over 150 offensive security tools for penetration testing, vulnerability discovery, and bug bounty automation. The project bridges LLMs with real-world offensive security capabilities via the Model Context Protocol. With 9,221 GitHub stars, it represents a notable community signal around agentic security tooling and the expanding attack surface of AI-driven automation.