Almanac
model

GPT-4

modelactiveprovisionalgpt-4-5a7c19b8·21 events·first seen 28d ago

Aliases: GPT-4

Co-occurring entities

More like this (12)

Recent events (21)

7Openai Blog·28d ago·source ↗

GPT-4 API General Availability and Completions API Deprecation Plan

OpenAI has announced general availability of the GPT-4 API, alongside GPT-3.5 Turbo, DALL·E, and Whisper APIs. Concurrently, OpenAI is releasing a deprecation plan for older models in the Completions API, which are set to retire at the beginning of 2024. This marks a significant milestone in OpenAI's API product lifecycle, transitioning GPT-4 from limited access to broad developer availability.

9Openai Blog·28d ago·source ↗

GPT-4 Release

OpenAI released GPT-4, a large multimodal model accepting image and text inputs and producing text outputs. The model demonstrates human-level performance on various professional and academic benchmarks. It represents OpenAI's latest milestone in scaling deep learning.

7Openai Blog·28d ago·source ↗

Extracting Concepts from GPT-4: 16 Million Patterns via Sparse Autoencoders

OpenAI applied scaled sparse autoencoders (SAEs) to GPT-4 to automatically identify approximately 16 million interpretable features or patterns in the model's internal computations. This represents a significant scaling of mechanistic interpretability techniques previously demonstrated on smaller models. The work advances the ability to understand what concepts and representations large frontier models encode internally.

7Openai Blog·28d ago·source ↗

Building an Early Warning System for LLM-Aided Biological Threat Creation

OpenAI published a blueprint for evaluating whether LLMs can meaningfully assist in biological threat creation. In a controlled study with biology experts and students, GPT-4 was found to provide at most mild uplift in biological threat creation accuracy. The results are inconclusive but are framed as a starting point for ongoing safety research and community deliberation on biosecurity risks from AI.

6Openai Blog·28d ago·source ↗

Using GPT-4 for Content Moderation

OpenAI describes using GPT-4 to assist with content policy development and moderation decisions, replacing or reducing human moderator involvement. The approach aims to improve labeling consistency and accelerate policy iteration cycles. This represents a practical deployment of a frontier model in a high-stakes operational role within OpenAI itself.

5Openai Blog·28d ago·source ↗

Stripe Leverages GPT-4 to Streamline User Experience and Combat Fraud

Stripe has integrated GPT-4 into its platform to improve user experience and enhance fraud detection capabilities. This represents an early enterprise deployment of GPT-4 coinciding with its launch on March 14, 2023. The partnership demonstrates a major fintech company adopting frontier AI models for both customer-facing and security applications.

4Openai Blog·28d ago·source ↗

Ada Uses GPT-4 to Deliver a New Customer Service Standard

Ada, a customer service platform, has integrated GPT-4 to power its AI-driven support capabilities. The announcement, published on OpenAI's blog, highlights the deployment of GPT-4 in an enterprise customer service context. This represents a concrete enterprise deployment case study for GPT-4 in production customer-facing workflows.

7Openai Blog·28d ago·source ↗

Finding GPT-4's Mistakes with GPT-4: CriticGPT

OpenAI has developed CriticGPT, a GPT-4-based model trained to write critiques of ChatGPT outputs, helping human trainers identify errors during RLHF. The system is designed to address a core scalable oversight challenge: human raters often miss subtle mistakes in long or complex model outputs. CriticGPT-assisted trainers outperformed unassisted trainers in catching model errors, suggesting a path toward more reliable RLHF pipelines.

8Openai Blog·28d ago·source ↗

OpenAI Announces Function Calling, Longer Context, and API Price Reductions

OpenAI introduced function calling capabilities to its API, enabling models to reliably output structured JSON for calling developer-defined functions. The update also includes longer context windows, more steerable models (gpt-3.5-turbo-16k and gpt-4 updates), and reduced pricing on several API tiers. These changes significantly expand the practical utility of OpenAI models for agentic and tool-use applications.

6Openai Blog·28d ago·source ↗

Language models can explain neurons in language models

OpenAI uses GPT-4 to automatically generate and score natural-language explanations for the behavior of individual neurons in large language models. The methodology is applied to all neurons in GPT-2, producing a public dataset of explanations and quality scores. The authors acknowledge the explanations are imperfect, framing this as an early step toward automated mechanistic interpretability. This work establishes a scalable pipeline for neuron-level analysis that could inform future interpretability and safety research.

5Openai Blog·28d ago·source ↗

Khan Academy Explores GPT-4 in Limited Pilot Program

OpenAI announced a partnership with Khan Academy to pilot GPT-4 in educational settings. The initiative explores using GPT-4 to power virtual tutoring and classroom assistance tools. This represents an early enterprise deployment of GPT-4 in the education sector.

5Openai Blog·28d ago·source ↗

Iceland Government Uses GPT-4 for Icelandic Language Preservation

The Government of Iceland is partnering with OpenAI to use GPT-4 for preserving the Icelandic language. This initiative represents an early government-level deployment of a frontier language model for cultural and linguistic preservation purposes. The effort highlights GPT-4's multilingual capabilities and its application to low-resource or endangered language contexts.

5Openai Blog·28d ago·source ↗

Be My Eyes Integrates GPT-4 for Visual Accessibility

Be My Eyes, a visual assistance app for blind and low-vision users, has integrated GPT-4 to enhance its accessibility capabilities. The partnership leverages GPT-4's multimodal vision features to provide richer, AI-powered visual interpretation for users. This represents an early real-world deployment of GPT-4's vision capabilities in an assistive technology context.

5Openai Blog·28d ago·source ↗

Duolingo Integrates GPT-4 for Deeper Language Learning Conversations

OpenAI announced a partnership with Duolingo to integrate GPT-4 into the language learning platform. The integration aims to fill gaps in conversational practice and explanation that traditional language learning apps struggle to provide. This represents one of the early enterprise deployments of GPT-4 at scale in an educational context.

6Openai Blog·28d ago·source ↗

OpenAI Opens First Asia Office in Japan, Releases Japanese-Optimized GPT-4 Custom Model

OpenAI has announced the opening of its first Asian office in Japan, marking a significant geographic expansion. Alongside the office launch, OpenAI is releasing a custom GPT-4 model specifically optimized for the Japanese language. This represents both a strategic business move into the Asia-Pacific market and a technical effort to improve model performance for non-English languages.

7Openai Blog·28d ago·source ↗

GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models

OpenAI published research examining the potential labor market impacts of large language models, analyzing which occupations and tasks are most exposed to automation or augmentation by GPT-class models. The study introduces a framework for assessing LLM 'exposure' across job categories, finding that a significant share of U.S. workers could see at least 50% of their tasks affected. The paper represents an early systematic attempt to quantify economic disruption potential from frontier AI systems.

8Mistral Ai News·15d ago·source ↗

Mistral AI Releases Mistral Large, Claims Second-Best API Model After GPT-4

Mistral AI has released Mistral Large, its most capable model to date, claiming second place among API-accessible models behind GPT-4 on standard benchmarks including MMLU, HellaSwag, and coding/math evals. The model features a 32K context window, native fluency in five European languages, function calling, and constrained output mode. Simultaneously, Mistral is launching a new Mistral Small optimized for latency, restructuring its endpoint lineup, and announcing Microsoft Azure as its first major distribution partner. This marks Mistral's first significant commercial partnership and expansion beyond its own infrastructure.

3Openai Blog·28d ago·source ↗

Using GPT-4 to Improve Teaching and Learning in Brazil

OpenAI has partnered with Arco Education, a Brazilian edtech company, to deploy GPT-4 in educational settings across Brazil. The initiative aims to enhance teaching and learning outcomes by integrating large language model capabilities into Arco's existing platforms. This represents an enterprise deployment of GPT-4 in the Latin American education sector.

7Openai Blog·28d ago·source ↗

GPT-4V(ision) System Card

OpenAI published the system card for GPT-4V(ision), the multimodal extension of GPT-4 that accepts image inputs alongside text. The document covers capability evaluations, safety assessments, and known limitations of the vision-enabled model. It represents OpenAI's formal safety and transparency disclosure accompanying the GPT-4V release.

9Openai Blog·28d ago·source ↗

Hello GPT-4o

OpenAI announces GPT-4o (Omni), a new flagship multimodal model capable of reasoning across audio, vision, and text in real time. The model represents a significant step toward natively multimodal AI, processing and generating across modalities without separate pipeline stages. It is positioned as OpenAI's primary production model going forward.

4arXiv · cs.CL·7d ago·source ↗

Pipeline detects curriculum knowledge gaps from student-AI conversational logs using prerequisite graphs

Researchers present a pipeline that classifies student questions directed at a conversational AI teaching assistant into curriculum topics using a few-shot classifier grounded in a GPT-4-extracted prerequisite knowledge graph. Evaluated on 1,340 questions from 164 graduate students, the classifier achieves 80% accuracy across 43 labels. Topic-level question volume significantly correlates with student-reported difficulty (rho=0.491), validating that AI interaction logs carry actionable diagnostic signals about knowledge gaps.