5 · OpenAI Blog · 1mo ago

Introducing text and code embeddings

OpenAI launched a new embeddings endpoint in its API, enabling natural language and code tasks such as semantic search, clustering, topic modeling, and classification. The endpoint provides vector representations of text and code, making it easier for developers to build applications requiring semantic understanding. This was a significant early step in OpenAI's API product expansion beyond text generation.
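As a rough illustration of how vector representations support semantic search, the sketch below ranks documents by cosine similarity to a query vector. The vectors are made-up three-dimensional toy values, not output from the endpoint, and the helper names `cosine_similarity` and `rank_by_similarity` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_by_similarity(query_vec, doc_vecs):
    """Return document ids ordered from most to least similar to the query."""
    scored = [
        (cosine_similarity(query_vec, vec), doc_id)
        for doc_id, vec in doc_vecs.items()
    ]
    return [doc_id for _, doc_id in sorted(scored, reverse=True)]


# Toy vectors invented for illustration; real embeddings have far more dimensions.
docs = {
    "cats": [0.9, 0.1, 0.0],
    "dogs": [0.8, 0.3, 0.1],
    "tax_law": [0.0, 0.2, 0.9],
}
query = [1.0, 0.0, 0.0]
ranking = rank_by_similarity(query, docs)
```

In a real application the vectors would come from an embedding model, and the same ranking step would typically run inside a vector index rather than a Python loop.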

Related events (8)

5 · OpenAI Blog · 1mo ago

New embedding models and API updates from OpenAI

OpenAI announced new embedding models alongside API updates, expanding its developer-facing infrastructure offerings. The release likely includes updated text-embedding models with improved performance or cost characteristics. It is part of OpenAI's ongoing effort to maintain and grow its API platform for enterprise and developer use cases.

5 · OpenAI Blog · 1mo ago

Text and Code Embeddings by Contrastive Pre-training

OpenAI published research on generating text and code embeddings using contrastive pre-training. The approach trains models to produce dense vector representations useful for semantic search, classification, and code retrieval tasks. This work underpins OpenAI's embeddings API offerings and represents an early public articulation of their embedding methodology.
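For readers unfamiliar with contrastive training, here is a minimal sketch of a generic contrastive loss with in-batch negatives (often called InfoNCE). It is an illustration of the general idea only, not necessarily the exact objective used in the research; the function name `info_nce_loss` and the `temperature` default are assumptions.

```python
import math


def info_nce_loss(query_vecs, doc_vecs, temperature=0.1):
    """Mean cross-entropy over a batch of (query, document) pairs.

    Pair i is the positive for query i; every other document in the
    batch is treated as a negative. Vectors must be non-zero.
    """
    def normalize(v):
        norm = math.sqrt(sum(x * x for x in v))
        return [x / norm for x in v]

    queries = [normalize(v) for v in query_vecs]
    docs = [normalize(v) for v in doc_vecs]

    total = 0.0
    for i, q in enumerate(queries):
        # Scaled cosine similarity between query i and every document.
        logits = [sum(a * b for a, b in zip(q, d)) / temperature for d in docs]
        # Numerically stable log-sum-exp over the batch.
        peak = max(logits)
        log_sum_exp = peak + math.log(sum(math.exp(l - peak) for l in logits))
        # Negative log-probability of the matching document.
        total += log_sum_exp - logits[i]
    return total / len(queries)


# Toy batch: matched pairs should score a much lower loss than mismatched ones.
aligned = info_nce_loss([[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
shuffled = info_nce_loss([[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [1.0, 0.0]])
```

Minimizing a loss of this shape pulls each paired text and code vector together while pushing apart unrelated items in the same batch.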

5 · OpenAI Blog · 1mo ago

OpenAI Releases New and Improved Embedding Model

OpenAI announced a new embedding model described as significantly more capable, cost-effective, and simpler to use than prior offerings. Published in December 2022, the announcement updates OpenAI's text embedding API. The available body text gives no specific benchmark numbers or architectural details.

6 · Mistral AI News · 19d ago

Mistral AI Releases Codestral Embed: First Code-Specialized Embedding Model

Mistral AI has launched Codestral Embed (codestral-embed-2505), its first embedding model specialized for code retrieval and semantic understanding. Mistral claims the model outperforms leading competitors, including Voyage Code 3, Cohere Embed v4.0, and OpenAI's large embedding model, on benchmarks including SWE-Bench, CodeSearchNet, and Text2SQL tasks. It supports variable output dimensions and precisions (including int8), enabling storage/quality trade-offs, and is priced at $0.15 per million tokens via Mistral's API, with batch discounts available.
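To make the storage/quality trade-off concrete, the sketch below shortens a vector by keeping its leading dimensions and then stores it as int8 values with a single scale factor. It assumes the leading dimensions carry most of the signal, which is a property of some embedding models and is not confirmed here for Codestral Embed; the input vector and all function names are hypothetical.

```python
import math


def truncate_and_renormalize(vec, dims):
    """Keep the first `dims` values and rescale to unit length.

    Assumes the kept values are not all zero.
    """
    head = vec[:dims]
    norm = math.sqrt(sum(x * x for x in head))
    return [x / norm for x in head]


def quantize_int8(vec):
    """Symmetric quantization to integers in [-127, 127] with one shared scale.

    Assumes the vector has at least one non-zero value.
    """
    scale = max(abs(x) for x in vec) / 127.0
    return [round(x / scale) for x in vec], scale


def dequantize(quantized, scale):
    """Approximate reconstruction of the original floats."""
    return [q * scale for q in quantized]


# Made-up 4-dimensional vector; real embeddings are much longer.
full = [0.6, -0.2, 0.1, 0.05]
small = truncate_and_renormalize(full, 2)
codes, scale = quantize_int8(small)
restored = dequantize(codes, scale)
```

Halving the dimensions halves storage, and int8 uses one byte per value instead of four for float32, at the cost of a reconstruction error of at most half the scale per value.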

5 · OpenAI Blog · 1mo ago

JetBrains Embeds OpenAI API into Developer Tools, Achieving Fastest-Growing Product

JetBrains has integrated OpenAI's API into its developer tooling suite, resulting in what the company describes as its fastest-growing product ever. The partnership represents a significant enterprise deployment of OpenAI's API within a major IDE ecosystem used by millions of developers. The announcement highlights growing adoption of AI coding assistance in professional development environments.

4 · Hugging Face Blog · 1mo ago

Deploy Embedding Models with Hugging Face Inference Endpoints

Hugging Face published a guide on deploying embedding models using its Inference Endpoints service. The post covers how to set up dedicated endpoints for embedding models, enabling scalable vector generation for downstream tasks like semantic search and retrieval-augmented generation. This is part of Hugging Face's broader push to make production deployment of specialized model types more accessible.

8 · OpenAI Blog · 1mo ago

OpenAI API Launch

OpenAI announced the release of an API providing programmatic access to its AI models. This marked a significant infrastructure and commercialization milestone, enabling third-party developers to integrate OpenAI's models into their own applications. The launch established the foundation for OpenAI's developer ecosystem and API-first business model.

4 · OpenAI Blog · 1mo ago

Powering next generation applications with OpenAI Codex

OpenAI announced that Codex was powering 70 different applications across various use cases via the OpenAI API. The post highlights the breadth of adoption of Codex as a developer tool for code generation and related tasks. This represents an early milestone in the enterprise and developer ecosystem deployment of large language models for coding.