Almanac
← Events
5Hugging Face Blog·1mo ago

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context

IBM released Granite Embedding Multilingual R2, an open-weights (Apache 2.0) multilingual embedding model with 32K context window, claiming best-in-class retrieval quality among sub-100M parameter models. The model is positioned for enterprise RAG and retrieval use cases across multiple languages. It is hosted and announced via Hugging Face.

Related guides (4)

Related events (8)

5Hugging Face Blog·1mo ago·source ↗

Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents

IBM released Granite 4.0 3B Vision, a compact multimodal model targeting enterprise document understanding tasks. The model is hosted on Hugging Face and positioned for deployment in resource-constrained enterprise environments. As a 3B-parameter vision-language model, it competes in the small-but-capable segment increasingly favored for on-premise and edge deployments.

5Hugging Face Blog·1mo ago·source ↗

Granite 4.1 LLMs: How They're Built

IBM has published a blog post on Hugging Face detailing the construction of its Granite 4.1 language models. The post covers architectural and training decisions behind the new model family. As a tier-2 source with default commentary depth, this provides insight into IBM's continued investment in open enterprise LLMs but lacks the full technical depth of a primary research paper.

5Hugging Face Blog·1mo ago·source ↗

Granite 4.0 Nano: Just how small can you go?

IBM has released Granite 4.0 Nano, a small-footprint language model in the Granite 4.0 family, published via the Hugging Face blog. The post explores the capabilities and trade-offs of pushing model size to its lower limits while maintaining practical utility. This release is part of IBM's ongoing effort to develop efficient, enterprise-deployable AI models under the Granite brand.

7Qwen Research·1mo ago·source ↗

Qwen3 Embedding: State-of-the-Art Text Embedding and Reranking Models Released

Alibaba's Qwen team has released the Qwen3 Embedding series, a set of open-weights text embedding and reranking models built on the Qwen3 foundation model. The models are designed for retrieval and reranking tasks and claim state-of-the-art performance across multiple benchmarks. They are released under the Apache 2.0 license and are available on Hugging Face and ModelScope.

7Hugging Face Blog·1mo ago·source ↗

Welcome Gemma 3: Google's All-New Multimodal, Multilingual, Long-Context Open LLM

Google has released Gemma 3, a new family of open-weights large language models featuring multimodal capabilities, multilingual support, and extended context windows. The Hugging Face blog post introduces the model family and its key features. Gemma 3 represents a significant update to Google's open-weights model line, expanding beyond text-only capabilities to include vision and broader language coverage.

9Hugging Face Blog·1mo ago·source ↗

Llama 3.1 Released: 405B, 70B & 8B Models with Multilinguality and Long Context

Meta released Llama 3.1, a family of open-weights models at three scales (405B, 70B, 8B) featuring multilingual support and extended context windows. The 405B model represents Meta's largest open-weights release to date, positioning it as a frontier-class open model. Hugging Face published a blog post covering the release, integration details, and deployment options across the ecosystem.

6Hugging Face Blog·1mo ago·source ↗

Welcome EmbeddingGemma, Google's new efficient embedding model

Google has released EmbeddingGemma, a new embedding model announced via the Hugging Face blog. The model appears to be positioned as an efficient option for generating text embeddings, likely derived from or related to the Gemma model family. Details on architecture, benchmarks, and use cases are expected in the full post.

8Mistral Ai News·19d ago·source ↗

Mistral Large 2 (123B): New Frontier Model with 128k Context, Multilingual and Code Capabilities

Mistral AI releases Mistral Large 2, a 123-billion-parameter model with a 128k context window, supporting 80+ coding languages and over a dozen natural languages. The model claims competitive performance with GPT-4o, Claude 3 Opus, and Llama 3 405B on code generation, reasoning, and multilingual benchmarks, while targeting cost-efficient single-node inference. Weights are available under a Mistral Research License for non-commercial use, with a commercial license required for self-deployment. The model is accessible via Mistral's la Plateforme API (mistral-large-2407), HuggingFace, and Google Cloud Vertex AI.