Entity · model

Gemini Omni

modelactivegemini-omni-c888f25b·4 events·first seen May 19, 2026

Aliases: Gemini Omni, Gemini-Omni

Co-occurring entities

Google DeepMind Gemini Seedance 2.0 Black Forest Labs Grok Imagine FLUX-mimic Google xAI FLUX 3 MMAE Nano Banana 2

More like this (12)

Gemini Omni (NanoBanana)Gemini Omni Flash OmniGibson Gemini Advanced Gemini App Gemini Audio Google Gemini Gemini Robotics Gemini Gemini 2.5 Gemini Spark Gemini API

Recent events (4)

7Latent Space·6d ago·source ↗

Black Forest Labs releases FLUX 3 multimodal flow model, reportedly outperforming Seedance 2.0, Gemini Omni, and Grok Imagine

Black Forest Labs has released FLUX 3, a multimodal flow model that reportedly beats competing image/video generation systems including Seedance 2.0, Gemini Omni, and Grok Imagine on key benchmarks. The release also includes a FLUX-mimic video-action robotics model, extending the FLUX family into embodied AI applications. This represents a significant capability advance for BFL in the competitive generative media space.

Frontier Model Releases Multimodal Progress Seedance 2.0 Black Forest Labs Grok Imagine +5 more

5arXiv · cs.CL·Jun 8, 2026·source ↗

MMAE: First comprehensive benchmark for instruction-based audio editing across 7 modalities

Researchers introduce MMAE, a 2,000-sample benchmark for evaluating general-purpose instruction-based audio editing systems, covering 7 audio modalities (sound, speech, music, and mixtures) and 6 levels of task complexity. The benchmark uses a rubric-based evaluation framework decomposing tasks into 17,741 verifiable criteria to assess instruction following and context consistency. Evaluation of leading models reveals severe limitations: Exact Match Rate falls below 5% overall and hits 0% on complex mixed-modality tasks, exposing fundamental gaps in current audio editing systems.

Evaluation and Benchmarking Multimodal Progress MMAE Gemini Omni Nano Banana 2

7Hacker News·May 19, 2026·source ↗

Gemini Omni Model Announced by Google DeepMind

Google DeepMind has published a page for 'Gemini Omni,' a new model in the Gemini family. The announcement appears on DeepMind's official models page, suggesting a new multimodal or omni-capable variant. Limited detail is available from the source, but the HN community engagement (190 points, 87 comments) indicates notable interest.

Frontier Model Releases Multimodal Progress Gemini Omni Google DeepMind Gemini

8Google Deepmind Blog·May 19, 2026·source ↗

Introducing Gemini Omni

DeepMind has announced Gemini Omni, a new model or capability in the Gemini family, published on their official blog in May 2026. The article body was not available for ingestion, so specific capability details, benchmarks, or deployment information cannot be extracted. Based on the naming convention, this likely represents a multimodal or unified-modality extension of the Gemini model line. Further details should be retrieved from the source URL.

Frontier Model Releases Multimodal Progress Gemini Omni Google DeepMind Gemini