Gemini Omni
gemini-omni-c888f25b·3 events·first seen 28d ago
Aliases: Gemini Omni, Gemini-Omni
Recent events (3)
Introducing Gemini Omni
Google DeepMind announced Gemini Omni, a new model or capability in the Gemini family, on its official blog in May 2026. The article body was not available for ingestion, so capability details, benchmarks, and deployment information could not be extracted. The name suggests a multimodal or unified-modality extension of the Gemini model line, but this is an inference from the naming convention alone. Further details should be retrieved from the source URL.
Gemini Omni Model Announced by Google DeepMind
Google DeepMind has published a page for 'Gemini Omni,' a new model in the Gemini family. The announcement appears on DeepMind's official models page, and the name suggests a new multimodal or omni-capable variant. The source gives limited detail, but engagement on Hacker News (190 points, 87 comments) indicates notable interest.
MMAE: First comprehensive benchmark for instruction-based audio editing across 7 modalities
Researchers introduce MMAE, a 2,000-sample benchmark for evaluating general-purpose instruction-based audio editing systems, covering 7 audio modalities (sound, speech, music, and mixtures) and 6 levels of task complexity. The benchmark uses a rubric-based evaluation framework decomposing tasks into 17,741 verifiable criteria to assess instruction following and context consistency. Evaluation of leading models reveals severe limitations: Exact Match Rate falls below 5% overall and hits 0% on complex mixed-modality tasks, exposing fundamental gaps in current audio editing systems.