Almanac
person

jundot

personactiveprovisionaljundot-49f6ad13·1 events·first seen 12d ago

Aliases: jundot

Co-occurring entities

More like this (12)

Recent events (1)

5Github Trending·12d ago·source ↗

omlx: LLM inference server with continuous batching and SSD caching for Apple Silicon

omlx is an open-source Python project providing an LLM inference server optimized for Apple Silicon, featuring continuous batching and SSD caching managed via a macOS menu bar interface. The project has accumulated nearly 16,000 GitHub stars with strong daily momentum. It targets local inference on Apple hardware, a growing niche as consumer-grade silicon becomes increasingly capable for running open-weights models.