4GitHub Trending (AI/LLM filtered)·1mo ago

whichllm: Hardware-Aware Local LLM Recommender Tool

whichllm is an open-source Python tool that recommends local LLMs based on actual hardware compatibility and recency-weighted benchmark performance rather than parameter count. It operates as a single command to identify which models will run and perform best on a user's specific machine. The project gained 209 stars in a single day, reaching 1,178 total, indicating notable community traction.

Open Weights Progress Inference Economics Andyyyy64 whichllm

Related guides (2)

Open Weights ProgressTopic guide

Open Weights Progress: How Freely Available AI Models Caught Up to the Frontier

Read asBeginner In-depth

Inference EconomicsTopic guide

Inference Economics: The Cost of Running AI in Production

Read asBeginner In-depth

Related events (8)

3Github Trending·1mo ago·source ↗

vLLM: High-Throughput LLM Inference and Serving Engine Trending on GitHub

vLLM is an open-source Python library providing high-throughput and memory-efficient inference and serving for large language models. The project has accumulated over 80,500 GitHub stars with 98 new stars today, indicating continued strong community interest. It is a widely adopted inference backend in the AI/ML ecosystem, supporting PagedAttention and various optimization techniques for LLM deployment.

Inference Economics Agent and Tool Ecosystem vllm-project vLLM

4Github Trending·10h ago·source ↗

LLM Wiki: desktop app that builds persistent knowledge bases from documents using LLMs

LLM Wiki is an open-source cross-platform desktop application that uses LLMs to incrementally build and maintain a persistent, interlinked wiki from user documents rather than performing retrieval-augmented generation on each query. The project has accumulated 12,217 GitHub stars with 111 added today, suggesting notable community traction. It represents an alternative architectural pattern to standard RAG pipelines.

Enterprise Deployment Patterns Agent and Tool Ecosystem LLM Wiki nashsu

3Hacker News·13d ago·source ↗

Lathe: open-source tool for using LLMs as domain-learning aids rather than answer machines

Lathe is an open-source project shared on Hacker News that positions LLMs as active learning companions for acquiring new domain knowledge, rather than tools to bypass the learning process. The project received 205 upvotes and 41 comments, indicating meaningful community interest. It represents a pedagogical framing of LLM use that contrasts with typical productivity-focused applications.

Agent and Tool Ecosystem devenjarvis Lathe

4Simon Willison'S Weblog·1mo ago·source ↗

LLM 0.32a2 Released

Simon Willison has released version 0.32a2 of the LLM command-line tool and Python library. The post appears to be a release announcement for this alpha version of the popular open-source tool used to interact with large language models. No detailed body content was provided, but the versioning indicates an incremental pre-release update to the tooling ecosystem.

Agent and Tool Ecosystem LLM Simon Willison

3Simon Willison'S Weblog·11d ago·source ↗

LLM CLI tool version 0.32a3 released

Simon Willison released version 0.32a3 of the LLM command-line tool, an alpha pre-release. The post is a brief release note with minimal body content. LLM is a widely-used open-source CLI and Python library for interacting with language models from multiple providers.

Agent and Tool Ecosystem LLM Simon Willison

3Github Trending·8d ago·source ↗

mlx-lm: LLM inference library for Apple MLX framework trending on GitHub

mlx-lm is an open-source Python library for running LLMs using Apple's MLX framework, designed for Apple Silicon hardware. The repository has accumulated 5,817 stars with 43 new stars today, indicating steady community interest. It represents a key piece of the Apple-native ML inference ecosystem.

Inference Economics mlx-lm Apple MLX

5Github Trending·16d ago·source ↗

omlx: LLM inference server with continuous batching and SSD caching for Apple Silicon

omlx is an open-source Python project providing an LLM inference server optimized for Apple Silicon, featuring continuous batching and SSD caching managed via a macOS menu bar interface. The project has accumulated nearly 16,000 GitHub stars with strong daily momentum. It targets local inference on Apple hardware, a growing niche as consumer-grade silicon becomes increasingly capable for running open-weights models.

Open Weights Progress Inference Economics omlx jundot Apple Silicon

4arXiv · cs.CL·19d ago·source ↗

Benchmarking Local LLMs for Confidential Translation Workflows

This paper evaluates locally runnable LLMs (via Ollama) for offline, privacy-constrained translation workflows targeting freelance translators and smaller language service providers. The authors expand their Reeve Foundation corpus to include German and Simplified Chinese, then benchmark local models across four language directions against commercial NMTs (DeepL, Baidu), a frontier LLM (GPT-5.2), and professional local NMT systems. Results show substantial performance variation by language direction and model size, with the best local LLMs matching or exceeding local NMT systems and the frontier LLM, though falling short of top commercial NMTs. The study supports the viability of local LLMs for confidentiality-sensitive translation use cases.

Evaluation and Benchmarking Open Weights Progress Ollama GPT-5.2 DeepL +8 more