4GitHub Trending (AI/LLM filtered)·2d ago

Hyper-Extract: LLM-powered extraction of graphs, hypergraphs, and spatio-temporal structures from text

Hyper-Extract is a Python library that uses LLMs to transform unstructured text into structured knowledge representations including graphs, hypergraphs, and spatio-temporal extractions via a single command interface. The project is trending on GitHub with 1,723 stars and 124 new stars today. It targets a practical gap in the LLM tooling ecosystem for structured knowledge extraction beyond simple key-value or flat-schema outputs.

Agent and Tool Ecosystem Hyper-Extract yifanfeng97

Related guides (1)

Agent and Tool EcosystemTopic guide

Agent and Tool Ecosystem: How AI Is Learning to Act, Not Just Answer

Read asBeginner In-depth

Related events (8)

3Github Trending·1mo ago·source ↗

vLLM: High-Throughput LLM Inference and Serving Engine Trending on GitHub

vLLM is an open-source Python library providing high-throughput and memory-efficient inference and serving for large language models. The project has accumulated over 80,500 GitHub stars with 98 new stars today, indicating continued strong community interest. It is a widely adopted inference backend in the AI/ML ecosystem, supporting PagedAttention and various optimization techniques for LLM deployment.

Inference Economics Agent and Tool Ecosystem vllm-project vLLM

3Github Trending·35h ago·source ↗

OpenKB: Open-source LLM knowledge base library gains traction on GitHub

VectifyAI has released OpenKB, an open-source Python library for building LLM-powered knowledge bases. The repository is trending on GitHub with 2,389 total stars and 208 new stars in a single day, suggesting meaningful community interest. No detailed technical description is available from the source snippet.

Agent and Tool Ecosystem OpenKB VectifyAI

6Berkeley Ai Research (Bair) Blog·1mo ago·source ↗

SPEX and ProxySPEX: Scalable Interaction Discovery for LLM Interpretability

Researchers from BAIR introduce SPEX (Spectral Explainer) and ProxySPEX, algorithms for identifying influential feature, data, and model-component interactions in LLMs at scale. The approach exploits sparsity, low-degreeness, and hierarchy properties to reframe interaction discovery as a sparse recovery problem using tools from signal processing and coding theory. ProxySPEX achieves comparable performance to SPEX with roughly 10x fewer ablations by leveraging hierarchical structure. The methods are evaluated on feature attribution (sentiment analysis), data attribution, and mechanistic interpretability tasks, outperforming marginal methods like LIME at long context lengths.

Long Context Evolution Evaluation and Benchmarking GPT-4o mini Faith-Shap LIME +5 more

4Hugging Face Blog·1mo ago·source ↗

Open-Source Text Generation & LLM Ecosystem at Hugging Face

Hugging Face published a blog post surveying the open-source LLM ecosystem as of mid-2023, covering text generation models, tooling, and deployment patterns available on the platform. The post highlights the breadth of open-weight models and associated infrastructure for inference and fine-tuning. It serves as a reference overview of the state of open-source LLMs at that point in time.

Open Weights Progress Inference Economics Hugging Face +1 more

5arXiv · cs.CL·1mo ago·source ↗

Text Analytics Evaluation Framework: Benchmarking LLMs on Social Media NLP Tasks

Researchers introduce a 470-question evaluation framework to assess LLM performance on aggregated social media text, applied to Twitter datasets across sentiment analysis, hate speech detection, and emotion recognition. Results show performance degrades substantially as input scale exceeds 500 instances, particularly for open-weights models on numerical tasks. Multi-label and target-dependent scenarios also show notable performance drops, and task complexity progressively erodes accuracy from basic semantic identification to comparison and counting operations. The findings point to architectural bottlenecks in current LLMs for rigorous quantitative analysis over large text collections.

Long Context Evolution Evaluation and Benchmarking Emotion Recognition Text Analytics Evaluation Framework X (Twitter)+3 more

4Github Trending·11h ago·source ↗

LLM Wiki: desktop app that builds persistent knowledge bases from documents using LLMs

LLM Wiki is an open-source cross-platform desktop application that uses LLMs to incrementally build and maintain a persistent, interlinked wiki from user documents rather than performing retrieval-augmented generation on each query. The project has accumulated 12,217 GitHub stars with 111 added today, suggesting notable community traction. It represents an alternative architectural pattern to standard RAG pipelines.

Enterprise Deployment Patterns Agent and Tool Ecosystem LLM Wiki nashsu

5arXiv · cs.CL·12d ago·source ↗

EmbedFilter: Using the unembedding matrix to suppress high-frequency token noise in LLM text embeddings

Researchers identify that LLM text embeddings over-express high-frequency but semantically uninformative tokens when projected onto vocabulary space, degrading embedding quality. They introduce EmbedFilter, a simple linear transformation that filters out the subspace of the unembedding matrix responsible for writing these tokens into embedding space. The method improves zero-shot performance on text embedding benchmarks across multiple LLM backbones and yields a byproduct of dimensionality reduction without quality loss. Code is publicly released.

Evaluation and Benchmarking Inference Economics Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings EmbedFilter

4Github Trending·24d ago·source ↗

Langfuse: Open Source LLM Engineering Platform Trending on GitHub

Langfuse is an open-source LLM engineering platform providing observability, metrics, evaluations, prompt management, and dataset tooling. It integrates with OpenTelemetry, LangChain, OpenAI SDK, and LiteLLM. The project has accumulated 28,075 GitHub stars with 89 new stars today, indicating sustained community traction. Backed by Y Combinator (W23), it represents a notable entry in the LLM ops/tooling ecosystem.

Evaluation and Benchmarking Agent and Tool Ecosystem OpenTelemetry Langfuse Y Combinator +3 more