MinerU
product · active · provisional
mineru-ef62c19a · 2 events · first seen 18d ago
Aliases: MinerU
Recent events (2)
MinerU: Document-to-LLM-Ready Markdown/JSON Conversion Tool
MinerU is an open-source Python tool by OpenDataLab that converts complex documents (PDFs, Office files) into structured Markdown or JSON optimized for LLM and agentic workflows. The repository has accumulated 65,610 GitHub stars, with 180 new stars today, indicating sustained community traction. It targets document parsing, a common preprocessing bottleneck in RAG and agent pipelines.
Yuxi: Multi-tenant agent harness integrating LightRAG, knowledge graphs, and MCP
Yuxi is an open-source multi-tenant agent harness platform that combines a LightRAG knowledge base with knowledge graph management. Built on LangChain, Vue, and FastAPI, it supports DeepAgents, MinerU PDF parsing, Neo4j, and the Model Context Protocol (MCP). The project has accumulated 5,451 GitHub stars and is gaining them at a modest daily rate (+47).