organization

THUDM

organizationactiveprovisionalthudm-2a1ac2ff·1 events·first seen 11h ago

Aliases: THUDM

Co-occurring entities

slime

More like this (12)

HKUDS TREAD UMAP MMMU HM3D u-muP DAAM IHUBERT HMMT 2026 LLUMI MMLU MMDiT

Recent events (1)

5Github Trending·11h ago·source ↗

THUDM releases slime: RL scaling post-training framework for LLMs

THUDM (Tsinghua University's Knowledge Engineering Group) has released slime, an open-source Python framework for LLM post-training via reinforcement learning scaling. The repository has accumulated 6,548 stars with 195 added in a single day, indicating significant community interest. RL-based post-training frameworks are a key area of active development following the success of techniques like GRPO and PPO in improving reasoning capabilities.

Agent and Tool Ecosystem Alignment and RLHF THUDM slime