Entity · model

LLUMI

model · active · llumi-644f6bda · 1 event · first seen May 29, 2026

Aliases: LLUMI

Co-occurring entities

Reddit · Direct Preference Optimization (DPO) · GPT · supervised fine-tuning

More like this (12)

PLLuM · Luma AI · AILuminate · LiteLLM · Lumos-Nexus · TED-LIUM 3 · LUCID · PortLLM · LACUNA · LACUNA · LightMem · Mesh LLM

Recent events (1)

5 · arXiv · cs.CL · May 29, 2026

LLUMI: Fine-Tuning Open-Source LLMs for Mental Health Writing Assistance Using Reddit Community Feedback

LLUMI is a two-component system (a generation model and an improvement model) designed to provide mental health writing assistance using smaller open-source LLMs hosted in privacy-preserving, on-premise environments. The system uses Reddit community endorsement signals (upvotes and downvotes) to construct preference pairs for supervised fine-tuning (SFT) and Direct Preference Optimization (DPO) training, then further aligns outputs via human evaluation across five dimensions: readability, empathy, connection, actionability, and safety. Results show that LLUMI performs comparably to proprietary GPT-based models on linguistic and human evaluations, suggesting that community-derived preference signals can substitute for expensive expert labeling in sensitive domains.
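As a rough illustration of how vote-based preference pairs of the kind described above might be assembled, the following is a minimal sketch and not the paper's actual pipeline. The `Reply` record, the `build_preference_pairs` helper, the `min_gap` threshold, and the output keys are all hypothetical names chosen for this example. The summary does not state how LLUMI pairs or filters replies, so the rule used here (pair replies to the same post, and treat the higher-scored one as preferred) is an assumption.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Reply:
    post_id: str  # hypothetical identifier of the post being answered
    text: str     # reply body
    score: int    # net votes (upvotes minus downvotes)


def build_preference_pairs(replies, min_gap=5):
    """Pair replies to the same post; the higher-scored reply is 'chosen'.

    min_gap is an assumed threshold: pairs whose vote difference is
    smaller than this are skipped as too noisy to express a preference.
    """
    # Group replies by the post they respond to.
    by_post = {}
    for reply in replies:
        by_post.setdefault(reply.post_id, []).append(reply)

    pairs = []
    for post_id, group in by_post.items():
        for a, b in combinations(group, 2):
            if abs(a.score - b.score) < min_gap:
                continue
            chosen, rejected = (a, b) if a.score > b.score else (b, a)
            pairs.append(
                {
                    "prompt_id": post_id,
                    "chosen": chosen.text,
                    "rejected": rejected.text,
                }
            )
    return pairs


# Toy data, invented purely for illustration.
example = [
    Reply("p1", "reply A", 40),
    Reply("p1", "reply B", 2),
    Reply("p1", "reply C", 38),
    Reply("p2", "reply D", 10),
]
pairs = build_preference_pairs(example)
```

A vote-gap threshold is one simple way to avoid treating near-ties as preferences; a real pipeline would likely also need to account for factors such as post age and visibility, which this sketch ignores.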

Open Weights Progress · AI Safety Research · Reddit · LLUMI · Direct Preference Optimization (DPO) · +3 more