Almanac
organization

LCO-Embedding

organizationactiveprovisionallco-embedding-e5408c36·1 events·first seen 25h ago

Aliases: LCO-Embedding

Co-occurring entities

More like this (12)

Recent events (1)

5arXiv · cs.CL·25h ago·source ↗

RL-trained LLMs learn retriever-specific query formulation strategies for RAG

A new arXiv paper presents the first systematic study of using reinforcement learning to teach LLMs to adapt query formulation strategies to different retrieval backends. The authors find that different retrievers have surprisingly distinct optimal query styles (e.g., descriptive vs. question-like), making cross-retriever strategy transfer ineffective. They introduce a branching-based rollout technique to stabilize training over multi-step retrieval trajectories and show gains from retriever-specific human guidance and model scaling.