NVIDIA Nemotron-3-Nano-30B-A3B

model · active · provisional · nvidia-nemotron-3-nano-30b-a3b-0c6e8958 · 1 event · first seen 47h ago

Aliases: NVIDIA Nemotron-3-Nano-30B-A3B

Recent events (1)

5 · arXiv · cs.LG · 47h ago

Probe-and-Refine Tuning improves coding agent performance via iterative repository guidance refinement

A new arXiv paper introduces probe-and-refine tuning, a procedure that uses synthetic bug-fix probes to iteratively improve the AGENTS.md repository guidance files read by LLM-based coding agents, without requiring an agent loop during tuning. Evaluated on SWE-bench Verified with Qwen3.5-35B-A3B, the method achieves a 33.0% mean resolve rate, versus 28.3% for a static knowledge base baseline and 25.5% for an unguided baseline. The paper attributes the improvement to coverage gains: refined guidance helps agents locate the correct files rather than write better patches. A step-budget experiment further shows that agents need guidance to make productive use of larger compute budgets.
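
To put the reported resolve rates in perspective, the gaps can be worked out as absolute and relative gains. The sketch below only does arithmetic on the three figures quoted in the summary; the dictionary and the function name `gain_over` are illustrative and do not come from the paper.

```python
# Mean resolve rates (percent) on SWE-bench Verified, as quoted in the summary.
# The labels are illustrative names, not identifiers from the paper.
RESOLVE_RATES = {
    "probe-and-refine": 33.0,
    "static knowledge base": 28.3,
    "unguided": 25.5,
}


def gain_over(baseline: str, method: str = "probe-and-refine") -> tuple[float, float]:
    """Return (absolute gain in percentage points, relative gain in percent)."""
    base = RESOLVE_RATES[baseline]
    new = RESOLVE_RATES[method]
    absolute = new - base
    relative = 100.0 * absolute / base
    # Round to one decimal to match the precision of the quoted figures.
    return round(absolute, 1), round(relative, 1)


if __name__ == "__main__":
    for baseline in ("static knowledge base", "unguided"):
        absolute, relative = gain_over(baseline)
        print(f"vs {baseline}: +{absolute} points ({relative}% relative)")
```

On these numbers, the method gains about 4.7 points over the static knowledge base (roughly 16.6% relative) and about 7.5 points over the unguided baseline (roughly 29.4% relative).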