Qwen3-4B-Base
qwen3-4b-base-1f13b116·2 events·first seen 26d agoAliases: Qwen3-4B-Base, Qwen3.5-4B-Base
Co-occurring entities
More like this (12)
Recent events (2)
Qwen releases Qwen3.5-4B-Base multimodal model on Hugging Face
Qwen has released Qwen3.5-4B-Base, a 4-billion parameter base model supporting image-text-to-text tasks, published on Hugging Face. The model is tagged as conversational and endpoints-compatible, using the safetensors format. With over 207,000 downloads, it represents a new entry in the Qwen3.5 model family with multimodal capabilities at a small parameter count.
RELEX: Extrapolating LLM RLVR Training via Rank-1 Parameter Trajectories
This paper demonstrates that RLVR weight update trajectories are extremely low-rank and near-linearly predictable, with a rank-1 approximation capturing most downstream performance gains. The authors propose RELEX, a compute-efficient method that observes a short training window, estimates the rank-1 subspace, and extrapolates future checkpoints via linear regression—requiring no additional training. Evaluated on Qwen2.5-Math-1.5B, Qwen3-4B-Base, and Qwen3-8B-Base, RELEX matches or exceeds full RLVR performance using as few as 15% of training steps, and can extrapolate up to 10–20× beyond the observed prefix. The authors attribute the method's effectiveness to a denoising effect from rank-1 projection that discards stochastic optimization noise.