technique
FeedForward submodule
techniqueactiveprovisional
feedforward-submodule-e0d67c5f·1 events·first seen 15d agoAliases: FeedForward submodule
Co-occurring entities
More like this (12)
feed-forward transformerFeedforward Neural NetworksUltraFeedbackFeedback-AgentAttention submoduleEmbedFilterFFR: Forward-Forward Learning for RegressionReinforcement Learning from Rich Feedback with Distributional DAggerReinforcement Learning from Human FeedbackError Feedback (EF)Extended Kalman FilterChain-of-Thought Fine-Tuning
Recent events (1)
SubFit: Submodule-Level Fitted Residual Replacement for LLM Compression
SubFit introduces a post-training LLM compression method that operates at the submodule level (Attention and FeedForward separately) rather than full layers, and selects components non-contiguously. The approach replaces removed submodules with lightweight fitted residual bypasses calibrated on small data. Evaluated across ten LLMs at sparsity levels from 12.5% to 37.5%, SubFit retains 84.6% of dense downstream accuracy at 25% sparsity versus 81.6% for the strongest baseline, while reducing perplexity degradation from 4.34x to 2.42x and delivering measurable inference speedup and KV-cache savings.