Entity · technique

NC-FFN

techniqueactiveprovisionalnc-ffn-735a84e3·1 events·first seen 2d ago

Aliases: NC-FFN

Co-occurring entities

LAMBADA OpenWebText Explicit Fuzzy Logic in the Feed-Forward Layer: Self-Forgetting Quantifiers Discover Legible Grammatical-Licensing Detectors

More like this (12)

NVFP4 GFNet NF4 NF-CoT NNCF (Neural Network Compression Framework)NDCG MedNLI NeRF TabPFN NF4 (NormalFloat4)FM-CGM FFHQ

Recent events (1)

6arXiv · cs.CL·2d ago·source ↗

Negation-capable fuzzy logic FFN replacement yields interpretable grammatical licensing detectors in transformers

Researchers propose replacing the standard transformer feed-forward sublayer with explicit fuzzy set operations (intersection and set-difference), creating a negation-capable FFN (NC-FFN) whose hidden units carry interpretable logical form. At 125M scale on OpenWebText, NC-FFN matches GELU baseline perplexity while remaining legible by construction. Adding soft sequence quantifiers with learned forgetting rates recovers grammatical licensing deficits and produces units that detectably fire on grammatical licensors (comparatives, passive participles, negative-polarity items) without dictionary learning. The work advances mechanistic interpretability by providing a parameter-neutral architecture whose computations are readable as grammatical mechanisms.

Evaluation and Benchmarking LAMBADA OpenWebText Explicit Fuzzy Logic in the Feed-Forward Layer: Self-Forgetting Quantifiers Discover Legible Grammatical-Licensing Detectors +1 more