Plain-language guides to the models, labs, and ideas shaping AI — synthesized from the corpus.
Reinforcement LearningConcept
Mixture of ExpertsConcept
Constitutional AIConcept
Chain-of-Thought ReasoningConcept
Diffusion ModelsConcept
Direct Preference Optimization (DPO)Concept
knowledge distillationConcept
LLM-as-a-JudgeConcept
GRPOConcept
mechanistic interpretabilityConcept
MambaConcept
LoRAConcept
PPOConcept
prompt injectionConcept
Model Context ProtocolConcept
Reinforcement Learning from Human FeedbackConcept
Reinforcement Learning with Verifiable RewardsConcept
Retrieval-Augmented GenerationConcept
scalable oversightConcept
supervised fine-tuningConcept
speculative decodingConcept
Vision-Language ModelsConcept