technique
LLM-based content classification
technique · active · provisional
llm-based-content-classification-fdc62980 · 1 event · first seen 15d ago
Aliases: LLM-based content classification
Recent events (1)
Mistral AI Releases Content Moderation API
Mistral AI has launched a dedicated content moderation API that classifies text inputs into nine policy categories, including model-generated harms such as unqualified advice and exposure of PII. The API offers two endpoints, one for raw text and one for conversational content, and is natively multilingual across 11 languages. It is the same moderation system that powers Mistral's Le Chat product, now available to external developers. The classifier is LLM-based and is designed so that applications can adapt it to their own safety standards.
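To make the two-endpoint design and the per-application customization concrete, here is a minimal sketch in Python. It only builds request bodies and applies thresholds locally, with no network call. The endpoint paths, model name, payload field names, and category labels are assumptions based on Mistral's public API conventions and should be checked against the official reference. The scores are made up for illustration, and `flagged_categories` is a hypothetical helper, not part of any SDK.

```python
from typing import Dict, List

# Assumed values: verify against Mistral's API reference before use.
RAW_TEXT_ENDPOINT = "https://api.mistral.ai/v1/moderations"
CONVERSATION_ENDPOINT = "https://api.mistral.ai/v1/chat/moderations"
MODEL = "mistral-moderation-latest"


def raw_text_payload(texts: List[str]) -> dict:
    """Request body for the raw-text endpoint (assumed shape)."""
    return {"model": MODEL, "input": texts}


def conversation_payload(messages: List[Dict[str, str]]) -> dict:
    """Request body for the conversational endpoint (assumed shape)."""
    return {"model": MODEL, "input": messages}


def flagged_categories(
    scores: Dict[str, float],
    thresholds: Dict[str, float],
    default: float = 0.5,
) -> List[str]:
    """Hypothetical helper: return the categories whose score meets the
    application's own threshold, falling back to `default` when the
    application has not set one for that category."""
    return sorted(
        category
        for category, score in scores.items()
        if score >= thresholds.get(category, default)
    )


# Illustrative per-category scores, as a classifier might return them.
example_scores = {"pii": 0.62, "health": 0.31, "violence_and_threats": 0.02}

# A medical-advice app might lower the bar for "health" ...
strict = flagged_categories(example_scores, {"health": 0.3})
# ... while an internal CRM tool might tolerate more PII.
lenient = flagged_categories(example_scores, {"pii": 0.8})
```

Keeping the thresholds on the application side is one way to read "customizable to application-specific safety standards": the classifier returns per-category scores, and each deployment decides which of them should block, warn, or pass.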