Almanac
technique

LLM-based content classification

techniqueactiveprovisionalllm-based-content-classification-fdc62980·1 events·first seen 15d ago

Aliases: LLM-based content classification

Co-occurring entities

More like this (12)

Recent events (1)

5Mistral Ai News·15d ago·source ↗

Mistral AI Releases Content Moderation API

Mistral AI has launched a dedicated content moderation API that classifies text inputs into 9 policy categories, including model-generated harms such as unqualified advice and PII. The API offers two endpoints—one for raw text and one for conversational content—and is natively multilingual across 11 languages. It is the same moderation system powering Mistral's Le Chat product, now made available to external developers. The classifier is LLM-based and designed to be customizable to application-specific safety standards.