technique
zero-shot evaluation
techniqueactive
zero-shot-evaluation-9f7ae28e·1 events·first seen 28d agoAliases: zero-shot evaluation
Co-occurring entities
More like this (12)
zero-shot learningfew-shot learningzero-shot systematizerOne-Shot Imitation LearningUnsupervised Pre-trainingZEDA (Zero-Expert Self-Distillation Adaptation)third-party AI evaluationsunsupervised learningLanguage Models are Few-Shot LearnersEvolveNav: Proactive Preflection and Self-Evolving Memory for Zero-Shot Object Goal NavigationZeroGPUAI-assisted human evaluation
Recent events (1)
Very Large Language Models and How to Evaluate Them
This Hugging Face blog post from October 2022 discusses approaches to zero-shot evaluation of large language models hosted on the Hub. It covers methodologies for benchmarking LLMs without task-specific fine-tuning, addressing the practical challenges of evaluating very large models at scale. The post situates evaluation tooling within the broader ecosystem of open model hosting and assessment.