Entity · benchmark

TextQuests

benchmark · active · textquests-f4d57013 · 1 event · first seen May 19, 2026

Aliases: TextQuests

More like this (12)

text games · TextReg · Qwen-TTS · Xtext · TextCraft · TextGrad · ConTextual · TEQST · SimpleQA · fastText · text-to-3D · Text Generation Inference

Recent events (1)

Hugging Face Blog · May 19, 2026

TextQuests: How Good are LLMs at Text-Based Video Games?

A Hugging Face blog post introduces TextQuests, an evaluation framework that tests LLMs on text-based video games as a proxy for interactive reasoning, planning, and language understanding. The benchmark assesses how well models can navigate, solve puzzles, and maintain state across multi-turn interactions in classic interactive fiction environments. This type of evaluation targets agentic capabilities including long-horizon planning and grounded language understanding.
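The multi-turn setup described above can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: `ToyGame`, `run_episode`, and `scripted_policy` are hypothetical names, the two-step game is invented, and none of this is the TextQuests API or its scoring method. The policy is a scripted stand-in for where an LLM call would go.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class ToyGame:
    """Hypothetical toy game (not part of TextQuests): take the key, then open the door."""
    has_key: bool = False
    done: bool = False

    def reset(self) -> str:
        # Restore the initial state and return the opening observation.
        self.has_key = False
        self.done = False
        return "You are in a cell. A key lies on the floor. The door is locked."

    def step(self, action: str) -> Tuple[str, int]:
        # Return (observation, reward) for one player command.
        action = action.strip().lower()
        if action == "take key" and not self.has_key:
            self.has_key = True
            return "You pick up the key.", 1
        if action == "open door" and self.has_key:
            self.done = True
            return "The door swings open. You are free.", 1
        return "Nothing happens.", 0


def run_episode(game: ToyGame,
                policy: Callable[[List[str]], str],
                max_turns: int = 10) -> Tuple[int, List[str]]:
    """Play one episode; the policy sees the full transcript on every turn."""
    history = [game.reset()]
    score = 0
    for _ in range(max_turns):
        action = policy(history)
        observation, reward = game.step(action)
        history += [f"> {action}", observation]
        score += reward
        if game.done:
            break
    return score, history


def scripted_policy(history: List[str]) -> str:
    # Stand-in for an LLM call: picks the next command from the transcript so far.
    return "open door" if "You pick up the key." in history else "take key"


if __name__ == "__main__":
    final_score, transcript = run_episode(ToyGame(), scripted_policy)
    print("\n".join(transcript))
    print("score:", final_score)
```

Passing the whole transcript to the policy on each turn is what makes state tracking part of the test: an agent that forgets it already holds the key cannot choose the right next command.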

Evaluation and Benchmarking · Agent and Tool Ecosystem · TextQuests · Hugging Face