Almanac
dataset

CCPoetry-49K

dataset · active · provisional · ccpoetry-49k-6532e463 · 1 event · first seen 6d ago

Aliases: CCPoetry-49K

Recent events (1)

3 · arXiv · cs.CL · 6d ago

PoetryQwen: LoRA-fine-tuned Qwen2.5-14B for classical Chinese poetry understanding with new 49K dataset

Researchers introduce CCPoetry-49K, a 49,404-pair instruction dataset for classical Chinese poetry appreciation, split into three subtasks: term interpretation, semantic interpretation, and emotional inference. They fine-tune Qwen2.5-14B with LoRA to produce PoetryQwen, which scores 0.757 on the CCL25-Eval Task 5 benchmark against a baseline of 0.690, a relative improvement of 9.7%. The work addresses a gap in domain-specific LLM adaptation for classical Chinese literary tasks.
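
The 9.7% figure is consistent with a relative gain over the baseline rather than an absolute difference in score. A minimal sketch of that arithmetic, using only the two scores quoted above (the helper name `relative_improvement` is hypothetical, not from the paper):

```python
def relative_improvement(new: float, baseline: float) -> float:
    """Return the gain of `new` over `baseline` as a fraction of `baseline`."""
    if baseline == 0:
        raise ValueError("baseline must be non-zero")
    return (new - baseline) / baseline


# Scores reported in the summary for CCL25-Eval Task 5.
baseline_score = 0.690
poetryqwen_score = 0.757

absolute_gain = poetryqwen_score - baseline_score
relative_gain = relative_improvement(poetryqwen_score, baseline_score)

print(f"absolute gain: {absolute_gain:.3f}")  # absolute gain: 0.067
print(f"relative gain: {relative_gain:.1%}")  # relative gain: 9.7%
```

The absolute gain is 0.067 points, so the 9.7% only matches when the difference is divided by the baseline score.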