Entity · technique

TextGrad

techniqueactivetextgrad-07771c81·2 events·first seen May 21, 2026

Aliases: TextGrad

Co-occurring entities

SkillOpt Trace2Skill Claude Code EvoSkill GEPA Codex GPT-5.5 REVOLVE Semantic Edit Regularization representational inefficiency Dual-Evidence Gradient Purification TextReg

More like this (12)

TextReg TextImage Augmentation fastText TextCraft text-to-3D Textual Gradient Optimization Xtext TextQuests text-to-image models text-to-video generation text-to-speech Text Generation Inference

Recent events (2)

7arXiv · cs.AI·May 25, 2026·source ↗

SkillOpt: Systematic Text-Space Optimizer for Self-Evolving Agent Skills

SkillOpt introduces a principled optimization framework for agent skills, treating the skill document as an external trainable state analogous to model weights. A separate optimizer model converts scored rollouts into bounded edits (add/delete/replace) on a skill document, accepting only edits that improve held-out validation scores. Evaluated across six benchmarks, seven target models, and three execution harnesses (direct chat, Codex, Claude Code), SkillOpt achieves best or tied performance on all 52 evaluated cells, lifting GPT-5.5 no-skill accuracy by up to +24.8 points inside the Codex agentic loop. Optimized skill artifacts also transfer across model scales and execution environments without further optimization.

Evaluation and Benchmarking Agent and Tool Ecosystem TextGrad SkillOpt Trace2Skill +6 more

6arXiv · cs.CL·May 21, 2026·source ↗

TextReg: Regularization Framework for Mitigating Prompt Distributional Overfitting in LLM Optimization

TextReg addresses a failure mode in iterative prompt optimization where LLM-rewritten prompts grow longer, accumulate narrow rules, and generalize poorly—termed prompt distributional overfitting. The authors formalize this via 'representational inefficiency,' a dual-factor measure decomposing prompt inefficiency into capacity cost and scope narrowness. TextReg applies a soft-penalty regularization framework using Dual-Evidence Gradient Purification, Semantic Edit Regularization, and Regularization-Guided Prompt Update. On reasoning benchmarks, it achieves up to +11.8% OOD accuracy over TextGrad and +16.5% over REVOLVE.

Evaluation and Benchmarking Agent and Tool Ecosystem TextGrad REVOLVE Semantic Edit Regularization +4 more