Entity · technique

test-time training

techniqueactivetest-time-training-f5ac4944·3 events·first seen May 27, 2026

Aliases: test-time training

Co-occurring entities

Online Neural Space Time Memory for Dynamic Novel View Synthesis Vision-Language-Action models NVIDIA RoboTTT RoboTTT LawBench SIA (Self Improving AI)harness update Feedback-Agent

More like this (12)

Self-Guided Test-Time Training Test-Time Finetuning (TTFT)test-time compute test-time compute scaling self-training temporally ordered pre-training Test-time Compute Search consistency training Verifier-in-the-Loop Training (ViL)distributed training Selective Ground Truth Token Training Deployment Simulation

Recent events (3)

4arXiv · cs.LG·Jul 17, 2026·source ↗

Online Neural Space-Time Memory enables real-time dynamic novel view synthesis from streaming video

A new arXiv preprint proposes a neural memory architecture for real-time novel view synthesis from multi-view streaming video of dynamic scenes. The method decouples memory update frequency from memory application frequency, using periodic gradient-based updates with per-frame cross-view attention to handle deformations. Two mechanisms — an auxiliary Memory Loss and a Memory Caching strategy — prevent catastrophic forgetting over long contexts. The approach achieves state-of-the-art performance on dynamic human motion scenes with minute-scale online memorization at real-time speeds.

Multimodal Progress test-time training Online Neural Space Time Memory for Dynamic Novel View Synthesis

7arXiv · cs.LG·Jul 17, 2026·source ↗

RoboTTT scales robot policy context to 8K timesteps via Test-Time Training, enabling one-shot imitation and long-horizon tasks

NVIDIA researchers introduce RoboTTT, a robot foundation model training recipe that extends visuomotor context to 8,000 timesteps — three orders of magnitude beyond current state-of-the-art — without increasing inference latency. The approach integrates Test-Time Training into Vision-Language-Action policies, using fast weights (parameters updated by gradient descent during inference) to compress long histories into weight space. On real-robot manipulation tasks, RoboTTT achieves 87% performance improvement over single-step baselines and is the first system to fully complete a five-minute, ten-stage assembly task. The work identifies context length as a new scaling axis for robot foundation models, with 8K-context pretraining outperforming 1K-context by 62%.

Long Context Evolution Frontier Model Releases Vision-Language-Action models NVIDIA test-time training +3 more

7arXiv · cs.CL·May 27, 2026·source ↗

SIA: Self-Improving AI via Joint Harness and Weight Updates

SIA proposes a self-improving loop in which a Feedback-Agent simultaneously updates both the scaffold (harness) and model weights of a task-specific agent, unifying two previously disjoint research lines: meta-agent scaffold rewriting and test-time training. The system is evaluated on three diverse benchmarks—Chinese legal charge classification, GPU kernel optimization, and single-cell RNA denoising—achieving gains of 56.6%, 91.9% runtime reduction, and 502% respectively over baselines. The paper argues that harness updates shape agentic behavior while weight updates instill domain intuition that prompting alone cannot provide, and that combining both levers consistently outperforms either alone.

Frontier Model Releases Evaluation and Benchmarking LawBench SIA (Self Improving AI)harness update +4 more