Entity · benchmark

DeepConsult

benchmarkactivedeepconsult-5c19723a·1 events·first seen May 26, 2026

Aliases: DeepConsult

Co-occurring entities

DeepSeek V4 cognitive-graph DeepResearch Bench Qwen3.6-27B VeriTrace

More like this (12)

DeepEval Deep Interaction DeepSeek Reasonix DeepWideSearch DeepSeek-V4-Pro Preview DeepSeek API Deep Research DeepAgents DeepProbLog DeepSeek-Math-V2 DeepSeek-V3.1-Base DeepSeek-V4-Flash Preview

Recent events (1)

6arXiv · cs.AI·May 26, 2026·source ↗

VeriTrace: Cognitive-Graph Framework with Explicit Regulatory Loops for Deep Research Agents

VeriTrace introduces a cognitive-graph framework for deep research agents that replaces implicit LLM reasoning over intermediate representations with three explicit regulatory loops: interpretive update, deviation feedback, and schema revision. The system addresses contamination and error propagation in evolving mental models during complex multi-step research tasks. Using Qwen3.5-27B backbones, VeriTrace improves over the strongest matched baseline by 4.22 pp on DeepResearch Bench Insight and 5.9 pp Overall win rate on DeepConsult. With Config-DeepSeek, it achieves the strongest reproducible open-source result on DeepResearch Bench.

Frontier Model Releases Evaluation and Benchmarking DeepSeek V4 cognitive-graph DeepResearch Bench +4 more