Entity · benchmark

BFCLv3

benchmarkactivebfclv3-826296f9·1 events·first seen May 19, 2026

Aliases: BFCLv3

Co-occurring entities

VitaBench MCP-Atlas Reinforcement Learning Qwen3 EnvFactory τ²-Bench

More like this (12)

BFCL-V3 BFCL BFCL Multi-Turn InfLLMv2 3LM AlphaFold3 RLVR RL² NVFP4 FMLM+LC-BC LFM2-8B-A1B

Recent events (1)

7arXiv · cs.CL·May 19, 2026·source ↗

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

EnvFactory is a fully automated framework for training tool-use LLM agents via Agentic Reinforcement Learning, addressing two key bottlenecks: scalable execution environments and realistic multi-turn training data. It autonomously constructs stateful, executable tool environments from authentic resources and synthesizes natural trajectories with implicit human intents via topology-aware sampling. Using only 85 verified environments across 7 domains, it generates 2,575 SFT and RL trajectories and improves Qwen3-series models by up to +15% on BFCLv3, +8.6% on MCP-Atlas, and +6% on conversational benchmarks, outperforming prior approaches that use 5x more environments.

Training Infrastructure Evaluation and Benchmarking VitaBench MCP-Atlas BFCLv3 +6 more