company

FrontisAI

companyactiveprovisionalfrontisai-7c12a11b·1 events·first seen 4h ago

Aliases: FrontisAI

Co-occurring entities

More like this (12)

Frontier AI Framework OpenAI Frontier ProducerAI FriendliAI crewAIInc OpenAI frontier models Reflection AI AssemblyAI iOfficeAI Arena AI Envision AI OpenAI Voice AI

Recent events (1)

5arXiv · cs.CL·4h ago·source ↗

EnterpriseClawBench: A benchmark for enterprise agents derived from real workplace sessions

Researchers introduce EnterpriseClawBench, an enterprise agent benchmark constructed from proprietary real-world workplace sessions, yielding 852 reproducible tasks with fixtures, prompts, role classes, skill subclasses, and semantic rubrics. Because the sessions contain internal enterprise content, the benchmark data is not publicly released, but the construction and evaluation protocol is the reusable contribution. The best evaluated configuration (Codex with GPT-5.5) achieves only 0.663, indicating substantial headroom. The paper argues enterprise agent evaluation must report harness-model combinations, artifact delivery, visual quality, cost, runtime, and skill-transfer behavior rather than collapsing to a single score.

Evaluation and Benchmarking Enterprise Deployment Patterns FrontisAI EnterpriseClawBench Codex +2 more