Finance agents stall near ~50% on hard tasks. How much is the data layer?
Frontier finance agents stall near ~50% on hard, multi-step tasks. We held the model fixed and toggled a SEC-data MCP — the efficiency gap is the consistent signal, and you can watch a real run and click every number back to the 10-K.