Alvin Lang
Feb 05, 2026 18:35
Claude Opus 4.6 scores 23 percentage points higher on financial analysis benchmarks, adds Excel and PowerPoint integrations for investment banking workflows.
Anthropic dropped Claude Opus 4.6 on February 5, positioning its newest AI model as a direct play for financial services workflows. The headline number: a 23 percentage point improvement over Claude Sonnet 4.5 on the company’s internal Real-World Finance benchmark, which tests roughly 50 investment and financial analysis use cases.
The model now scores 60.7% on Vals AI’s Finance Agent benchmark, which evaluates performance on SEC filing research, a 5.47% jump from Opus 4.5. It also hits 76% on TaxEval, another external benchmark testing tax-related reasoning.
Where Analysts Actually Work
The real story here is not just benchmark scores. Anthropic is pushing Claude directly into the tools finance professionals use daily: Excel and PowerPoint.
Claude in Excel now handles pivot tables, chart modifications, conditional formatting, and what Anthropic calls “finance-grade formatting.” The integration supports multi-file drag-and-drop and auto-compaction for long conversations, addressing the copy-paste hell that plagues anyone building complex financial models across multiple tabs.
Claude in PowerPoint launches in beta for Max, Team, and Enterprise users. The AI reads existing layouts, fonts, and master slides before generating new content, theoretically letting analysts build client-ready decks without starting from scratch.
The Productivity Claim
Anthropic’s marketing materials show side-by-side comparisons of financial due diligence outputs, the kind of acquisition analysis work they say “would typically take a senior analyst two to three weeks to complete.” First-pass quality has improved noticeably, according to partners already testing the system.
“Creating financial PowerPoints that used to take hours now takes minutes,” said Aabhas Sharma, CTO at Hebbia. Nico Christie, co-founder of Shortcut AI, called it “a watershed moment for spreadsheet agents.”
Lloyd Hilton, Head of Hg Catalyst, noted the model handles “unstructured data and intelligently working with minimal prompting to meaningfully automate complex analysis.”
What’s New Under the Hood
Opus 4.6 ships with a 1-million-token context window, letting it process massive datasets in a single session. The model also improved on the BrowseComp and DeepSearchQA benchmarks, which test information extraction from large, unstructured document sets, a critical capability for anyone doing earnings call analysis or regulatory filing reviews.
Cowork, Anthropic’s desktop app feature, now lets finance teams kick off multiple analyses simultaneously while steering Claude’s approach on each deliverable. A corporate finance plugin provides pre-built workflows for journal entries, variance analyses, and reconciliation.
The Fine Print
Anthropic is not claiming full autonomy. “Users should continue to review Claude’s outputs to ensure it meets their specifications; particularly for high-stakes work, human judgment remains essential,” the company noted in its release.
For crypto and fintech firms evaluating AI integration, Opus 4.6 represents the clearest signal yet that foundation model companies are moving beyond chatbot interfaces toward embedded enterprise tools. The question now: how quickly will competing models from OpenAI and Google match these finance-specific capabilities?
Claude Opus 4.6 is available now on all paid Claude plans. The PowerPoint integration remains in research preview for higher-tier subscribers.
Image source: Shutterstock






