New benchmark tests LLMs on financial statement verification accuracy
FinVerBench evaluates 15 LLMs on numerical consistency checks in SEC 10-K filings, introducing error taxonomy across arithmetic, linkage, and magnitude categories to measure AI readiness for financ...