Stanford releases open-source SEC filings dataset for LLM training

Stanford researchers have released SEFD, a layout-faithful, open-source dataset of 4.4 million SEC filings reconstructed into MultiMarkdown for pretraining financial LLMs at scale.

