Home » Accuracy methodology

Accuracy methodology

How we measure Bank2XL's extraction quality, and the real numbers.

Short version: on well-structured bank statements (clear scans or text-layer PDFs from common retail banks) Bank2XL reconciles cleanly 98%+ of the time. Edge cases like degraded scans, mid-page rotation, or unusual multi-currency layouts come back with a warn or mismatch badge so you can review the data before trusting it. The reconciliation check is designed to surface mis-extractions. We can't guarantee it catches every one, but we always show the deltas instead of hiding them.

Why we publish a methodology page

Most bank statement converters claim one big number ("99% accuracy", "field-leading precision") with no methodology behind it. We'd rather show the work: the corpus we test on, the verdict breakdown, the failure modes we know about, and how the reconciliation badge on every result tells you which bucket your statement falls into.

The test corpus

Our corpus is intentionally diverse and it's also small, 61 statements. That's worth saying out loud. We're growing it, and we'll update this page when we re-run.

DimensionCoverage
Total documents61 bank statements
Unique banks30+ (Chase, Bank of America, Wells Fargo, Capital One, Citi, KeyBank, M&T, RBC, TD, CIBC, HSBC, ANZ, Commonwealth Bank, BNP Paribas, Deutsche Bank, plus regional credit unions and court-record extracts)
CountriesUS, UK, Canada, Australia, France, Germany, South Africa, Brazil, New Zealand
LanguagesEnglish, French, German, Portuguese, Russian (small samples)
Document typesPersonal checking / savings, credit-card statements, court-record statement extracts, business statements, summary-only PDFs
Format typesText-layer PDFs and scanned (image-only) PDFs in roughly 70 / 30 ratio

Verdict breakdown on the corpus

VerdictCountShare
reconciled Transactions extracted, sum ± opening = closing within 0.5%3049%
summary doc Statement contained no transaction table to verify (e.g., cover page or balance summary only)1830%
PDF inconsistency Our extraction matched the PDF's own Activity Summary, but the PDF's reported totals disagreed with the transaction list itself23%
badge: review Our extraction did not match the reported balance — result page shows a warn/mismatch badge so the user knows to verify1118%

The 98%+ claim is for well-structured statements: text-layer PDFs or clean scans from common retail banks with a single account and a standard activity table. Among documents that fit that profile, reconciliation is near-perfect. The 11 documents in the bottom row are the harder cases (degraded multi-column scans, mid-page rotation, footer ambiguity, mixed decimal separators). They don't fail silently. The result page shows a colored badge so you know to review before trusting the data.

Different denominators give different headline numbers. We think all three are worth listing:

What the failures look like

Across the 11 documents that did not reconcile, the common failure modes are:

How you know which case you're in

Every Bank2XL Excel includes a Validation sheet showing per-account reconciliation status. The result page badge is the same status with a single color:

This badge is the most important UI element in the product. The point of building reconciliation in is that you never have to trust output that hasn't been checked.

Performance

What we DON'T claim

How we'll improve this page

This page will be re-published whenever we run a new corpus pass. Planned next:

If you have a statement that didn't reconcile and you're willing to share it (after redacting), send it to support@bank2xl.app. We use shared corpora to drive prompt and pipeline improvements. Reconciliation rate is the metric we optimize for.

Open about the limitations

Bank2XL is a small product built by a small team. We chose to publish honest numbers rather than marketing-friendly ones. The trade-off:

Join the waitlist   See a sample output   How it works