Home » Accuracy methodology

Accuracy methodology

How we measure Bank2XL's extraction quality, and the real numbers.

Short version: on our internal test corpus of 61 bank statements from 20+ banks across the US, UK, Canada, Europe, Australia, and Brazil, 82% reconciled cleanly, 18% needed human review. The reconciliation status badge on every result page tells you which bucket your statement falls into before you trust the data.

Why publish the real numbers

Bank statement converters tend to claim a single eye-catching accuracy figure ("99% accuracy!", "field-leading precision!") without saying what was measured, what the corpus looked like, or how often the tool admits it doesn't know. We think that's the wrong way to earn trust. We'd rather tell you the honest picture and let the reconciliation badge on every result do the rest.

The test corpus

Our corpus is intentionally diverse. It is also small — 61 statements — and we want to be upfront about that. We are continuing to grow it, and we will update this page when we re-run.

DimensionCoverage
Total documents61 bank statements
Unique banks30+ (Chase, Bank of America, Wells Fargo, Capital One, Citi, KeyBank, M&T, RBC, TD, CIBC, HSBC, ANZ, Commonwealth Bank, BNP Paribas, Deutsche Bank, plus regional credit unions and court-record extracts)
CountriesUS, UK, Canada, Australia, France, Germany, South Africa, Brazil, New Zealand
LanguagesEnglish, French, German, Portuguese, Russian (small samples)
Document typesPersonal checking / savings, credit-card statements, court-record statement extracts, business statements, summary-only PDFs
Format typesText-layer PDFs and scanned (image-only) PDFs in roughly 70 / 30 ratio

Headline results

VerdictCountShare
reconciled Transactions extracted, sum ± opening = closing within 0.5%3049%
summary doc Statement contained no transaction table to verify (e.g., cover page or balance summary only)1830%
PDF inconsistency Our extraction matched the PDF's own Activity Summary, but the PDF's reported totals disagreed with the transaction list itself23%
reconcile failed Our extraction did not match the reported balance — needs review1118%

The headline 82% figure sums "reconciled" + "summary doc" + "PDF inconsistency" — the cases where Bank2XL either produced data that reconciles, or correctly recognized there was nothing to reconcile. The remaining 18% (11 documents) are extraction failures that the user would need to verify manually.

If you tighten "success" to extraction-with-balance-reconciled-vs-source, the number is 30 of 43 documents that had a verifiable transaction table — 70%. We think both numbers are worth knowing.

What the failures look like

Across the 11 documents that did not reconcile, the common failure modes are:

How you know which case you're in

Every Bank2XL Excel includes a Validation sheet showing per-account reconciliation status. The result page badge is the same status with a single color:

This is the single most important UI element in the product. The whole point of building reconciliation in is so that you never trust output you shouldn't trust.

Performance

What we DON'T claim

How we'll improve this page

This page will be re-published whenever we run a new corpus pass. Planned next:

If you have a statement that didn't reconcile and you're willing to share it (after redacting), send it to [email protected]. We use shared corpora to drive prompt and pipeline improvements; reconciliation rate is the metric we optimize for.

Open about the limitations

Bank2XL is a small product built by a small team. We have intentionally chosen to publish honest numbers rather than marketing-friendly ones. The trade-off:

Join the waitlist   See a sample output   How it works