CPA · US
US-wide Certified Public Accountant data across 55 jurisdictions — state-board licensee rosters, NASBA CPAverify, PCAOB-registered firms, IRS PTIN holders, discipline actions. Target ~600–800K records by Phase 1.
Active CPAs sampled from Florida Board of Accountancy (weekly bulk xlsx, ~71K total) + IRS PTIN FOIA (~208K CPA-bearing nationwide). P2: NASBA ALD per-name enrichment, TX TSBPA, CA CBA, NY NYSED.
697 PCAOB-registered audit firms (with engagement-partner + issuer counts) merged with 1,628 AICPA GAQC governmental audit member firms + 8 State Audit Organizations.
IRS PTIN holders (858K total, biannual FOIA CSV) — superset that also includes EAs / Attorneys / uncredentialed preparers. CPAs are already in /licensees.
PCAOB enforcement actions + inspection reports (4.3K, CSV/XML/JSON), SEC accountant suspensions, state-board disciplinary listings (NC, MN, TX, CA, …).
Phase 1 pickup order
- NASBA CPAverify / ALD — ~600–700K records across 53/55 jurisdictions. Bulk API requires email to
aldsupport@nasba.org. - Florida CBA bulk xlsx — ~40–50K, direct download, ready-to-ingest. Best POC for the state-roster pipeline.
- IRS PTIN bulk CSV (FOIA) — 2.1M+ preparers; cross-reference with NASBA to flag CPAs vs. EAs vs. uncredentialed.
- PCAOB AuditorSearch — 10K+ registered firms, daily-updated zip.
- State boards (P2) — TX TSBPA, CA CBA, NY NYSED — bulk via records request.
Full inventory: docs/CPA_DATA_SOURCES.md