BMF Pipeline Quality Report
Post-Transformation Validation Results
Executive Summary
Pipeline Timestamp: 2026-05-06 15:38:36
| Metric | Value |
|---|---|
| Total Records | 664,441 |
| Output Columns | 110 |
| Overall Completeness | 54.8% |
| Row Preservation | Passed |
| Unique EINs | 664,441 |
| Duplicate EINs | 0 |
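The row-preservation and EIN-uniqueness figures in the executive summary can be reproduced with a check along the following lines. This is a minimal sketch, not the pipeline's actual code: the helper name `summarize_eins` and its inputs (the input row count and the list of output EINs) are assumptions made for illustration.

```python
from collections import Counter
from typing import Dict, List, Union


def summarize_eins(input_rows: int, eins: List[str]) -> Dict[str, Union[int, str]]:
    """Hypothetical helper: row-preservation and EIN-uniqueness metrics."""
    counts = Counter(eins)
    # An EIN counts as a duplicate if it appears on more than one output row.
    duplicate_eins = sum(1 for n in counts.values() if n > 1)
    return {
        "total_records": len(eins),
        "unique_eins": len(counts),
        "duplicate_eins": duplicate_eins,
        # Assumption: the check passes when no rows were added or dropped.
        "row_preservation": "Passed" if len(eins) == input_rows else "Failed",
    }


summary = summarize_eins(3, ["12-3456789", "98-7654321", "11-1111111"])
```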
Field Completeness by Category
This section reports the completeness of each output field, grouped by category. Each table row lists the field's source column(s) in the raw BMF file.
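As a worked example of how the Completeness and Average Completeness figures relate to the Null Count column, the sketch below recomputes the Organization Name category from its null counts. It assumes completeness is the share of non-null rows rounded to one decimal place, and that the category average is the mean of the per-field percentages; the report does not state either rule explicitly, and the function names are illustrative.

```python
from typing import List

TOTAL_RECORDS = 664_441  # Total Records from the executive summary


def completeness_pct(total_rows: int, null_count: int) -> float:
    """Percentage of non-null rows, rounded to one decimal place."""
    return round(100.0 * (total_rows - null_count) / total_rows, 1)


def category_average(percentages: List[float]) -> float:
    """Assumed rule: unweighted mean of the per-field percentages."""
    return round(sum(percentages) / len(percentages), 1)


suffix_pct = completeness_pct(TOTAL_RECORDS, 410_195)  # org_legal_suffix
parent_pct = completeness_pct(TOTAL_RECORDS, 631_247)  # org_parent_name
name_avg = category_average([100.0, 100.0, 100.0, suffix_pct, parent_pct])
```

Under these assumptions the results match the reported 38.3%, 5.0%, and 68.7%.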
Identity
Average Completeness: 100.0% | Columns: 2
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| ein | EIN | 100.0% | Good | 0 |
| ein_raw | EIN | 100.0% | Good | 0 |
Column Descriptions:
- ein: Employer Identification Number formatted as XX-XXXXXXX
- ein_raw: Original 9-digit EIN value from source file without formatting
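The relationship between `ein_raw` and `ein` can be sketched as follows. This is an illustration only: the function name `format_ein` is hypothetical, and returning `None` for values that do not contain exactly nine digits is an assumption about how invalid input would be handled.

```python
import re
from typing import Optional


def format_ein(ein_raw: Optional[str]) -> Optional[str]:
    """Format a 9-digit EIN as XX-XXXXXXX (hypothetical helper)."""
    digits = re.sub(r"\D", "", ein_raw or "")  # keep digits only
    if len(digits) != 9:
        return None  # assumption: anything else is treated as invalid
    return f"{digits[:2]}-{digits[2:]}"
```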
Organization Name
Average Completeness: 68.7% | Columns: 5
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| org_name_raw | NAME | 100.0% | Good | 0 |
| org_name_join | NAME | 100.0% | Good | 0 |
| org_name_display | NAME | 100.0% | Good | 0 |
| org_legal_suffix | NAME | 38.3% | Low | 410,195 |
| org_parent_name | NAME | 5.0% | Low | 631,247 |
Column Descriptions:
- org_name_raw: Original organization name exactly as it appears in the source file
- org_name_join: Standardized name for matching and joining (uppercase, punctuation removed)
- org_name_display: Title-cased organization name suitable for display purposes
- org_legal_suffix: Legal entity suffix extracted from name (Inc, Corp, LLC, Foundation, etc.)
- org_parent_name: Parent organization name if this is a subordinate/chapter organization
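The join-name standardization and suffix extraction described above could look roughly like this. It is a sketch under stated assumptions: the suffix list is an illustrative subset taken from the description, the function names are hypothetical, and the real pipeline may apply further rules (for example, handling of `&` or abbreviations) that the report does not document.

```python
import string
from typing import Optional

# Illustrative subset only; the pipeline's actual suffix list is not shown in the report.
LEGAL_SUFFIXES = ("INC", "CORP", "LLC", "FOUNDATION")

_STRIP_PUNCT = str.maketrans("", "", string.punctuation)


def name_for_join(name_raw: str) -> str:
    """Uppercase, remove ASCII punctuation, and collapse repeated whitespace."""
    return " ".join(name_raw.upper().translate(_STRIP_PUNCT).split())


def legal_suffix(name_raw: str) -> Optional[str]:
    """Return the trailing legal suffix if the last token is a known one."""
    tokens = name_for_join(name_raw).split()
    if tokens and tokens[-1] in LEGAL_SUFFIXES:
        return tokens[-1]
    return None
```

A name with no recognized trailing suffix yields `None`, which is consistent with `org_legal_suffix` being populated for only part of the file.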
DBA Name
Average Completeness: 0.0% | Columns: 2
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| dba_name | SORT_NAME | 0.0% | Low | 664,441 |
| dba_name_raw | SORT_NAME | 0.0% | Low | 664,441 |
Column Descriptions:
- dba_name: Cleaned ‘Doing Business As’ name
- dba_name_raw: Original secondary/DBA name from source file
In Care Of
Average Completeness: 33.3% | Columns: 3
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| in_care_of_name_raw | ICO | 0.0% | Low | 664,441 |
| in_care_of_name_clean | ICO | 0.0% | Low | 664,441 |
| in_care_of_name_provided | ICO | 100.0% | Good | 0 |
Column Descriptions:
- in_care_of_name_raw: Original ‘In Care Of’ field from source file
- in_care_of_name_clean: Cleaned ICO name with standardized formatting
- in_care_of_name_provided: Boolean indicating whether an ICO name was provided
Group Exemption
Average Completeness: 33.3% | Columns: 3
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| group_exemption_number_raw | GROUP | 0.0% | Low | 664,441 |
| group_exemption_number | GROUP | 0.0% | Low | 664,441 |
| group_exemption_is_member | GROUP | 100.0% | Good | 0 |
Column Descriptions:
- group_exemption_number_raw: Original group exemption number from source file
- group_exemption_number: Cleaned group exemption number (GEN)
- group_exemption_is_member: Boolean indicating if organization is a member of a group exemption
Address (Raw)
Average Completeness: 75.0% | Columns: 4
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| org_addr_street_raw | STREET | 0.0% | Low | 664,441 |
| org_addr_city_raw | CITY | 100.0% | Good | 1 |
| org_addr_state_raw | STATE | 100.0% | Good | 0 |
| org_addr_zip_raw | ZIP | 100.0% | Good | 0 |
Column Descriptions:
- org_addr_street_raw: Original street address from source file
- org_addr_city_raw: Original city name from source file
- org_addr_state_raw: Original state code from source file
- org_addr_zip_raw: Original ZIP code from source file
Address (Cleaned)
Average Completeness: 68.7% | Columns: 7
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| org_addr_street | STREET | 0.0% | Low | 664,441 |
| org_addr_city | CITY | 100.0% | Good | 1 |
| org_addr_state | STATE | 99.8% | Good | 1,365 |
| org_addr_zip5 | ZIP | 90.4% | Fair | 63,873 |
| org_addr_zip4 | ZIP | 0.0% | Low | 664,441 |
| org_addr_zip | ZIP | 90.4% | Fair | 63,873 |
| org_addr_full | STREET, CITY, STATE, ZIP | 100.0% | Good | 0 |
Column Descriptions:
- org_addr_street: Standardized street address with USPS abbreviations
- org_addr_city: Cleaned city name
- org_addr_state: Two-letter state abbreviation
- org_addr_zip5: 5-digit ZIP code
- org_addr_zip4: 4-digit ZIP code extension (if available)
- org_addr_zip: Full ZIP code (5 or 9 digits)
- org_addr_full: Complete formatted address string
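The split of the raw ZIP value into `org_addr_zip5`, `org_addr_zip4`, and `org_addr_zip` can be sketched as below. The function names are hypothetical, and two behaviors are assumptions: values that are not 5 or 9 digits after stripping non-digits are treated as null, and the 9-digit form is written with a hyphen.

```python
import re
from typing import Optional, Tuple


def split_zip(zip_raw: Optional[str]) -> Tuple[Optional[str], Optional[str]]:
    """Return (zip5, zip4); both are None when the value is not 5 or 9 digits."""
    digits = re.sub(r"\D", "", zip_raw or "")
    if len(digits) == 5:
        return digits, None
    if len(digits) == 9:
        return digits[:5], digits[5:]
    return None, None  # assumption: other lengths are treated as invalid


def full_zip(zip5: Optional[str], zip4: Optional[str]) -> Optional[str]:
    """Combine the parts into a 5-digit or ZIP+4 string."""
    if zip5 is None:
        return None
    return f"{zip5}-{zip4}" if zip4 else zip5
```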
Address Quality Flags
Average Completeness: 100.0% | Columns: 6
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| org_addr_is_missing | STREET, CITY, STATE, ZIP | 100.0% | Good | 0 |
| org_addr_is_po_box | STREET | 100.0% | Good | 0 |
| org_addr_is_rural_route | STREET | 100.0% | Good | 0 |
| org_addr_has_special_chars | STREET | 100.0% | Good | 0 |
| org_addr_missing_number | STREET | 100.0% | Good | 0 |
| org_addr_state_invalid | STATE | 100.0% | Good | 0 |
Column Descriptions:
- org_addr_is_missing: TRUE if street address is missing or empty
- org_addr_is_po_box: TRUE if address is a P.O. Box
- org_addr_is_rural_route: TRUE if address is a rural route
- org_addr_has_special_chars: TRUE if address contains unusual special characters
- org_addr_missing_number: TRUE if street address lacks a street number
- org_addr_state_invalid: TRUE if state code is not a valid US state/territory
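Several of the street-based flags above can be approximated with simple pattern checks. The sketch below is illustrative: the regular expressions and the function name `address_flags` are assumptions, not the pipeline's actual rules, and they cover only the most common spellings of P.O. Box and rural route addresses.

```python
import re
from typing import Dict, Optional

# Assumed patterns; the pipeline's real matching rules are not shown in the report.
PO_BOX = re.compile(r"\bP\.?\s*O\.?\s*BOX\b", re.IGNORECASE)
RURAL_ROUTE = re.compile(r"\b(?:RR|RURAL\s+(?:ROUTE|RTE))\s*\d+", re.IGNORECASE)


def address_flags(street: Optional[str]) -> Dict[str, bool]:
    """Compute boolean quality flags for a street address string."""
    text = (street or "").strip()
    return {
        "org_addr_is_missing": text == "",
        "org_addr_is_po_box": bool(PO_BOX.search(text)),
        "org_addr_is_rural_route": bool(RURAL_ROUTE.search(text)),
        # Assumption: a non-empty street that does not start with a digit lacks a number.
        "org_addr_missing_number": text != "" and not re.match(r"\d", text),
    }
```

Because every flag is computed even when the street is empty, the flags are never null, which matches the 100.0% completeness reported for this category.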
Classification
Average Completeness: 35.1% | Columns: 4
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| subsection_code | SUBSECTION | 70.3% | Fair | 197,249 |
| classification_code | CLASSIFICATION | 0.0% | Low | 664,441 |
| exempt_organization_type | SUBSECTION | 70.3% | Fair | 197,457 |
| all_classifications_string | CLASSIFICATION | 0.0% | Low | 664,441 |
Column Descriptions:
- subsection_code: IRS subsection code (e.g., 03 for 501(c)(3), 04 for 501(c)(4))
- classification_code: IRS classification code indicating organization type within subsection
- exempt_organization_type: Human-readable exempt organization type based on subsection
- all_classifications_string: Semicolon-separated list of all classification descriptions
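The derivation of `exempt_organization_type` from `subsection_code` can be illustrated for the 501(c) case. This sketch assumes that two-digit codes 01 through 29 map directly to 501(c)(1) through 501(c)(29), which is consistent with the examples in the description; other types that appear in this report, such as 4947(a)(1) and 501(k), would need a lookup table that the report does not include, so the hypothetical function below returns `None` for them.

```python
from typing import Optional


def exempt_organization_type(subsection_code: Optional[str]) -> Optional[str]:
    """Map a subsection code such as '03' to a label such as '501(c)(3)'."""
    code = (subsection_code or "").strip()
    if not code.isdigit():
        return None  # missing or malformed code
    number = int(code)
    if 1 <= number <= 29:
        return f"501(c)({number})"
    # Non-501(c) codes require a lookup table not shown in the report.
    return None
```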
Organization Codes
Average Completeness: 33.0% | Columns: 11
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| affiliation_code | AFFILIATION | 0.0% | Low | 664,441 |
| affiliation_code_definition | AFFILIATION | 0.0% | Low | 664,441 |
| deductibility_code | DEDUCTIBILITY | 0.0% | Low | 664,441 |
| deductibility_code_definition | DEDUCTIBILITY | 0.0% | Low | 664,441 |
| foundation_code | FOUNDATION | 65.8% | Low | 227,369 |
| foundation_code_definition | FOUNDATION | 65.8% | Low | 227,369 |
| organization_code | ORGANIZATION | 65.8% | Low | 227,369 |
| organization_code_definition | ORGANIZATION | 65.8% | Low | 227,369 |
| status_code | STATUS | 0.0% | Low | 664,441 |
| status_code_definition | STATUS | 0.0% | Low | 664,441 |
| naics_code | NTEE_CD | 100.0% | Good | 0 |
Column Descriptions:
- affiliation_code: Code indicating relationship to parent organization (1-9)
- affiliation_code_definition: Description of affiliation relationship
- deductibility_code: Code indicating deductibility status of contributions (1-4)
- deductibility_code_definition: Description of contribution deductibility
- foundation_code: Foundation status code (00-99) per IRS determination
- foundation_code_definition: Description of foundation/public charity status
- organization_code: Code for type of organization (corporation, trust, etc.)
- organization_code_definition: Description of organization type
- status_code: IRS determination status code (01-99)
- status_code_definition: Description of exempt status
- naics_code: North American Industry Classification System code derived from NTEE
Dates
Average Completeness: 57.1% | Columns: 7
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| ruling_date_ym_str | RULING | 0.0% | Low | 664,441 |
| ruling_date | RULING | 100.0% | Good | 0 |
| ruling_date_is_missing | RULING | 100.0% | Good | 0 |
| tax_period_ym_str | TAX_PERIOD | 0.0% | Low | 664,441 |
| tax_period_ymd | TAX_PERIOD | 100.0% | Good | 0 |
| tax_period_is_missing | TAX_PERIOD | 100.0% | Good | 0 |
| accounting_period | ACCT_PD | 0.0% | Low | 664,441 |
Column Descriptions:
- ruling_date_ym_str: Ruling date as YYYYMM string
- ruling_date: Date of IRS ruling granting exempt status
- ruling_date_is_missing: TRUE if ruling date is missing or invalid
- tax_period_ym_str: Tax period end date as YYYYMM string
- tax_period_ymd: Tax period end date in YYYY-MM-DD format
- tax_period_is_missing: TRUE if tax period is missing
- accounting_period: Month when organization’s accounting period ends (01-12)
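The conversion from a YYYYMM string to a full date can be sketched as follows. Two points are assumptions rather than documented behavior: that `tax_period_ymd` uses the last day of the period month, and that values such as `000000` or an out-of-range month are treated as missing. The function names are illustrative.

```python
import calendar
from datetime import date
from typing import Optional, Tuple


def parse_yyyymm(value: Optional[str]) -> Optional[Tuple[int, int]]:
    """Return (year, month) for a valid YYYYMM string, else None."""
    text = (value or "").strip()
    if len(text) != 6 or not text.isdigit():
        return None
    year, month = int(text[:4]), int(text[4:])
    if year < 1 or not 1 <= month <= 12:
        return None
    return year, month


def period_end_date(value: Optional[str]) -> Optional[date]:
    """Assumed rule: the period ends on the last day of the given month."""
    parsed = parse_yyyymm(value)
    if parsed is None:
        return None
    year, month = parsed
    last_day = calendar.monthrange(year, month)[1]  # handles leap years
    return date(year, month, last_day)
```

A flag such as `tax_period_is_missing` would then simply be `period_end_date(value) is None`.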
Financial Codes
Average Completeness: 0.0% | Columns: 4
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| asset_code | ASSET_CD | 0.0% | Low | 664,441 |
| asset_code_definition | ASSET_CD | 0.0% | Low | 664,441 |
| income_code | INCOME_CD | 0.0% | Low | 664,441 |
| income_code_definition | INCOME_CD | 0.0% | Low | 664,441 |
Column Descriptions:
- asset_code: Asset amount range code (0-9)
- asset_code_definition: Description of asset range (e.g., ‘$100,000 to $499,999’)
- income_code: Income amount range code (0-9)
- income_code_definition: Description of income range
Financial Amounts
Average Completeness: 66.7% | Columns: 3
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| asset_amount | ASSET_AMT | 100.0% | Good | 0 |
| income_amount | INCOME_AMT | 100.0% | Good | 0 |
| revenue_amount | REVENUE_AMT | 0.0% | Low | 664,441 |
Column Descriptions:
- asset_amount: Total assets in dollars (most recent return)
- income_amount: Total income in dollars (can be negative)
- revenue_amount: Total revenue in dollars (can be negative)
Activity
Average Completeness: 0.0% | Columns: 3
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| activity_code | ACTIVITY | 0.0% | Low | 664,441 |
| activity_code_definitions | ACTIVITY | 0.0% | Low | 664,441 |
| activity_code_categories | ACTIVITY | 0.0% | Low | 664,441 |
Column Descriptions:
- activity_code: Three 3-digit activity codes concatenated (9 characters total)
- activity_code_definitions: Semicolon-separated descriptions of activity codes
- activity_code_categories: Semicolon-separated activity categories
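Since `activity_code` packs three 3-digit codes into one 9-character string, a first step toward the definitions and categories columns is to split it. The sketch below assumes that `000` marks an unused slot and that anything other than nine digits is treated as missing; the function name is hypothetical.

```python
from typing import List, Optional


def split_activity_codes(activity: Optional[str]) -> List[str]:
    """Split a 9-character activity string into its 3-digit codes."""
    text = (activity or "").strip()
    if len(text) != 9 or not text.isdigit():
        return []
    codes = [text[i:i + 3] for i in range(0, 9, 3)]
    # Assumption: '000' means the slot is unused.
    return [code for code in codes if code != "000"]
```

Each resulting code could then be looked up in a code-to-description table and the results joined with semicolons.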
Filing Requirements
Average Completeness: 0.0% | Columns: 4
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| filing_requirement_code | FILING_REQ_CD | 0.0% | Low | 664,441 |
| filing_requirement_code_definition | FILING_REQ_CD | 0.0% | Low | 664,441 |
| pf_filing_requirement_code | PF_FILING_REQ_CD | 0.0% | Low | 664,441 |
| pf_filing_requirement_code_definition | PF_FILING_REQ_CD | 0.0% | Low | 664,441 |
Column Descriptions:
- filing_requirement_code: Code indicating required annual return form (0-6)
- filing_requirement_code_definition: Description of filing requirement (990, 990-EZ, 990-N, etc.)
- pf_filing_requirement_code: Private foundation filing requirement code
- pf_filing_requirement_code_definition: Description of private foundation filing requirement
NTEE Codes
Average Completeness: 100.0% | Columns: 6
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| ntee_code_raw | NTEE_CD | 100.0% | Good | 0 |
| ntee_code_clean | NTEE_CD | 100.0% | Good | 0 |
| ntee_code_definition | NTEE_CD | 100.0% | Good | 0 |
| ntee_code_major_group | NTEE_CD | 100.0% | Good | 0 |
| ntee_common_code | NTEE_CD | 100.0% | Good | 0 |
| ntee_common_code_definition | NTEE_CD | 100.0% | Good | 0 |
Column Descriptions:
- ntee_code_raw: Original NTEE code from source file (1-4 characters)
- ntee_code_clean: Standardized 3-character NTEE code
- ntee_code_definition: Full description of NTEE classification
- ntee_code_major_group: NTEE major group letter (A-Z) indicating broad category
- ntee_common_code: Common code suffix for 4-character NTEE codes (e.g., 01-99)
- ntee_common_code_definition: Description of common code suffix
NTEE V2 Codes
Average Completeness: 100.0% | Columns: 5
| Field | Source | Completeness | Status | Null Count |
|---|---|---|---|---|
| nteev2 | NTEE_CD | 100.0% | Good | 0 |
| nteev2_code | NTEE_CD | 100.0% | Good | 0 |
| nteev2_subsector | NTEE_CD | 100.0% | Good | 0 |
| nteev2_subsector_definition | NTEE_CD | 100.0% | Good | 0 |
| nteev2_org_type | NTEE_CD | 100.0% | Good | 0 |
Column Descriptions:
- nteev2: Full NTEEv2 code in SUBSECTOR-CODE-TYPE format
- nteev2_code: NTEEv2 code portion (3 characters)
- nteev2_subsector: NTEEv2 subsector code (e.g., UNI, HOS, ART, ENV)
- nteev2_subsector_definition: Human-readable name of the NTEEv2 subsector (e.g., ‘Human Services’, ‘Public, Societal Benefit’)
- nteev2_org_type: NTEEv2 organization type (RG=Regular, AA=Alliance, etc.)
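Given the SUBSECTOR-CODE-TYPE layout of `nteev2`, the component columns can be recovered by splitting on the hyphens. This is a minimal sketch; the function name and the sample value `ART-A20-RG` are illustrative and are not taken from the report's data.

```python
from typing import Dict, Optional


def parse_nteev2(value: str) -> Optional[Dict[str, str]]:
    """Split a SUBSECTOR-CODE-TYPE string into its three parts."""
    parts = value.split("-")
    if len(parts) != 3:
        return None  # not in the expected three-part format
    subsector, code, org_type = parts
    return {
        "nteev2_subsector": subsector,
        "nteev2_code": code,
        "nteev2_org_type": org_type,
    }


example = parse_nteev2("ART-A20-RG")  # illustrative value
```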
Organization Distribution
Exempt Organization Type
Distribution of organizations by exempt organization type (based on IRS subsection code).
| Exempt Organization Type | Count | Percentage |
|---|---|---|
| 501(c)(3) | 465,788 | 70.10% |
| NA | 197,457 | 29.72% |
| 501(c)(5) | 421 | 0.06% |
| 501(c)(4) | 377 | 0.06% |
| 501(c)(6) | 213 | 0.03% |
| 501(c)(7) | 73 | 0.01% |
| 501(c)(2) | 22 | 0.00% |
| 501(c)(9) | 22 | 0.00% |
| 501(c)(10) | 17 | 0.00% |
| 501(c)(8) | 16 | 0.00% |
| 501(c)(19) | 8 | 0.00% |
| 501(c)(13) | 8 | 0.00% |
| 501(c)(1) | 7 | 0.00% |
| 501(c)(12) | 5 | 0.00% |
| 4947(a)(1) | 3 | 0.00% |
| 501(c)(25) | 2 | 0.00% |
| 501(c)(14) | 1 | 0.00% |
| 501(k) | 1 | 0.00% |
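A distribution table like the one above can be produced by counting values and dividing by the total. The sketch below assumes nulls are reported under the label `NA`, as in the table, and that percentages are rounded to two decimal places; the function name is hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Optional, Tuple


def distribution(values: Iterable[Optional[str]]) -> List[Tuple[str, int, float]]:
    """Return (label, count, percentage) rows sorted by descending count."""
    counts = Counter("NA" if v is None else v for v in values)
    total = sum(counts.values())
    return [
        (label, n, round(100.0 * n / total, 2))
        for label, n in counts.most_common()
    ]


rows = distribution(["501(c)(3)", "501(c)(3)", None, "501(c)(4)"])
```

With the report's own figures, 465,788 of 664,441 records gives the 70.10% shown for 501(c)(3).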
NTEE Major Group Distribution
Distribution of organizations by NTEE major group code.
| NTEE Major Group | Count | Percentage |
|---|---|---|
| Education | 113,084 | 17.02% |
| Religion-Related | 80,440 | 12.11% |
| Arts, Culture and Humanities | 73,518 | 11.06% |
| Philanthropy, Voluntarism and Grantmaking Foundations | 67,616 | 10.18% |
| Human Services | 52,095 | 7.84% |
| Recreation and Sports | 35,300 | 5.31% |
| Health Care | 33,555 | 5.05% |
| Community Improvement and Capacity Building | 30,680 | 4.62% |
| Youth Development | 25,776 | 3.88% |
| Unknown | 24,277 | 3.65% |
| Environment | 16,904 | 2.54% |
| Voluntary Health Associations and Medical Disciplines | 15,643 | 2.35% |
| Housing and Shelter | 14,936 | 2.25% |
| Public and Societal Benefit | 14,457 | 2.18% |
| Mental Health and Crisis Intervention | 12,391 | 1.86% |
| Public Safety, Disaster Preparedness and Relief | 9,204 | 1.39% |
| Crime and Legal-Related | 8,180 | 1.23% |
| International, Foreign Affairs and National Security | 5,802 | 0.87% |
| Civil Rights, Societal Action and Advocacy | 5,404 | 0.81% |
| Employment | 5,397 | 0.81% |
| Animal-Related | 5,274 | 0.79% |
| Science and Technology | 4,577 | 0.69% |
| Food, Agriculture and Nutrition | 3,680 | 0.55% |
| Medical Research | 2,182 | 0.33% |
| Mutual and Membership Benefit | 1,909 | 0.29% |
| Social Science | 1,780 | 0.27% |
| UNDEFINED | 380 | 0.06% |
Financial Summary
| Metric | Value |
|---|---|
| Total Assets (all organizations) | $0 |
| Median Assets | $0 |
| Organizations with Asset Data | 664,441 |
| Organizations with Zero Assets | 664,441 |
| Total Income | $0 |
| Median Income | $0 |
| Total Revenue | $0 |
| Median Revenue | NA |
Address Quality
| Address Issue | Count | % of Total |
|---|---|---|
| Missing Address | 664,441 | 100.00% |
| P.O. Box Addresses | 0 | 0.00% |
| Rural Route Addresses | 0 | 0.00% |
| Invalid State Code | 1,365 | 0.21% |
Date Coverage
Ruling Date Range
No ruling date range data available.
Tax Period Year Distribution
Data Issues
All expected columns are present and no critical field issues were found.
Report Metadata
| Property | Value |
|---|---|
| Report Generated | 2026-05-06 15:38:58 |
| Pipeline Timestamp | 2026-05-06 15:38:36 |
| Row Preservation Check | Passed |
| Overall Completeness | 54.8% |
| Overall Result | PASSED |
Generated by BMF Pipeline Quality System