SME Financial Health & Informal Economy Dataset
Granular financial records from African informal SMEs spanning revenue cycles, inventory turnover, supplier credit terms, and digital payment adoption — purpose-built for inclusive credit scoring and embedded finance models.
This is a synthetic dataset generated from high-quality expert-labelled seed data. All records are algorithmically derived — statistical distributions, inter-field correlations, and annotation characteristics faithfully replicate real-world patterns from the source data, while ensuring no real individual, organisation, or transaction can be identified or reconstructed.
The African SME Financial Health & Informal Economy Dataset captures the operational and financial reality of small and micro enterprises across Nigeria, Ghana, Rwanda, and South Africa — markets where 80–90 % of businesses operate outside formal financial infrastructure. Each record encodes monthly revenue, cost of goods, supplier credit utilisation, digital-payment share, and an annotated informal-economy score derived from field surveys and aggregated mobile-money transaction signatures.
Unlike corporate financial datasets, this collection reflects the seasonality, cash-flow irregularity, and asset-light nature of African informal commerce. Records span market traders, artisans, smallholder agri-processors, transport operators, and service micro-businesses. Each entry carries a manually validated credit-risk label and a data-completeness flag, enabling robust train / eval splits even when fields are partially observed.
The dataset is purpose-built for teams developing inclusive credit underwriting engines, embedded-finance products for neobanks and MNOs, and market-sizing models for B2SME SaaS platforms. Structural anonymisation removes all direct identifiers while preserving the statistical distributions needed for bias-aware model evaluation.
Key Use Cases
Data Quality Indicators
Geographic Coverage
Dataset Schema
Each record represents one SME business unit observed over a monthly reporting period. Fields cover business profile, financial flows, credit relationships, digital adoption, and annotated risk labels.
| Field Name | Type | Description | Nullable | Example |
|---|---|---|---|---|
| sme_id | STRING | Anonymised unique identifier for the SME unit | No | SME-NGA-00312 |
| country_code | STRING | ISO 3166-1 alpha-2 country code | No | NG |
| city_cluster | STRING | City / market cluster label (anonymised) | No | Lagos-SW |
| sector | ENUM | Business sector: RETAIL, FOOD, TRANSPORT, SERVICES, AGRI_PROCESS | No | RETAIL |
| report_month | DATE | Observation month (YYYY-MM) | No | 2024-03 |
| monthly_revenue | FLOAT | Total monthly revenue in USD equivalent | No | 1240.5 |
| cogs | FLOAT | Cost of goods sold in USD equivalent | Yes | 780 |
| gross_margin_pct | FLOAT | Gross margin as a percentage of revenue | Yes | 37.1 |
| inventory_days | INTEGER | Average inventory turnover days | Yes | 14 |
| supplier_credit_usd | FLOAT | Outstanding supplier credit balance in USD | Yes | 430 |
| supplier_credit_days | INTEGER | Typical supplier credit term in days | Yes | 21 |
| digital_payment_pct | FLOAT | Share of revenue collected via digital channels (%) | No | 62.5 |
| mobile_money_provider | ENUM | Primary mobile-money provider: MPESA, MTN, AIRTEL, OPAY, OTHER | Yes | OPAY |
| num_employees | INTEGER | Number of employees (including owner) | Yes | 3 |
| years_operating | INTEGER | Years the business has been operating | No | 5 |
| has_bank_account | BOOLEAN | Whether the SME holds a formal bank account | No | true |
| informal_economy_score | FLOAT | Composite informality score (0–100, higher = more informal) | No | 68.4 |
| credit_risk_label | ENUM | Annotated credit risk: LOW, MEDIUM, HIGH | No | MEDIUM |
| data_completeness_flag | BOOLEAN | True if fewer than 3 fields are missing in this record | No | true |
Sample Records
Four representative records illustrating variation across sectors, countries, and credit-risk profiles.
Build with Data that reflects Africa
Request access to our full catalog of licensed human-validated African datasets or request custom data tailored to your project.