DS-04 Financial Services

SME Financial Health & Informal Economy Dataset

Granular financial records from African informal SMEs spanning revenue cycles, inventory turnover, supplier credit terms, and digital payment adoption — purpose-built for inclusive credit scoring and embedded finance models.

This is a synthetic dataset generated from high-quality expert-labelled seed data. All records are algorithmically derived — statistical distributions, inter-field correlations, and annotation characteristics faithfully replicate real-world patterns from the source data, while ensuring no real individual, organisation, or transaction can be identified or reconstructed.

The African SME Financial Health & Informal Economy Dataset captures the operational and financial reality of small and micro enterprises across Nigeria, Ghana, Rwanda, and South Africa — markets where 80–90 % of businesses operate outside formal financial infrastructure. Each record encodes monthly revenue, cost of goods, supplier credit utilisation, digital-payment share, and an annotated informal-economy score derived from field surveys and aggregated mobile-money transaction signatures.

Unlike corporate financial datasets, this collection reflects the seasonality, cash-flow irregularity, and asset-light nature of African informal commerce. Records span market traders, artisans, smallholder agri-processors, transport operators, and service micro-businesses. Each entry carries a manually validated credit-risk label and a data-completeness flag, enabling robust train / eval splits even when fields are partially observed.

The dataset is purpose-built for teams developing inclusive credit underwriting engines, embedded-finance products for neobanks and MNOs, and market-sizing models for B2SME SaaS platforms. Structural anonymisation removes all direct identifiers while preserving the statistical distributions needed for bias-aware model evaluation.

Key Use Cases

Inclusive credit scoring for unbanked and under-banked SMEs
Embedded finance product design for MNOs and neobanks
Informal-economy segmentation and market sizing
Cash-flow forecasting and working-capital risk models
Digital payment adoption propensity modelling
Supply-chain finance and supplier credit risk assessment
SME churn and financial distress early-warning systems
Regulatory sandbox testing for alternative credit frameworks

Data Quality Indicators

Field Completeness 87 %
Label Accuracy (Credit Risk) 91 %
Geographic Representation 78 %
Temporal Consistency 83 %

Geographic Coverage

Primary Coverage
Other Regions

Dataset Schema

Each record represents one SME business unit observed over a monthly reporting period. Fields cover business profile, financial flows, credit relationships, digital adoption, and annotated risk labels.

Field NameTypeDescriptionNullableExample
sme_id STRING Anonymised unique identifier for the SME unit No SME-NGA-00312
country_code STRING ISO 3166-1 alpha-2 country code No NG
city_cluster STRING City / market cluster label (anonymised) No Lagos-SW
sector ENUM Business sector: RETAIL, FOOD, TRANSPORT, SERVICES, AGRI_PROCESS No RETAIL
report_month DATE Observation month (YYYY-MM) No 2024-03
monthly_revenue FLOAT Total monthly revenue in USD equivalent No 1240.5
cogs FLOAT Cost of goods sold in USD equivalent Yes 780
gross_margin_pct FLOAT Gross margin as a percentage of revenue Yes 37.1
inventory_days INTEGER Average inventory turnover days Yes 14
supplier_credit_usd FLOAT Outstanding supplier credit balance in USD Yes 430
supplier_credit_days INTEGER Typical supplier credit term in days Yes 21
digital_payment_pct FLOAT Share of revenue collected via digital channels (%) No 62.5
mobile_money_provider ENUM Primary mobile-money provider: MPESA, MTN, AIRTEL, OPAY, OTHER Yes OPAY
num_employees INTEGER Number of employees (including owner) Yes 3
years_operating INTEGER Years the business has been operating No 5
has_bank_account BOOLEAN Whether the SME holds a formal bank account No true
informal_economy_score FLOAT Composite informality score (0–100, higher = more informal) No 68.4
credit_risk_label ENUM Annotated credit risk: LOW, MEDIUM, HIGH No MEDIUM
data_completeness_flag BOOLEAN True if fewer than 3 fields are missing in this record No true

Sample Records

Four representative records illustrating variation across sectors, countries, and credit-risk profiles.

sme_sample_records.json
[ { "sme_id": "SME-NGA-00312", "country_code": "NG", "city_cluster": "Lagos-SW", "sector": "RETAIL", "report_month": "2024-03", "monthly_revenue": 1240.5, "cogs": 780, "gross_margin_pct": 37.1, "inventory_days": 14, "supplier_credit_usd": 430, "supplier_credit_days": 21, "digital_payment_pct": 62.5, "mobile_money_provider": "OPAY", "num_employees": 3, "years_operating": 5, "has_bank_account": true, "informal_economy_score": 68.4, "credit_risk_label": "MEDIUM", "data_completeness_flag": true }, { "sme_id": "SME-GHA-00187", "country_code": "GH", "city_cluster": "Accra-N", "sector": "FOOD", "report_month": "2024-02", "monthly_revenue": 870, "cogs": 610, "gross_margin_pct": 29.9, "inventory_days": 8, "supplier_credit_usd": 150, "supplier_credit_days": 14, "digital_payment_pct": 45, "mobile_money_provider": "MTN", "num_employees": 2, "years_operating": 3, "has_bank_account": false, "informal_economy_score": 82.1, "credit_risk_label": "HIGH", "data_completeness_flag": true }, { "sme_id": "SME-RWA-00094", "country_code": "RW", "city_cluster": "Kigali-C", "sector": "SERVICES", "report_month": "2024-01", "monthly_revenue": 2100, "cogs": null, "gross_margin_pct": null, "inventory_days": null, "supplier_credit_usd": 0, "supplier_credit_days": 0, "digital_payment_pct": 91.3, "mobile_money_provider": "AIRTEL", "num_employees": 6, "years_operating": 8, "has_bank_account": true, "informal_economy_score": 31.7, "credit_risk_label": "LOW", "data_completeness_flag": false }, { "sme_id": "SME-ZAF-00521", "country_code": "ZA", "city_cluster": "Johannesburg-SW", "sector": "TRANSPORT", "report_month": "2024-03", "monthly_revenue": 3450, "cogs": 2200, "gross_margin_pct": 36.2, "inventory_days": null, "supplier_credit_usd": 980, "supplier_credit_days": 30, "digital_payment_pct": 78, "mobile_money_provider": "OTHER", "num_employees": 5, "years_operating": 12, "has_bank_account": true, "informal_economy_score": 44.9, "credit_risk_label": "LOW", "data_completeness_flag": true } ]
Request Dataset Access

All datasets are available under a commercial licence agreement. Our team typically responds within 2 business days.

Request Access
NDA may be required

Build with Data that reflects Africa

Request access to our full catalog of licensed human-validated African datasets or request custom data tailored to your project.