DS-15 HealthCare

African Community Health Worker Interaction Dataset

500K+ structured visit records from community health workers operating in rural Nigeria, Ghana, and Kenya — covering maternal health assessments, child nutrition screenings, immunisation tracking, and referral decisions to train AI tools that extend last-mile healthcare capacity.

This is a synthetic dataset generated from high-quality expert-labelled seed data. All records are algorithmically derived — statistical distributions, inter-field correlations, and annotation characteristics faithfully replicate real-world patterns from the source data, while ensuring no real individual, organisation, or transaction can be identified or reconstructed.

The African Community Health Worker Interaction Dataset contains 500K+ structured visit records generated by community health workers (CHWs) operating in rural and peri-urban communities across Nigeria, Ghana, and Kenya. CHWs recorded each household visit using a standardised digital data collection tool deployed on low-end Android smartphones, capturing maternal health indicators, child nutrition and growth metrics, immunisation status, symptom flags, and the referral decision made at the point of care.

Maternal health records cover antenatal visit compliance, delivery location, postnatal check scheduling, and danger sign screening for conditions including pre-eclampsia, postpartum haemorrhage, and neonatal infection. Child records track MUAC measurements, weight-for-age Z-scores, breastfeeding status, and vaccination history against the national immunisation schedule. Each visit record is linked to a household identifier enabling longitudinal panel analysis of health trajectories across multiple CHW visits.

The dataset is purpose-built for CHW decision-support AI — systems that suggest next-best actions, flag high-risk households for supervisor follow-up, and optimise visit routing. It is also used for supply-chain forecasting (predicting commodity needs based on caseload trends) and for training multilingual voice-based data collection assistants that reduce CHW administrative burden.

Key Use Cases

CHW next-best-action decision support at point of care
High-risk household flagging for supervisor follow-up
Maternal danger sign early warning and escalation routing
Child malnutrition screening and MUAC trend monitoring
Immunisation defaulter identification and reminder targeting
CHW visit route optimisation and caseload management
Health commodity demand forecasting from caseload data
Multilingual voice assistant training for CHW data entry

Visit Type Distribution

Maternal Health (ANC/PNC) 38 %
Child Nutrition & Growth 29 %
Immunisation Follow-up 18 %
General Household Screening 15 %

Geographic Coverage

Primary Coverage
Other Regions

Dataset Schema

Each record represents one CHW household visit. Fields cover visit identity, beneficiary type, clinical assessment indicators, and the referral decision made at point of care.

Field NameTypeDescriptionNullableExample
visit_id STRING Unique visit identifier No VIS-NGA-KG-0081234
household_id STRING Anonymised persistent household identifier No HH-NGA-KG-004821
chw_id STRING Anonymised CHW identifier No CHW-NGA-0174
country_code STRING ISO 3166-1 alpha-2 country code No NG
visit_date DATE Date of CHW visit (YYYY-MM-DD) No 2023-07-12
visit_type ENUM Visit category: ANC, PNC, NUTRITION, IMMUNISATION, GENERAL_SCREENING No ANC
beneficiary_type ENUM Primary beneficiary: PREGNANT_WOMAN, POSTPARTUM_WOMAN, CHILD_U5, HOUSEHOLD No PREGNANT_WOMAN
gestational_age_weeks INTEGER Gestational age in weeks (null if not maternal visit) Yes 28
muac_cm FLOAT Mid-upper arm circumference in cm (null if not nutrition visit) Yes null
weight_kg FLOAT Measured weight in kg Yes 62.4
danger_sign_flag BOOLEAN True if one or more maternal or neonatal danger signs were observed No false
immunisation_up_to_date BOOLEAN True if child immunisations are current per national schedule (null if not child) Yes null
referral_made BOOLEAN True if CHW made a facility referral during this visit No false
referral_reason STRING Free-text referral reason (null if no referral) Yes null
commodities_dispensed JSON Array of commodity names dispensed during visit (e.g. ORS, iron-folate, ITN) Yes ["iron-folate", "ITN"]
visit_duration_min INTEGER Duration of visit in minutes Yes 22

Sample Records

Four representative CHW visit records spanning visit types, countries, and referral outcomes.

chw_visit_sample.json
[ { "visit_id": "VIS-NGA-KG-0081234", "household_id": "HH-NGA-KG-004821", "chw_id": "CHW-NGA-0174", "country_code": "NG", "visit_date": "2023-07-12", "visit_type": "ANC", "beneficiary_type": "PREGNANT_WOMAN", "gestational_age_weeks": 28, "muac_cm": null, "weight_kg": 62.4, "danger_sign_flag": false, "immunisation_up_to_date": null, "referral_made": false, "referral_reason": null, "commodities_dispensed": [ "iron-folate", "ITN" ], "visit_duration_min": 22 }, { "visit_id": "VIS-GHA-NR-0034091", "household_id": "HH-GHA-NR-001243", "chw_id": "CHW-GHA-0089", "country_code": "GH", "visit_date": "2023-05-30", "visit_type": "NUTRITION", "beneficiary_type": "CHILD_U5", "gestational_age_weeks": null, "muac_cm": 11.2, "weight_kg": 8.7, "danger_sign_flag": true, "immunisation_up_to_date": false, "referral_made": true, "referral_reason": "SAM — MUAC below 11.5 cm threshold, child unresponsive", "commodities_dispensed": [ "RUTF" ], "visit_duration_min": 35 }, { "visit_id": "VIS-KEN-KS-0057812", "household_id": "HH-KEN-KS-008834", "chw_id": "CHW-KEN-0231", "country_code": "KE", "visit_date": "2022-11-14", "visit_type": "IMMUNISATION", "beneficiary_type": "CHILD_U5", "gestational_age_weeks": null, "muac_cm": 13.8, "weight_kg": 11.1, "danger_sign_flag": false, "immunisation_up_to_date": true, "referral_made": false, "referral_reason": null, "commodities_dispensed": [], "visit_duration_min": 15 }, { "visit_id": "VIS-NGA-EN-0092410", "household_id": "HH-NGA-EN-019031", "chw_id": "CHW-NGA-0341", "country_code": "NG", "visit_date": "2023-09-03", "visit_type": "PNC", "beneficiary_type": "POSTPARTUM_WOMAN", "gestational_age_weeks": null, "muac_cm": null, "weight_kg": 58.1, "danger_sign_flag": true, "immunisation_up_to_date": null, "referral_made": true, "referral_reason": "Postpartum haemorrhage signs — heavy bleeding reported day 3", "commodities_dispensed": [ "ORS" ], "visit_duration_min": 40 } ]
Request Dataset Access

All datasets are available under a commercial licence agreement. Our team typically responds within 2 business days.

Request Access
NDA may be required

Build with Data that reflects Africa

Request access to our full catalog of licensed human-validated African datasets or request custom data tailored to your project.