DS-17 Supply Chain

African Last-Mile Logistics & Delivery Dataset

5M+ GPS-tracked delivery records and driver logs from e-commerce operators across Nigeria, Kenya, and South Africa — including geocoding quality scores, traffic-adjusted ETAs, and failed-delivery reason codes for training route-optimisation, ETA prediction, and address-normalisation AI.

This is a synthetic dataset generated from high-quality expert-labelled seed data. All records are algorithmically derived — statistical distributions, inter-field correlations, and annotation characteristics faithfully replicate real-world patterns from the source data, while ensuring no real individual, organisation, or transaction can be identified or reconstructed.

The African Last-Mile Logistics & Delivery Dataset captures 5M+ individual delivery attempt records collected from e-commerce and quick-commerce operators across Lagos, Nairobi, and Johannesburg / Cape Town. Each record tracks the full lifecycle of a delivery order from dispatch to final outcome — recording depot departure time, GPS waypoints at key events (dispatch, arrival at delivery zone, delivery attempt, return to depot), traffic-adjusted ETA at dispatch, actual delivery time, and the outcome code (delivered, failed, rescheduled, returned).

A distinctive feature of the dataset is its geocoding quality layer. African addressing is highly informal — many delivery zones rely on landmark descriptions, area codes, and driver local knowledge rather than structured street addresses. Each record includes a geocoding confidence score (0–1), an address type classification (STRUCTURED, LANDMARK, AREA_CODE, FREE_TEXT), and a normalised coordinate pair derived from a best-effort geocoder pipeline. Failed deliveries carry a structured reason code from a 12-class taxonomy (customer absent, address not found, access denied, security zone, etc.) enabling targeted model development for specific failure modes.

The dataset supports route optimisation under realistic African urban constraints — traffic congestion, road quality variation, fuel cost sensitivity, and address ambiguity. It is structured as a time-stamped event log compatible with common logistics ML frameworks and geospatial libraries (GeoPandas, H3, OSRM). A companion road-network graph (OpenStreetMap-derived) is provided for routing model development.

Key Use Cases

Last-mile route optimisation under African urban constraints
Delivery ETA prediction with traffic and address-quality features
Address normalisation and geocoding quality improvement
Failed delivery root-cause classification and prevention
Driver performance scoring and incentive optimisation
Dynamic re-routing triggered by real-time traffic events
Delivery zone clustering and hub placement optimisation
Customer-facing delivery tracking and communication AI

Dataset Highlights

Delivery Records
5M+
GPS-tracked attempts
Cities Covered
3
Lagos, Nairobi, Johannesburg / Cape Town
Failure Reason Codes
12
structured delivery outcome taxonomy
Address Types
4
structured, landmark, area code, free text

Geographic Coverage

Primary Coverage
Other Regions

Dataset Schema

Each record represents one delivery attempt. Fields cover order identity, timing, GPS tracking, address quality, and outcome classification.

Field NameTypeDescriptionNullableExample
delivery_id STRING Unique delivery attempt identifier No DLV-NGA-LG-20230914-0041823
order_id STRING Parent order identifier (multiple attempts may share this) No ORD-NGA-0918234
attempt_number INTEGER Delivery attempt sequence number for this order (1 = first attempt) No 1
country_code STRING ISO 3166-1 alpha-2 country code No NG
city STRING City of delivery No Lagos
dispatch_timestamp DATETIME ISO 8601 dispatch datetime from depot No 2023-09-14T08:32:00Z
eta_at_dispatch_min INTEGER Traffic-adjusted ETA in minutes at time of dispatch Yes 47
actual_delivery_min INTEGER Actual time from dispatch to delivery outcome in minutes (null if unresolved) Yes 63
address_type ENUM Address format: STRUCTURED, LANDMARK, AREA_CODE, FREE_TEXT No LANDMARK
geocode_confidence FLOAT Geocoding confidence score (0–1, higher = more precise location) No 0.71
dest_lat FLOAT Destination latitude (WGS 84, best-effort geocoded) Yes 6.4281
dest_lng FLOAT Destination longitude (WGS 84, best-effort geocoded) Yes 3.4219
outcome ENUM Delivery outcome: DELIVERED, FAILED, RESCHEDULED, RETURNED No DELIVERED
failure_reason STRING Failure reason code from 12-class taxonomy (null if delivered) Yes null
driver_id STRING Anonymised driver identifier No DRV-NGA-0841
vehicle_type ENUM Delivery vehicle: MOTORCYCLE, CAR, VAN, BICYCLE No MOTORCYCLE

Sample Records

Four representative delivery records spanning countries, address types, outcomes, and vehicle types.

logistics_sample.json
[ { "delivery_id": "DLV-NGA-LG-20230914-0041823", "order_id": "ORD-NGA-0918234", "attempt_number": 1, "country_code": "NG", "city": "Lagos", "dispatch_timestamp": "2023-09-14T08:32:00Z", "eta_at_dispatch_min": 47, "actual_delivery_min": 63, "address_type": "LANDMARK", "geocode_confidence": 0.71, "dest_lat": 6.4281, "dest_lng": 3.4219, "outcome": "DELIVERED", "failure_reason": null, "driver_id": "DRV-NGA-0841", "vehicle_type": "MOTORCYCLE" }, { "delivery_id": "DLV-NGA-LG-20230914-0041901", "order_id": "ORD-NGA-0891042", "attempt_number": 2, "country_code": "NG", "city": "Lagos", "dispatch_timestamp": "2023-09-14T14:15:00Z", "eta_at_dispatch_min": 38, "actual_delivery_min": null, "address_type": "FREE_TEXT", "geocode_confidence": 0.29, "dest_lat": null, "dest_lng": null, "outcome": "FAILED", "failure_reason": "ADDRESS_NOT_FOUND", "driver_id": "DRV-NGA-0212", "vehicle_type": "MOTORCYCLE" }, { "delivery_id": "DLV-KEN-NB-20231022-0008741", "order_id": "ORD-KEN-0244091", "attempt_number": 1, "country_code": "KE", "city": "Nairobi", "dispatch_timestamp": "2023-10-22T10:05:00Z", "eta_at_dispatch_min": 31, "actual_delivery_min": 29, "address_type": "STRUCTURED", "geocode_confidence": 0.94, "dest_lat": -1.2921, "dest_lng": 36.8219, "outcome": "DELIVERED", "failure_reason": null, "driver_id": "DRV-KEN-0334", "vehicle_type": "CAR" }, { "delivery_id": "DLV-ZAF-JB-20231108-0031204", "order_id": "ORD-ZAF-0671830", "attempt_number": 1, "country_code": "ZA", "city": "Johannesburg", "dispatch_timestamp": "2023-11-08T09:48:00Z", "eta_at_dispatch_min": 55, "actual_delivery_min": null, "address_type": "AREA_CODE", "geocode_confidence": 0.58, "dest_lat": -26.2041, "dest_lng": 28.0473, "outcome": "RESCHEDULED", "failure_reason": "CUSTOMER_ABSENT", "driver_id": "DRV-ZAF-0189", "vehicle_type": "VAN" } ]
Request Dataset Access

All datasets are available under a commercial licence agreement. Our team typically responds within 2 business days.

Request Access
NDA may be required

Build with Data that reflects Africa

Request access to our full catalog of licensed human-validated African datasets or request custom data tailored to your project.