African Port and Trade Clearance Dataset
2M+ import/export declaration records with end-to-end customs timelines from major ports in Nigeria, Ghana, Kenya, and South Africa — including HS code classifications, dwell-time breakdowns, inspection outcomes, and duty assessments for trade compliance AI and port-operations optimisation.
This is a synthetic dataset generated from high-quality expert-labelled seed data. All records are algorithmically derived — statistical distributions, inter-field correlations, and annotation characteristics faithfully replicate real-world patterns from the source data, while ensuring no real individual, organisation, or transaction can be identified or reconstructed.
The African Port and Trade Clearance Dataset aggregates 2M+ import and export declaration records processed at major seaports and land border crossing points across Nigeria (Apapa, Tin Can Island), Ghana (Tema), Kenya (Mombasa), and South Africa (Durban, Cape Town). Each record captures the full customs clearance lifecycle: declaration lodgement, document verification, physical inspection decision, examination outcome, duty assessment, and release — with timestamps at each stage enabling end-to-end dwell time modelling.
Declarations are classified by HS code chapter (2-digit) and a 6-digit HS code where available, commodity description, trade flow (import/export), declared value, and country of origin or destination. Risk channel assignment (green, amber, red lane) and inspection outcomes (compliant, documentary discrepancy, undervaluation, prohibited goods) are included for the subset of declarations that were risk-selected for examination, enabling supervised customs compliance and fraud-detection model development.
The dataset is anonymised at the declarant level — importer/exporter names are replaced with sector and size-band codes — while preserving the statistical relationships between trader characteristics, commodity types, and compliance outcomes that are essential for risk-scoring model training. A companion port-level metadata file covers berth capacity, handling equipment, and seasonal throughput indices for port-operations forecasting tasks.
Key Use Cases
Dataset Highlights
Geographic Coverage
Dataset Schema
Each record represents one customs declaration. Fields cover declaration identity, commodity details, trade parties, valuation, customs process timeline, and inspection outcome.
| Field Name | Type | Description | Nullable | Example |
|---|---|---|---|---|
| declaration_id | STRING | Unique anonymised declaration identifier | No | DCL-NGA-AP-20230712-0084123 |
| country_code | STRING | ISO 3166-1 alpha-2 country of the customs authority | No | NG |
| port_code | STRING | Port or border crossing code | No | NGAPP |
| trade_flow | ENUM | Trade direction: IMPORT, EXPORT | No | IMPORT |
| declaration_date | DATE | Date of declaration lodgement (YYYY-MM-DD) | No | 2023-07-12 |
| hs_chapter | INTEGER | 2-digit HS chapter code | No | 84 |
| hs_code_6digit | STRING | 6-digit HS code (null if not available) | Yes | 847150 |
| commodity_description | STRING | Declared commodity description (free text) | No | Automatic data processing machines (laptops) |
| origin_country | STRING | ISO country code of goods origin (imports) or destination (exports) | No | CN |
| declared_value_usd | FLOAT | Declared customs value in USD | No | 48200 |
| gross_weight_kg | FLOAT | Gross weight of shipment in kilograms | Yes | 1240 |
| risk_channel | ENUM | Risk lane assigned: GREEN, AMBER, RED | No | GREEN |
| inspected | BOOLEAN | True if shipment was physically inspected | No | false |
| inspection_outcome | ENUM | Result if inspected: COMPLIANT, DOCUMENTARY_DISCREPANCY, UNDERVALUATION, PROHIBITED_GOODS (null if not inspected) | Yes | null |
| dwell_time_days | FLOAT | Total port dwell time from arrival to release in days | Yes | 4.2 |
| importer_sector | STRING | Anonymised sector code of importer / exporter | Yes | ELECTRONICS_RETAIL |
| importer_size_band | ENUM | Trader size: MICRO, SMALL, MEDIUM, LARGE | Yes | MEDIUM |
Sample Records
Four representative declaration records spanning ports, trade flows, risk channels, and inspection outcomes.
Build with Data that reflects Africa
Request access to our full catalog of licensed human-validated African datasets or request custom data tailored to your project.