DS-05 Agriculture

Agricultural Yield, Soil & Climate Dataset (West Africa)

300K+ geo-referenced plot records coupling observed crop yields with matched soil profiles, seasonal rainfall, temperature anomalies, and NDVI satellite indices — the foundational dataset for precision farming and climate-adaptive agriculture AI across West Africa.

This is a synthetic dataset generated from high-quality expert-labelled seed data. All records are algorithmically derived — statistical distributions, inter-field correlations, and annotation characteristics faithfully replicate real-world patterns from the source data, while ensuring no real individual, organisation, or transaction can be identified or reconstructed.

The Agricultural Yield, Soil & Climate Dataset covers 300K+ geo-referenced agricultural plot observations across Nigeria, Ghana, and Senegal — three of West Africa's most agriculturally significant economies. Each record links plot-level yield outcomes (kg/ha) for major crops (maize, cassava, sorghum, groundnut, cowpea) to matched soil-chemical profiles, seasonal climate variables, and satellite-derived vegetation indices (NDVI, EVI) sourced from Sentinel-2 and MODIS imagery.

Data originates from a combination of national agricultural extension services, geo-tagged field surveys conducted by partner agronomists, and publicly available satellite archives post-processed into 250 m grid cells. Yield records are validated against sub-national production statistics from FAO and national bureaux of statistics, with anomaly flags for suspected mis-reporting or environmental shocks (drought, flood, pest outbreak).

The dataset supports the full spectrum of agricultural AI applications — from field-level yield prediction and input optimisation to district-scale early-warning systems and parametric agri-insurance product design. It is structured to interoperate with GeoJSON boundary files and common remote-sensing Python libraries (rasterio, xarray, geopandas).

Key Use Cases

Plot-level crop yield prediction and input optimisation
Parametric agri-insurance pricing and trigger design
Climate adaptation strategy modelling for smallholders
Soil health monitoring and fertiliser recommendation engines
Drought and flood early-warning system calibration
NDVI-based crop-type classification and growth stage detection
Carbon sequestration and soil organic matter change detection
Agricultural extension service targeting and impact evaluation

Compatible Tools & Ecosystems

🐍 Python (rasterio, xarray, geopandas)
📡 Sentinel-2 / MODIS (GEE compatible)
🌍 GeoJSON / Shapefile boundaries
📊 R (terra, sf, tidymodels)
☁️ AWS S3 / GCS Parquet
🤖 PyTorch / TensorFlow / scikit-learn

Dataset Highlights

Plot Observations
300K+
geo-referenced records
Crop Types
5
maize, cassava, sorghum, groundnut, cowpea
Satellite Indices
NDVI + EVI
Sentinel-2 & MODIS
Time Span
2018 – 2024
6 growing seasons

Geographic Coverage

Primary Coverage
Other Regions

Dataset Schema

Each record represents one agricultural plot observed over a single growing season. Fields cover plot location, crop type, yield outcome, soil chemistry, seasonal climate, and satellite vegetation indices.

Field NameTypeDescriptionNullableExample
plot_id STRING Unique identifier for the agricultural plot No PLT-NGA-GW-04821
country_code STRING ISO 3166-1 alpha-2 country code No NG
admin1 STRING First-level administrative division (state / region) No Kaduna
latitude FLOAT Plot centroid latitude (WGS 84) No 10.5231
longitude FLOAT Plot centroid longitude (WGS 84) No 7.4389
season STRING Crop growing season identifier (YYYY-S1 or YYYY-S2) No 2023-S1
crop_type ENUM Crop grown: MAIZE, CASSAVA, SORGHUM, GROUNDNUT, COWPEA No MAIZE
yield_kg_ha FLOAT Observed crop yield in kilograms per hectare No 2340.5
plot_area_ha FLOAT Plot area in hectares No 0.75
soil_ph FLOAT Soil pH measured at 0–20 cm depth Yes 6.2
soil_nitrogen_pct FLOAT Soil nitrogen content as percentage of dry weight Yes 0.14
soil_organic_carbon FLOAT Soil organic carbon (g/kg) Yes 8.7
seasonal_rainfall_mm FLOAT Total seasonal rainfall in millimetres No 682
avg_temp_celsius FLOAT Average seasonal temperature in °C No 27.4
ndvi_peak FLOAT Peak NDVI value during crop growth window (0–1) Yes 0.71
evi_mean FLOAT Mean Enhanced Vegetation Index over growing season (0–1) Yes 0.42
irrigation_flag BOOLEAN True if the plot is irrigated No false
fertiliser_applied BOOLEAN True if chemical fertiliser was applied this season Yes true
anomaly_flag ENUM Environmental anomaly: NONE, DROUGHT, FLOOD, PEST, UNVERIFIED No NONE
data_source ENUM Primary data origin: FIELD_SURVEY, EXTENSION_SERVICE, SATELLITE_ONLY No FIELD_SURVEY

Sample Records

Four representative plot records spanning different crops, countries, soil profiles, and anomaly conditions.

agri_yield_sample.json
[ { "plot_id": "PLT-NGA-GW-04821", "country_code": "NG", "admin1": "Kaduna", "latitude": 10.5231, "longitude": 7.4389, "season": "2023-S1", "crop_type": "MAIZE", "yield_kg_ha": 2340.5, "plot_area_ha": 0.75, "soil_ph": 6.2, "soil_nitrogen_pct": 0.14, "soil_organic_carbon": 8.7, "seasonal_rainfall_mm": 682, "avg_temp_celsius": 27.4, "ndvi_peak": 0.71, "evi_mean": 0.42, "irrigation_flag": false, "fertiliser_applied": true, "anomaly_flag": "NONE", "data_source": "FIELD_SURVEY" }, { "plot_id": "PLT-GHA-AS-01193", "country_code": "GH", "admin1": "Ashanti", "latitude": 6.8912, "longitude": -1.6234, "season": "2023-S1", "crop_type": "CASSAVA", "yield_kg_ha": 8750, "plot_area_ha": 1.2, "soil_ph": 5.8, "soil_nitrogen_pct": 0.09, "soil_organic_carbon": 12.1, "seasonal_rainfall_mm": 1120, "avg_temp_celsius": 26.1, "ndvi_peak": 0.63, "evi_mean": 0.38, "irrigation_flag": false, "fertiliser_applied": false, "anomaly_flag": "NONE", "data_source": "EXTENSION_SERVICE" }, { "plot_id": "PLT-SEN-ZG-00348", "country_code": "SN", "admin1": "Ziguinchor", "latitude": 12.5603, "longitude": -16.2719, "season": "2022-S1", "crop_type": "GROUNDNUT", "yield_kg_ha": 940, "plot_area_ha": 0.5, "soil_ph": 6.5, "soil_nitrogen_pct": 0.11, "soil_organic_carbon": 6.3, "seasonal_rainfall_mm": 412, "avg_temp_celsius": 30.8, "ndvi_peak": 0.49, "evi_mean": 0.28, "irrigation_flag": false, "fertiliser_applied": false, "anomaly_flag": "DROUGHT", "data_source": "FIELD_SURVEY" }, { "plot_id": "PLT-NGA-KN-07651", "country_code": "NG", "admin1": "Kano", "latitude": 11.9902, "longitude": 8.5194, "season": "2023-S2", "crop_type": "SORGHUM", "yield_kg_ha": 1620, "plot_area_ha": 2, "soil_ph": null, "soil_nitrogen_pct": null, "soil_organic_carbon": null, "seasonal_rainfall_mm": 530, "avg_temp_celsius": 29.1, "ndvi_peak": 0.58, "evi_mean": null, "irrigation_flag": true, "fertiliser_applied": true, "anomaly_flag": "NONE", "data_source": "SATELLITE_ONLY" } ]
Request Dataset Access

All datasets are available under a commercial licence agreement. Our team typically responds within 2 business days.

Request Access
NDA may be required

Build with Data that reflects Africa

Request access to our full catalog of licensed human-validated African datasets or request custom data tailored to your project.