A curated, structured, link-complete directory of high-quality datasets.
- 1. General Dataset Portals
- 2. Government Open Data
- 3. Society & Development
- 4. Ecology & Biodiversity
- 5. Land Cover & Geography
- 6. Oceans
- 7. Social Networks
- 8. Linguistics
- 9. Scientific Publications
- 10. APIs & Tools
- 11. Curated Lists
- Awesome Public Datasets — https://github.com/awesomedata/awesome-public-datasets
- BuzzFeed News GitHub — https://github.com/orgs/BuzzFeedNews/repositories?type=all
- CareerFoundry Dataset Overview — https://careerfoundry.com/en/blog/data-analytics/where-to-find-free-datasets/
- Culturomics (Google Ngrams etc.) — http://www.culturomics.org/
- data.world — https://data.world/search
- Ecological Data Wiki — https://ecologicaldata.org/home
- FiveThirtyEight Data — https://data.fivethirtyeight.com/
- Google Dataset Search — https://toolbox.google.com/datasetsearch
- Google Dataset Search Blog — https://www.blog.google/products/search/making-it-easier-discover-datasets/
- Kaggle — https://www.kaggle.com/
- Nature News — Virtual Satellite (Nature Trace) — https://www.nature.com/articles/d41586-025-02412-1
- Nature Trace (Earth Engine datasets) — https://developers.google.com/earth-engine/datasets/publisher/nature-trace
- Pew Research Internet Datasets — https://www.pewresearch.org/internet/datasets/
- R for Ecology — Top Five(ish) Sources of Ecological Data — https://www.rforecology.com/post/top-five-ish-sources-of-ecological-data/
- Your GIS / Dataset Spreadsheet — https://docs.google.com/spreadsheets/d/1OW6X94Ha_-_v5fe8XJlQp7ncubgF1CG6LgAIUk1kFcY/edit?gid=1637976342#gid=1637976342
- AidData — https://www.aiddata.org/datasets
- Australia (data.gov.au) — http://data.gov.au/
- CDRC (UK Consumer Data Research Centre) — https://data.cdrc.ac.uk/
- Eurostat — http://ec.europa.eu/eurostat/web/main/home
- FAO Land & Water — http://www.fao.org/land-water/databases-and-software/en/
- France (data.gouv.fr) — http://www.data.gouv.fr/
- Germany (govdata.de) — https://www.govdata.de/
- Ghana (data.gov.gh) — http://data.gov.gh/
- Hong Kong PSI — http://www.gov.hk/en/theme/psi/datasets/
- Japan (data.go.jp) — http://www.data.go.jp/
- Open Data Portals Directory — http://www.data.gov/opendatasites
- United Kingdom (data.gov.uk) — https://data.gov.uk/
- United Nations Data — http://data.un.org/
- United States (data.gov) — https://www.data.gov/
- ASDFree — US Survey Data Tutorials — http://www.asdfree.com/
- Biodiversity Indicators Partnership (BIP) — https://www.bipindicators.net/
- Gapminder (Development & Health) — http://www.gapminder.org/
- Happy Planet Index — http://happyplanetindex.org/
- Human Well-being (ScienceDirect article & data) — https://www.sciencedirect.com/science/article/pii/S0921800920322084
- Indigenous Populations Dataset (bioRxiv) — https://www.biorxiv.org/content/10.1101/2019.12.11.873695v3.full.pdf
- Inequality Dataset (Ellis) — http://ellisp.github.io/blog/2017/07/22/inter-country-inequality
- Our World in Data (history, health, development, etc.) — https://ourworldindata.org/
- World Happiness Report (Kaggle) — https://www.kaggle.com/datasets/mathurinache/world-happiness-report-20152021
- Worldometers (population, demographics, etc.) — https://www.worldometers.info/
- World Bank — World Development Indicators (Kaggle) — https://www.kaggle.com/worldbank/world-development-indicators
- World Bank Data360R (R package interface) — https://cran.r-project.org/web/packages/data360r/index.html
- DHS Surveys — https://dhsprogram.com/data/
- ENACT — Global Electrification Database — https://data.jrc.ec.europa.eu/dataset/be02937c-5a08-4732-a24a-03e0a48bdcda (DOI: 10.1038/s41467-020-18344-5)
- Fichier localisé social et fiscal (Filosofi) — https://www.insee.fr/fr/metadonnees/source/serie/s1172
- GlobPOP — https://zenodo.org/records/10088105 (DOI: 10.1038/s41597-024-02913-0)
- Gridded Population of the World (GPWv4) — https://www.earthdata.nasa.gov/data/projects/gpw
- History Database of the Global Environment (HYDE) v3.5 — https://public.yoda.uu.nl/geo/UU01/F45D44.html (DOI: 10.5194/essd-9-927-2017)
- IPUMS-DHS — https://www.idhsdata.org/
- Kummu et al. 2018 — Global Population & Human Water Security — https://datadryad.org/stash/dataset/doi:10.5061/dryad.dk1j0 (DOI: 10.1038/sdata.2018.4)
- LandScan — https://landscan.ornl.gov/
- Population Modeling in R (tutorial with data) — https://rviews.rstudio.com/2017/10/09/population-modeling-in-r/
- WorldPop — Datasets by Category — https://hub.worldpop.org/project/categories?id=18 (DOI: 10.1080/20964471.2019.1625151)
- WorldPop Portal — https://www.worldpop.org/
- Global Human Settlement Population (GHS-POP) — https://jeodpp.jrc.ec.europa.eu/ftp/jrc-opendata/GHSL/GHS_POP_GLOBE_R2023A/ (DOI: 10.2905/2FF68A52-5B5B-4A22-8F40-C41DA8332CFE)
- Connaissance locale de l'appareil productif (CLAP) — https://www.insee.fr/fr/statistiques/2021289
- Infochimps Marketplace — http://www.infochimps.com/marketplace
- Kummu et al. 2018 — Global Wealth & Resource Use — https://datadryad.org/stash/dataset/doi:10.5061/dryad.dk1j0 (DOI: 10.1038/sdata.2018.4)
- quantmod (R financial data) — http://www.quantmod.com/
- BIEN — Botanical Information & Ecology Network — http://bien.nceas.ucsb.edu/
- eBird — https://ebird.org/home
- GBIF — Global Biodiversity Information Facility — http://www.gbif.org/
- iDigBio — https://www.idigbio.org/
- OneZoom Tree of Life — http://www.onezoom.org/life.html
- PREDICTS — http://www.predicts.org.uk/
- PREDICTS Dataset (NHM) — http://data.nhm.ac.uk/dataset/902f084d-ce3f-429f-a6a5-23162c73fdf7
- SCRFA — Reef Fish Spawning Aggregations — https://www.scrfa.org/
- SPLINK (Brazilian species data) — http://splink.cria.org.br/
- Virus–Host Dataset (mBio) — https://journals.asm.org/doi/10.1128/mbio.02985-21
- Annual Terrestrial Footprint — https://doi.org/10.6084/m9.figshare.16571064 (DOI: 10.1038/s41597-022-01284-8)
- EarthEnv Texture Metrics — https://www.earthenv.org/texture
- Global Land Cover / Land Use Change 2000–2020 (GLAD app) — https://glad.earthengine.app/view/glcluc-2000-2020
- Global Vegetation Dataset (iDiv) — https://idata.idiv.de/ddm/Data/ShowData/3474
- GLOBIO4 Biodiversity Intactness — https://www.globio.info/globio-data-downloads (DOI: 10.1111/gcb.14848)
- Human Footprint (SEDAC) — http://sedac.ciesin.columbia.edu/data/set/wildareas-v1-human-footprint-ighp
- Human Footprint Index (HFI) — https://datadryad.org/stash/dataset/doi:10.5061/dryad.052q5 (DOI: 10.1038/sdata.2016.67)
- Human Impact Index (WCS) — https://wcshumanfootprint.org/data-access (DOI: 10.32942/osf.io/d7rh6)
- Land-Use Change 1982–2016 (Song et al.) — https://www.nature.com/articles/s41586-018-0411-9/
- Land-Use Change Dataset (UMD GLAD) — https://glad.umd.edu/dataset/long-term-global-land-change (DOI: 10.1038/s41586-018-0411-9)
- Landscape Fragmentation – Effective Mesh Density (EEA) — https://www.eea.europa.eu/en/datahub/datahubitem-view/9d0b51f9-047d-4af1-89eb-3756e46ffc53?activeAccordion=1083720
- Terrestrial Human Footprint (HFP-100) — https://datadryad.org/dataset/doi:10.5061/dryad.ttdz08m1f (DOI: 10.3389/frsen.2023.1130896)
- CaRHAB (French Habitat Mapping Programme) — https://inpn.mnhn.fr/programme/carhab
- CARTNAT — Conservation Value of Terrestrial Habitats in France — https://uicn-ressources.fr/CartNat/Donnees%20cartographiques.zip (DOI: 10.1038/s43247-025-02160-0)
- Ecoregions 2017 (TEOW update) — https://storage.googleapis.com/teow2016/Ecoregions2017.zip (DOI: 10.1093/biosci/bix014)
- Freshwater Ecoregions of the World (FEOW) — https://www.feow.org/download (DOI: 10.1641/B580507)
- Geodiversity (global index) — https://datadryad.org/dataset/doi:10.5061/dryad.crjdfn39c (DOI: 10.1098/rsta.2023.0173)
- Global Biome Cluster — https://zenodo.org/records/5848610 (DOI: 10.5281/zenodo.5848610)
- Global Ecological Zones (FAO GEZ) — https://data.apps.fao.org/catalog/dataset/2fb209d0-fd34-4e5e-a3d8-a13c241eb61b
- Global Ecosystem Typology (IUCN GET) — https://zenodo.org/records/10081251 (DOI: 10.1038/s41586-022-05318-4)
- Global Maps of Habitat Types — https://doi.org/10.5281/zenodo.4058819 (DOI: 10.1038/s41597-020-00599-8)
- Global Environmental Stratification (GEnS) — https://datashare.ed.ac.uk/handle/10283/3089 (DOI: 10.1111/geb.12022)
- GRIIS — Global Register of Introduced and Invasive Species — http://www.griis.org/
- GRECO & SER — French Ecological Regions — https://inventaire-forestier.ign.fr/spip/spip.php?article773
- Marine Ecoregions of the World (MEOW) — https://www.worldwildlife.org/publications/marine-ecoregions-of-the-world-a-bioregionalization-of-coastal-and-shelf-areas (DOI: 10.1641/B570707)
- Natura 2000 — https://www.eea.europa.eu/en/datahub/datahubitem-view/6fc8ad2d-195d-40f4-bdec-576e7d1268e4
- NOBANIS — European Invasive Alien Species — https://www.nobanis.org/
- Protected Planet API (includes WDPA) — https://api.protectedplanet.net/documentation
- Sea Around Us — Marine Protected Areas — http://www.seaaroundus.org/data/#/mpa
- Terrestrial Ecoregions of the World (TEOW) — https://www.worldwildlife.org/publications/terrestrial-ecoregions-of-the-world (DOI: 10.1641/0006-3568(2001)051[0933:TEOTWA]2.0.CO;2)
- World Database on Protected Areas (WDPA) — https://www.protectedplanet.net/en/thematic-areas/wdpa?tab=WDPA
- World Terrestrial Ecosystems — https://www.sciencebase.gov/catalog/item/6296791ed34ec53d276bb293 (DOI: 10.1016/j.gecco.2019.e00860)
- CHELSA Climate Data — https://chelsa-climate.org/ (DOI: 10.1038/s41597-020-00587-y)
- E-OBS Gridded Observations for Europe — https://surfobs.climate.copernicus.eu/dataaccess/access_eobs.php (DOI: 10.1029/2017JD028200)
- ERA5-Land Reanalysis — https://cds.climate.copernicus.eu/datasets/reanalysis-era5-land-monthly-means?tab=overview (DOI: 10.24381/cds.adbb2d47)
- EuMedClim — https://entrepot.recherche.data.gouv.fr/dataset.xhtml?persistentId=doi:10.15454/1.505380010373349E12 (DOI: 10.15454/1.505380010373349E12)
- European Downscaled Daily Climate v4 — https://figshare.com/articles/online_resource/Description_and_Evaluation_of_Downscaled_Daily_Climate_Data_Version_4/22962671/1?file=40704293 (DOI: 10.1016/j.envsoft.2023.105627)
- PUG-BIOCLIMATE-1km-ERA5 (Regional) — https://cds.climate.copernicus.eu/datasets/sis-biodiversity-era5-regional (DOI: 10.24381/cds.fe90a594)
- PUG-BIOCLIMATE-ERA5 (Global) — https://cds.climate.copernicus.eu/datasets/sis-biodiversity-era5-global (DOI: 10.24381/cds.bce175f0)
- S2M SAFRAN Snow & Atmosphere Reanalysis — https://www.aeris-data.fr/en/landing-page/?uuid=865730e8-edeb-4c6b-ae58-80f95166509b (DOI: 10.5194/essd-14-1707-2022)
- SIM SAFRAN (France) — https://geodata.inrae.fr/geonetwork/srv/api/records/4ee133fa-d4cb-44d8-b708-46946573ba5f
- TerraClimate — https://www.climatologylab.org/terraclimate.html (DOI: 10.1038/sdata.2017.191)
- WorldClim — https://www.worldclim.org/ (DOI: 10.1002/joc.5086)
- Biomass Carbon Density (Global 2010) — https://www.earthdata.nasa.gov/data/catalog/ornl-cloud-global-maps-c-density-2010-1763-1 (DOI: 10.1038/s41597-020-0444-4)
- CCI BIOMASS — https://climate.esa.int/en/projects/biomass/data/
- Global Aridity Index & Potential Evapotranspiration (Global-AI_PET v3) — https://figshare.com/articles/dataset/Global_Aridity_Index_and_Potential_Evapotranspiration_ET0_Climate_Database_v2/7504448 (DOI: 10.1038/s41597-022-01493-1)
- Global Ecosystem Dynamics Investigation (GEDI) L3 — https://www.earthdata.nasa.gov/data/catalog/ornl-cloud-gedi-l3-landsurface-metrics-v2-1952-2 (DOI: 10.3334/ORNLDAAC/1952)
- GLEAM v4 (Global Land Evaporation Amsterdam Model) — https://zenodo.org/records/14724263 (DOI: 10.1038/s41597-025-04610-y)
- GlobBiomass — https://globbiomass.org/wp-content/uploads/GB_Maps/Globbiomass_global_dataset.html
- High-Resolution Vegetation Products (Copernicus HR-VPP) — https://land.copernicus.eu/en/products/vegetation
- Liu et al. 2023 — Global Canopy Height / Biomass — https://zenodo.org/records/8154445 (DOI: 10.1126/sciadv.adh4097)
- MODIS Primary Productivity (MOD17A3HGF) — https://lpdaac.usgs.gov/products/mod17a3hgfv061/ (DOI: 10.5067/MODIS/MOD17A3HGF.061)
- MODIS Vegetation Index (MOD13Q1) — https://www.earthdata.nasa.gov/data/catalog/lpcloud-mod13q1-061 (DOI: 10.5067/MODIS/MOD13Q1.061)
- TreeOfLife-10M — https://huggingface.co/datasets/imageomics/TreeOfLife-10M
- Imageomics — https://huggingface.co/imageomics
- BD TOPO (France) — https://geoservices.ign.fr/bdtopo
- Cartobio (France organic agriculture) — https://www.data.gouv.fr/fr/datasets/616d6531c2951bbe8bd97771/
- CEREMA — Consumption of Natural, Agricultural & Forest Areas — https://www.data.gouv.fr/fr/datasets/consommation-despaces-naturels-agricoles-et-forestiers-du-1er-janvier-2009-au-1er-janvier-2024/
- CLC Plus (CLC-Backbone) — https://land.copernicus.eu/en/products/clc-backbone (DOI: 10.2909/b0bd43c6-1fa1-4d88-9c45-98b13a95d0b2)
- Corine Land Cover (CLC) — https://land.copernicus.eu/en/products/corine-land-cover (DOI: 10.2909/960998c1-1870-4e82-8051-6485205ebbac)
- CROPGRIDS — https://figshare.com/articles/dataset/CROPGRIDS/22491997 (DOI: 10.1038/s41597-024-03247-7)
- Dedieu & Pomeon 2024 — French Cropland Dataset — https://entrepot.recherche.data.gouv.fr/dataset.xhtml?persistentId=doi:10.57745/SHXHP4 (DOI: 10.57745/SHXHP4)
- EUCROPMAP — https://data.jrc.ec.europa.eu/dataset/15f86c84-eae1-4723-8e00-c1b35c8f56b9 (DOI: 10.1016/j.rse.2021.112708)
- EU Field Boundaries — https://zenodo.org/records/14229033 (DOI: 10.5281/zenodo.14229033)
- ESA CCI Land Cover — https://cds.climate.copernicus.eu/datasets/satellite-land-cover?tab=download (DOI: 10.24381/cds.006f2c9a)
- European Land Systems — https://doi.org/10.34894/XNC5KA (DOI: 10.1007/s10980-021-01227-5)
- European Land Systems (paper) — https://doi.org/10.1007/s10980-021-01227-5
- GLC_FCS30D — Global 30m Land Cover — https://zenodo.org/records/15063683 (DOI: 10.5194/essd-16-1353-2024)
- Global Dynamic Land Cover (Copernicus) — https://land.copernicus.eu/en/products/global-dynamic-land-cover
- Global Forest Change (Hansen et al.) — https://storage.googleapis.com/earthenginepartners-hansen/GFC-2024-v1.12/download.html (DOI: 10.1126/science.1244693)
- Global Forest Cover Change (GFCC30TC) — https://www.earthdata.nasa.gov/data/catalog/lpcloud-gfcc30tc-003 (DOI: 10.1080/17538947.2013.786146)
- Global Distribution of Field Size — https://pure.iiasa.ac.at/id/eprint/15526/ (DOI: 10.1111/gcb.14492)
- Grassland — Copernicus High Resolution Layer — https://land.copernicus.eu/en/products/high-resolution-layer-grassland
- HILDA+ — https://doi.org/10.1594/PANGAEA.921846 (DOI: 10.1038/s41467-021-22702-2)
- Landsat Soil Spectral Indices (NDTI etc.) — https://stac.ecodatacube.eu/ndti_glad.landsat.ard2.seasconv.m.yearly/collection.json?.language=en (DOI: 10.5281/zenodo.10851081)
- MODIS Land Cover (MCD12Q1) — https://lpdaac.usgs.gov/products/mcd12q1v061/ (DOI: 10.5067/MODIS/MCD12Q1.061)
- OSO Land Cover (France) — https://geodes-portal.cnes.fr/ (DOI: 10.3390/rs9010095)
- Pan-European Land Cover Map 2015 — https://doi.pangaea.de/10.1594/PANGAEA.896282 (DOI: 10.1016/j.rse.2018.12.001)
- Panhelleux et al. 2023 — Landscape Metrics Dataset — https://zenodo.org/records/7895449 (DOI: 10.1016/j.dib.2023.109348)
- Registre Parcellaire Graphique (RPG) — https://geoservices.ign.fr/rpg#telechargement
- Rousseau et al. 2024 — Land-Sea Change Dataset — https://metadata.imas.utas.edu.au/geonetwork/srv/eng/catalog.search#/metadata/1241a51d-c8c2-4432-aa68-3d2bae142794 (DOI: 10.25959/MNGY-0Q4313)
- WorldCereal — https://zenodo.org/records/7875105 (DOI: 10.5194/essd-2023-184)
- WorldCover (ESA WorldCover) — https://esa-worldcover.org/en/data-access (DOI: 10.5281/zenodo.7254221)
- Air Pollution (INERIS, France) — https://www.ineris.fr/fr/recherche-appui/risques-chroniques/mesure-prevision-qualite-air/qualite-air-france-metropolitaine (DOI: 10.5194/essd-14-2419-2022)
- BD Carto (France) — https://geoservices.ign.fr/bdcarto
- BD Topage (France Hydrography) — https://www.sandre.eaufrance.fr/atlas/srv/fre/catalog.search#/metadata/82752235-2ddf-4b62-a82f-6ea276671f18
- Cartes de bruit stratégiques (CBS, noise maps) — https://www.data.gouv.fr/fr/datasets/cartes-de-bruit-strategiques-des-reseaux-routiers-et-ferroviaires-non-concedes-directive-europeenne-2002-49-ce/
- EEA Reference Grid — https://sdi.eea.europa.eu/catalogue/srv/api/records/aac8379a-5c4e-445c-b2ef-23a6a2701ef0?language=all
- EMEP MSC-W Atmospheric Chemistry Transport Model — https://www.emep.int/mscw/mscw_moddata.html (DOI: 10.5194/acp-12-7825-2012)
- ENACT — Global Electrification Database — https://data.jrc.ec.europa.eu/dataset/be02937c-5a08-4732-a24a-03e0a48bdcda (DOI: 10.1038/s41467-020-18344-5)
- European Soil Database (ESDB v2) — https://esdac.jrc.ec.europa.eu/content/european-soil-database-v2-raster-library-1kmx1km (DOI: 10.1111/ejss.13315)
- Global Roads Inventory Project (GRIP) — https://www.globio.info/download-grip-dataset (DOI: 10.1088/1748-9326/aabd42)
- Global Streamflow Characteristics Dataset (GSCD) — https://www.gloh2o.org/gscd/ (DOI: 10.1175/JHM-D-14-0155.1)
- Global Surface Water (GSW) — https://global-surface-water.appspot.com/download (DOI: 10.1038/nature20584)
- Hydrography90m — https://hydrography.org/hydrography90m/hydrography90m_layers (DOI: 10.5194/essd-14-4525-2022)
- Landscape Fragmentation – Effective Mesh Density (EEA) — https://www.eea.europa.eu/en/datahub/datahubitem-view/9d0b51f9-047d-4af1-89eb-3756e46ffc53?activeAccordion=1083720
- Quietness Suitability Index — https://sdi.eea.europa.eu/catalogue/srv/eng/catalog.search#/metadata/e9151c34-da65-48b9-a2ca-b9b835480812
- River Quality (France) — https://qualite-riviere.lesagencesdeleau.fr/
- Route500 (France Road Network) — https://geoservices.ign.fr/route500
- Topsoil Physical & Chemical Properties (LUCAS) — https://esdac.jrc.ec.europa.eu/content/topsoil-physical-properties-europe-based-lucas-topsoil-data (DOI: 10.1016/j.geoderma.2019.113912)
- VIIRS Nighttime Lights — https://eogdata.mines.edu/products/vnl/ (DOI: 10.3390/rs13050922)
- Water and Wetness (Copernicus HRL WAW) — https://land.copernicus.eu/en/products/high-resolution-layer-water-and-wetness/water-and-wetness-status-2018 (DOI: 10.2909/7992f641-bf77-47b7-b0c1-74fc832b78b1)
- Waterbase (EEA Water Quality) — https://www.eea.europa.eu/en/datahub/datahubitem-view/fbf3717c-cd7b-4785-933a-d0cf510542e1
- ALOS World 3D - 30m (AW3D30) — https://www.eorc.jaxa.jp/ALOS/en/dataset/aw3d30/aw3d30_e.htm (DOI: 10.5194/isprs-archives-XLIII-B4-2020-183-2020)
- ASTER GDEM v3 — https://www.earthdata.nasa.gov/data/catalog/lpcloud-astgtm-003 (DOI: 10.5067/ASTER/ASTGTM.003)
- Copernicus DEM GLO-30 — https://doi.org/10.5270/ESA-c5d3d65 (DOI: 10.5270/ESA-c5d3d65)
- EarthEnv-DEM90 — https://www.earthenv.org/DEM (DOI: 10.1016/j.isprsjprs.2013.11.002)
- ETOPO Global Relief Model — https://www.ncei.noaa.gov/products/etopo-global-relief-model (DOI: 10.25921/fd45-gt74)
- European Digital Terrain Models (EU DTM) — https://zenodo.org/records/4724549 (DOI: 10.5281/zenodo.4724549)
- FABDEM — https://data.bris.ac.uk/data/dataset/s5hqmjcdj8yo2ibzi9b4ew3sn (DOI: 10.1088/1748-9326/ac4d4f)
- GTOPO30 — https://doi.org/10.5066/F7DF6PQS (DOI: 10.1029/99EO00050)
- Global Ensemble Digital Terrain Model (GEDTM30) — https://zenodo.org/records/15689805 (DOI: 10.7717/peerj.19673)
- GEBCO Gridded Bathymetry — https://www.gebco.net/data-products/gridded-bathymetry-data (DOI: 10.5285/1c44ce99-0a0d-5f4f-e063-7086abc0ea0f)
- Earth Nullschool (visualisation) — http://earth.nullschool.net
- LIDAR HD (France) — https://geoservices.ign.fr/lidarhd
- RGE ALTI (France) — https://geoservices.ign.fr/rgealti
- SRTM 30 m — https://dwtkns.com/srtm30m/
- Tandem-X (TDM30) — https://geoservice.dlr.de/web/dataguide/tdm30/
- Base de Données Géographique des Sols de France (BDGSF) — https://entrepot.recherche.data.gouv.fr/dataverse/bdgsf (DOI: 10.15454/BPN57S)
- European Soil Database (ESDB v2) — https://esdac.jrc.ec.europa.eu/content/european-soil-database-v2-raster-library-1kmx1km (DOI: 10.1111/ejss.13315)
- Global Rainfall Erosivity (GloREDa) — https://esdac.jrc.ec.europa.eu/content/gloreda (DOI: 10.1016/j.dib.2023.109482)
- Harmonized World Soil Database (HWSD v2.0) — https://gaez.fao.org/pages/hwsd (DOI: 10.1016/j.geoderma.2016.01.034)
- SoilGrids — https://www.isric.org/explore/soilgrids (DOI: 10.5194/soil-7-217-2021)
- Topsoil Physical & Chemical Properties (LUCAS) — https://esdac.jrc.ec.europa.eu/content/topsoil-physical-properties-europe-based-lucas-topsoil-data (DOI: 10.1016/j.geoderma.2019.113912)
- AquaMaps (marine species distributions) — https://www.aquamaps.org/
- Bio-ORACLE (marine environmental rasters) — http://www.bio-oracle.org/code.php
- Copernicus Marine Data (e.g., GLORYS12V1) — https://data.marine.copernicus.eu/product/GLOBAL_MULTIYEAR_PHY_001_030/description (DOI: 10.48670/moi-00021)
- FishBase — https://www.fishbase.se/
- GEBCO Gridded Bathymetry — https://www.gebco.net/data-products/gridded-bathymetry-data (DOI: 10.5285/1c44ce99-0a0d-5f4f-e063-7086abc0ea0f)
- GlobColour Ocean Colour Products — https://hermes.acri.fr/index.php (DOI: 10.48670/moi-00281)
- Global Fishing Watch — https://globalfishingwatch.org/data/our-apis-portal/
- NOAA NODC/NCEI (oceanographic data) — https://www.nodc.noaa.gov/access/allproducts.html
- OBIS — Ocean Biodiversity Information System — https://obis.org/
- PNAS Global Islands Dataset — https://www.pnas.org/content/110/38/15307
- Flickr Aesthetics Dataset — http://www.di.unito.it/~schifane/dataset/beauty-icwsm15/#download
- Flickr Geospatial Tutorial — http://data-analytics.net/wp-content/uploads/2014/09/geospatial2.html
- Flickr YFCC100M (Yahoo Webscope) — https://webscope.sandbox.yahoo.com/catalog.php?datatype=i&did=67
- SIL Linguistic Computing Resources — http://www-01.sil.org/linguistics/computing.html
- Academic Accelerator (journal metrics / APIs) — https://academic-accelerator.com/
- Bioxbio (Impact Factor history) — https://www.bioxbio.com/
- Clarivate Master Journal List — https://mjl.clarivate.com/home
- Connected Papers (paper similarity graph) — https://www.connectedpapers.com/
- Crossref REST API — https://api.crossref.org/
- Dimensions (free tier) — https://www.dimensions.ai/products/free/
- JANE — Journal/Author Name Estimator — http://jane.biosemantics.org/
- Journal Indicators (CWTS) — https://www.journalindicators.com/methodology
- OpenAlex Docs — https://docs.openalex.org/
- OpenAlex API — https://api.openalex.org
- SCImago Journal Rank — https://www.scimagojr.com/journalrank.php?wos=false
- Scopus ASJC Codes (data file) — https://github.com/dhimmel/scopus/blob/master/data/asjc-codes.tsv
- Global Fishing Watch API Python Client — https://github.com/GlobalFishingWatch/gfw-api-python-client
- Global Fishing Watch R Client (gfwr) — https://globalfishingwatch.github.io/gfwr/
- Global Fishing Watch APIs — https://globalfishingwatch.org/data/our-apis-portal/
- Protected Planet API — https://api.protectedplanet.net/documentation
- rOpenSci Packages — https://ropensci.org/packages/
- Social APIs in the R Ecosystem
  - twitteR (Twitter API client)
  - RFacebook (Facebook API client)
  - RGoogleMaps (Google Maps)
  - rfigshare (Figshare)
  - rplos (PLOS)
- AWS Public Datasets (Registry of Open Data) — https://registry.opendata.aws/
- CMU StatLib Archive — http://lib.stat.cmu.edu/
- GEO — Gene Expression Omnibus — https://www.ncbi.nlm.nih.gov/geo/
- Gregory Piatetsky-Shapiro (KDnuggets datasets & resources) — http://www.kdnuggets.com/gps.html
- Hilary Mason Dataset Bundle — http://bitly.com/bundles/hmason/1
- Jeff Hammerbacher — Intro to Data Science Datasets — http://www.quora.com/Jeff-Hammerbacher/Introduction-to-Data-Science-Data-Sets
- Mortar Data — Dataset Lists Curated by Data Scientists — http://blog.mortardata.com/post/67652898761/6-dataset-lists-curated-by-data-scientists
- Peter Skomoroch Delicious Dataset Bookmarks — https://delicious.com/pskomoroch/dataset
- SNAP — Stanford Large Network Dataset Collection — https://snap.stanford.edu/data/
- UCI Machine Learning Repository — https://archive.ics.uci.edu/ml/index.php
- arXiv Metadata & APIs — https://arxiv.org/help/bulk_data
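
Many entries in this list end with a `(DOI: ...)` suffix. As a minimal sketch, assuming entries keep that exact suffix format, the snippet below pulls the DOI out of an entry line and builds two lookup URLs from it: the standard `https://doi.org/` resolver link and a Crossref REST API `works` URL (the API listed under Scientific Publications). The function names `extract_doi`, `doi_resolver_url`, and `crossref_work_url` are hypothetical helpers introduced for illustration, and no network request is made. DOIs registered with agencies other than Crossref (for example dataset DOIs minted through DataCite, as Zenodo does) may not be found by the Crossref endpoint.

```python
import re
from typing import Optional
from urllib.parse import quote

# Matches a trailing "(DOI: ...)" suffix as used by entries in this list.
# The greedy \S+ backtracks to the final ")" so DOIs that themselves
# contain parentheses are still captured whole.
_DOI_SUFFIX = re.compile(r"\(DOI:\s*(\S+)\)\s*$")


def extract_doi(entry: str) -> Optional[str]:
    """Return the DOI from a list entry, or None if the entry has none."""
    match = _DOI_SUFFIX.search(entry)
    if match is None:
        return None
    # Drop a stray trailing slash, which is not part of the identifier.
    return match.group(1).rstrip("/")


def doi_resolver_url(doi: str) -> str:
    """Build a doi.org resolver link, percent-encoding unusual characters."""
    return "https://doi.org/" + quote(doi, safe="/")


def crossref_work_url(doi: str) -> str:
    """Build a Crossref REST API URL for the work's metadata record."""
    return "https://api.crossref.org/works/" + quote(doi, safe="/")


if __name__ == "__main__":
    entry = "- WorldClim — https://www.worldclim.org/ (DOI: 10.1002/joc.5086)"
    doi = extract_doi(entry)
    if doi is not None:
        print(doi_resolver_url(doi))
        print(crossref_work_url(doi))
```

Fetching the Crossref URL with any HTTP client should return a JSON metadata record when the DOI is registered with Crossref. Entries without a DOI suffix yield `None` and can be skipped.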