SDG Data Catalog

An open, evolving global database of SDG-relevant data sets.

The SDG Data Catalog is an open, extensible, global database of data sets, metadata, and research networks built automatically by mining millions of published open access academic works.

The SDG Data Catalog leverages advancements in Artificial Intelligence (AI) and Natural Language Processing (NLP) technologies to extract and organize knowledge about public datasets that is otherwise hidden in plain sight in the continuous stream of research generated by the scientific community.

The goal, ultimately, is to connect researchers and students with SDG-relevant datasets so that their work can make meaningful progress towards social good. 

Research Paper

Hidden in Plain Sight: Building a Global Sustainable Development Data Catalogue

By James Hodson & Andy Spezzati

Modern scientific research for Sustainable Development depends on the availability of large amounts of relevant real-world data. However, there are currently no extensive global databases that associate existing data sets with the research domains they cover.

We present the SDG Data Catalogue – an open, extensible, global database of data sets, metadata, and research networks built automatically by mining millions of published open access academic works. Our system leverages advances in AI and NLP technologies to extract and organise deep knowledge of available data sets that is otherwise hidden in plain sight in the continuous stream of research generated by the scientific community.

Explore SDG Datasets

Goal 1:
No Poverty

AI Induced Empathy

Datasets and projects designed to increase empathy for often impoverished victims of far-away disasters.

Poverty and Equity Database

The latest poverty and inequality indicators compiled from officially recognized sources with national, regional and global estimates.

Social Protection Responses to COVID-19

Annual social protection data are compiled by the International Labour Organization (ILO) through its Social Security Inquiry, sourced from national administrative data. The indicators are disseminated through the ILO World Social Protection Data Dashboards.

UNICEF Research & Reports

A list of equitable data sets, research and reports from UNICEF Office of Innovation to support programmes, campaigns, and initiatives.

Goal 2:
Zero Hunger

Crop Conditions

The Group on Earth Observations Global Agricultural Monitoring (GEOGLAM) Crop Monitor (https://cropmonitor.org/) is an international initiative that was developed under the framework of the 2011 G20 Action Plan on Food Price Volatility in Agriculture.

Desert Locust

Created by the UN Food and Agriculture Organization (FAO), an agency dedicated to international efforts to end hunger, this dataset tracks desert locust observations, as well as whether the observed locusts are adults or nymphs (known as hoppers) and whether the locusts form a group.

Global Hunger Index 2013

The Global Hunger Index (GHI) is a tool designed to comprehensively measure and track hunger globally, by region and country.

Global Hunger Index 2017

The Global Hunger Index (GHI) is a tool designed to comprehensively measure and track hunger globally, by region and country.

Insufficient Food Intake

The World Food Programme (WFP) has developed the HungerMapLIVE, a global hunger monitoring system that tracks and predicts hunger in near-real time.

Goal 3:
Good Health & Wellbeing

1000 Genome Project

The International Genome Sample Resource contains the most extensive catalogue of genetic variation in humans including SNPs, structural variants and haplotype context.

Access to Health Services for Women of Childbearing Age

WorldPop is an applied research group focussed on mapping demographics in low- and middle-income countries. As one of its activities, it measures the availability and geographical accessibility of healthcare services at the national and sub-national levels across Sub-Saharan Africa.

COVID-19 Cases

Created by the Johns Hopkins University Center for Systems Science and Engineering, this dataset reports COVID-19 cases at the provincial level in China, at the county level in the U.S., and at the state and national levels for other countries.

COVID-19 Vaccine Procurement

This dataset provides the most recent data on vaccine purchases and negotiations by individual countries and unilateral partnerships from 16 companies.

Global Health Observatory

The GHO data repository contains data collected by the World Health Organization on various health-related statistics including mortality and disease burden rates in 194 countries.

Goal 4:
Quality Education

Learners impacted by COVID-19

The United Nations Educational, Scientific and Cultural Organization (UNESCO) is supporting countries in their efforts to mitigate the immediate negative impact of school closures and to facilitate the continuity of education through remote learning.

Goal 5:
Gender Equality

Digital Gender Gap

The University of Oxford and the Qatar Computing Research Institute (QCRI), with support from Data2X, are collaborating to measure digital gender gaps in real time.

Female World Leaders

The United Nations Protocol and Liaison Service maintains a list of Heads of State, Heads of Government, and Ministers for Foreign Affairs of all Member States based on the information provided by the Permanent Missions.

Percentage of Women in Parliament

The Inter-Parliamentary Union (IPU) tracks monthly rankings of the percentage of women in parliament from January 2019 onwards through Parline, a free resource with over 600 data points provided directly by national parliaments on their structure, composition, working methods, and activities.

Goal 6:
Clean Water & Sanitation

Trophic State (Water Quality)

The UN Environment Programme (UNEP) works with partners to support the global monitoring of freshwater ecosystems, as reported through the Freshwater Ecosystems Explorer, which provides up-to-date geospatial data on changes to their extent and water quality.

Water Anomalies

The ISciences Water Security Indicator Model v2 (WSIMv2) describes places where water availability during the most recent 12-month period is more or less than would be expected based on a 1950-2009 baseline period.

Water Stress

The Falkenmark Water Stress Index is a widely used metric to characterize water stress based on annual renewable water supply per capita.

Goal 7:
Affordable & Clean Energy

Energy History

Data on energy consumption and per capita energy consumption of a few countries.

Our World in Data

Data on global energy consumption by source, energy production and trade, energy transitions and renewable energy investments.

Population Without Electricity

Developed by Fondazione Eni Enrico Mattei (FEEM), a sustainable development think-tank, this dataset measures electricity access in Sub-Saharan Africa.

The Shift Project

Datasets on primary energy production and consumption, CO2 from fossil fuels, greenhouse gas emissions, renewable energy and electricity.

Goal 8:
Decent Work & Economic Growth

Child Labor

Research by the Oxford Martin School on child labor internationally.

COVID-19 Fiscal Response

The International Monetary Fund (IMF) compiles a database of fiscal measures announced by 141 different governments in response to the COVID-19 pandemic.

COVID-19 Impact on Employment

The International Labour Organization (ILO) is tracking the severe impact of COVID-19 on the world of work.

GDP Growth Rates

The OECD’s quarterly national accounts (QNA) dataset presents GDP growth data collected from all the OECD member countries and some other major economies on the basis of a standardised questionnaire.

Goal 9:
Industry, Innovation & Infrastructure

Access to Internet

The International Telecommunication Union measures internet access across the globe twice a year using survey data.

Goal 10:
Reduced Inequalities

Global Database of Shared Prosperity

The World Bank’s Global Database of Shared Prosperity covers 83 countries, with 75 percent of the world’s people, with most recent estimates available for 2013.

International Migrant Stock

Reported by the UN Department of Economic and Social Affairs (UN DESA), international migrant stocks are estimates of the total number of international migrants present in a given country at a particular time.

ITU AI Repository

A global Artificial Intelligence (AI) repository to identify AI-related projects, research initiatives, think-tanks and organizations that can accelerate progress towards the 17 UN Sustainable Development Goals.

Goal 11:
Sustainable Cities & Communities

Air Quality (PM2.5)

OpenAQ, a non-profit organization, collects daily air quality information from stations around the world and provides it as free and open data to help better monitor and manage the air we breathe.

COVID-19 Community Mobility Reports

Google’s Community Mobility Reports chart the geographic movement trends associated with COVID-19 over time and provide the data, aggregated and anonymized, to the public.

European Data Portal

The European Data Portal harvests the metadata of Public Sector Information available on public data portals across European countries. Information regarding the provision of data and the benefits of re-using data is also included.

Settlement Extents

The database constitutes a comprehensive set of settlement polygons. It is in geodatabase format and consists of three feature classes for built up areas (BUA), small settlement areas (SSA), and hamlets (hamlets).

Sustainable Cities and Society Mendeley Datasets

Mendeley Data Repository is free-to-use and open access. It enables you to deposit any research data (including raw and processed data, video, code, software, algorithms, protocols, and methods) associated with your research manuscript.

The Settlement Profiling Tool

The Settlement Profiling Tool guides field personnel in creating cross-sectoral settlement profiles intended to help inform future urban development plans and policies in displacement-affected contexts.

Goal 12:
Responsible Consumption & Production

Global SDG Indicators Database

This platform provides access to data compiled through the UN System in preparation for the Secretary-General’s annual report on “Progress towards the Sustainable Development Goals.”

Installed Renewable Energy Capacity

The International Renewable Energy Agency (IRENA), an intergovernmental organization that supports countries in their transition to a sustainable energy future, compiled this dataset by measuring the maximum net generating capacity of renewable and non-renewable energy sources by country.

Moldova | SDG Integration

A collaborative data platform that integrates different types of data to allow the Moldovan Government access to exhaustive information on land coverage, population density and mobility behaviour.

SDG Production Tracker

SDG Tracker is a free, open-access publication that tracks global progress towards the SDGs and allows people around the world to hold their governments accountable for achieving the agreed goals.

Goal 13:
Climate Action

Arctic Sea Ice

Areas of the ocean that have frozen are considered “sea ice,” and can vary from slushy, barely solid areas to sheets of ice that are meters thick.

Carbon Dioxide Emissions

The Carbon Monitor dataset, led by researchers Zhu Liu, Philippe Ciais and Steven Davis, was created as the first estimate of daily CO2 emissions for six different sectors, including power, ground transportation, industrial production, residential consumption, and maritime and aircraft transportation.

Drought and Precipitation

The Climate Hazards Group InfraRed Precipitation with Station Data (CHIRPS) is a joint project between the U.S. Geological Survey and UC Santa Barbara.

Global Temperature Change

The National Oceanic and Atmospheric Administration (NOAA), the National Aeronautics and Space Administration (NASA), and the UK Meteorological Office (UK Met) have used detailed station data going back to the 1800s to analyze temperature changes and have all confirmed the warming of our planet.

National Centers for Environmental Information

NCEI provides the world’s largest collection of weather and climate data, including information that’s “land-based, marine, model, radar, weather balloon, satellite, and paleoclimatic” alongside other datasets.

NOAA – Climate.gov

Provides climate science and information, including news, data, and teaching materials, along with data products and services for tracking the global climate.

Our World in Data

Our World in Data provides a complete guide to CO2 and greenhouse gas emission profiles for individual countries, charting how emissions are changing in each country, along with reduction progress and statistics.

Goal 14:
Life below Water

Bleaching of Coral Reef Areas

Coral reefs are one of the most diverse and ecologically important areas in the world, but many are threatened by rising ocean temperatures.

Global Fishing Activity

Global Fishing Watch (GFW) is advancing ocean governance through increased transparency of human activity at sea.

Ocean Tracking Network

The Ocean Tracking Network is a global aquatic animal tracking, data management, and partnership platform.

Goal 15:
Life on Land

Conservation

The World Database on Protected Areas (WDPA) was established in 1981 after the UN Economic and Social Council called for a list of natural reserves, citing its value for economic, scientific, and conservation purposes.

Deforestation

Global Forest Watch (GFW) provides data and tools for monitoring forests, including access to near real-time information about where and how forests are changing around the world.

Global Forest Watch

Provides data about forests including land cover, land use, biodiversity metrics and forest change allowing for the monitoring and management of forests.

Resource Watch

Provides data on forest ecosystems including tree cover loss and gain rates, restoration opportunities, forest fires and biodiversity hotspots.

Sustainable Nutrition for All

Aims to improve nutrition through the adoption of agro-biodiversity and improved dietary diversity at the household level in Uganda and Zambia.

The Forest Atlases

Allows users to visualize and analyse data on country-specific forest characteristics.

Tropics Imagery

Norway’s International Climate and Forests Initiative (NICFI) makes high-resolution (<5m per pixel) optical satellite imagery of the tropics freely available to all in the pursuit of helping stop deforestation and combat climate change.

Wildfires

The Active Fires product, managed by the National Oceanic and Atmospheric Administration (NOAA), is based on the detection and analysis of active wildfires as received by a sensor.

WRI Research

Features environmental conservation and restoration frameworks for policymakers and private-sector initiatives including infographics, datasets, visualization tools, and more.

Goal 16:
Peace, Justice & Strong Institutions

Armed Conflict

The Armed Conflict Location & Event Data Project (ACLED), a disaggregated data collection, analysis, and crisis mapping project, maintains a database of all forms of human conflict from over 50 developing countries.

Piracy

The National Geospatial-Intelligence Agency, an agency within the United States Department of Defense, records instances of hostile attacks against ships and mariners via its Anti-Shipping Activity Messages (ASAM) database.

SDG16 Data Initiative

Pulls together data sets in an open format to track SDG16 and provide a snapshot of the current situation, and eventually progress over time.

Voluntary National Reviews

The Voluntary National Reviews (VNRs) aim to facilitate the sharing of experiences, including successes, challenges, and lessons learned, with the goal of accelerating the implementation of the 2030 Agenda.

Goal 17:
Partnerships for the Goals

Official Development Assistance

Official development assistance (ODA) is defined by the OECD Development Assistance Committee as government aid that promotes and targets the economic development and welfare of developing countries.

Remittances

Compiled by the World Bank, this dataset measures officially recorded remittance inflows (remittances received) per country in 2020.

Further Research and Resources

Interview with Achim Rettinger

AI for Good Board Member and Full Professor at Trier University, Achim Rettinger discusses with the AI for Good Foundation team his work in natural language processing and how it can impact progress toward the SDGs. According to Professor Rettinger, AI and machine learning can be used to understand communication better by analyzing huge quantities of data. The data can help the international community uncover insights on collective progress toward the 2030 deadline.

Join us

The AI for Good Foundation is continually looking for researchers and experts in the machine learning field to pool our collective talent in support of the UN’s Sustainable Development Goals.

The SDG Data Catalogue is structured so that research and data sets can be submitted and shared. The free flow of knowledge and open-source data is at the core of our vision.

Contact us to submit your research and to advise on the build-out of the search tool.

Impact Network

Association for Computing Machinery

ACM, the world's largest educational and scientific computing society, delivers resources that advance computing as a science and a profession.

Informs

The Institute for Operations Research and the Management Sciences is an international society for practitioners in the fields of operations research, management science, and analytics.

University of California, Berkeley

The University of California, Berkeley is a public research university in Berkeley, California.

Get Involved

Join our efforts to unlock AI’s potential towards serving humanity.

Support us

Support new research and collaborative projects to meet the UN’s Sustainable Development Goals.

Become a Partner

Collaborate with us on AI and Machine Learning Projects and Policy to make a meaningful impact.

Volunteer with us

Join our team to design, build or guide innovative AI research to shape the future of global policy.

Newsletter

Receive monthly newsletter updates on how AI for Good is creating impact around the world.