SDG Data Catalog

An open, evolving global database of SDG-relevant data sets.

The SDG Data Catalog is an open, extensible, global database of data sets, metadata, and research networks built automatically by mining millions of published open access academic works.

The SDG Data Catalog leverages advancements in Artificial Intelligence (AI) and Natural Language Processing (NLP) technologies to extract and organize knowledge from public datasets that is otherwise hidden in plain sight in the continuous stream of research generated by the scientific community.

The goal, ultimately, is to connect researchers and students with SDG-relevant datasets so that their work can make meaningful progress towards social good. 

Research Paper

Hidden in Plain Sight: Building a Global Sustainable Development Data Catalogue

By James Hodson & Andy Spezzati

Modern scientific research for Sustainable Development depends on the availability of large amounts of relevant real-world data. However, there are currently no extensive global databases that associate existing data sets with the research domains they cover.

We present the SDG Data Catalogue – an open, extensible, global database of data sets, metadata, and research networks built automatically by mining millions of published open access academic works. Our system leverages advances in AI and NLP technologies to extract and organise deep knowledge of available data sets that is otherwise hidden in plain sight in the continuous stream of research generated by the scientific community.

Explore SDG Datasets

Goal 1:
No Poverty

The latest poverty and inequality indicators compiled from officially recognized sources with national, regional and global estimates.

A list of equitable data sets, research and reports from UNICEF Office of Innovation to support programmes, campaigns, and initiatives.

Datasets and projects designed to increase empathy for often impoverished victims of far-away disasters.

Poverty

The World Poverty Clock developed by the World Data Lab provides real-time poverty estimates through 2030 for nearly all countries.

Social Protection

Annual social protection data are compiled by the International Labour Organization (ILO) through its Social Security Inquiry, sourced from national administrative data. The indicators are disseminated through the ILO World Social Protection Data Dashboards.

Goal 2:
Zero Hunger

The Global Hunger Index (GHI) is a tool designed to comprehensively measure and track hunger globally, by region and country.

Insufficient Food Intake

The World Food Programme (WFP) has developed HungerMapLIVE, a global hunger monitoring system that tracks and predicts hunger in near-real time.

Crop Conditions

The Group on Earth Observations Global Agricultural Monitoring (GEOGLAM) Crop Monitor (https://cropmonitor.org/) is an international initiative that was developed under the framework of the 2011 G20 Action Plan on Food Price Volatility in Agriculture.

Desert Locust

Created by the UN Food and Agriculture Organization (FAO), an agency dedicated to international efforts to end hunger, this dataset tracks desert locust observations, as well as whether the observed locusts are adults or nymphs (known as hoppers) and whether the locusts form a group.

Goal 3:
Good Health & Wellbeing

Genome Project

The International Genome Sample Resource contains the most extensive catalogue of genetic variation in humans including SNPs, structural variants and haplotype context.

global health

The GHO data repository contains data collected by the World Health Organization on various health-related statistics including mortality and disease burden rates in 194 countries.

Health Services

WorldPop is an applied research group focused on mapping demographics in low- and middle-income countries. As one of its activities, it measures the availability and geographical accessibility of healthcare services at the national and sub-national levels across Sub-Saharan Africa.

COVID-19 Cases

Created by the Johns Hopkins University Center for Systems Science and Engineering, this dataset reports COVID-19 cases at the provincial level in China, at the county level in the U.S., and at the state and national levels for other countries.

COVID-19 Vaccine

This dataset provides the most recent data on vaccine purchases and negotiations by individual countries and unilateral partnerships from 16 companies.

Goal 4:
Quality Education

education quality

Panel database on education quality featuring data from 163 countries from 1965 to 2015.

World Inequality

The World Inequality Database on Education (WIDE) highlights the powerful influence of circumstances, such as wealth, gender, ethnicity and location.

Learners impacted by COVID-19

The United Nations Educational, Scientific and Cultural Organization (UNESCO) is supporting countries in their efforts to mitigate the immediate negative impact of school closures and to facilitate the continuity of education through remote learning.

Goal 5:
Gender Equality

UNECE

Provides datasets on households across the globe including marriages, fertility rates, adolescent fertility, etc.

Harvard Dataverse

Database providing data from family planning surveys conducted in various countries.

World Leaders

The United Nations Protocol and Liaison Service maintains a list of Heads of State, Heads of Government, and Ministers for Foreign Affairs of all Member States based on the information provided by the Permanent Missions.

Women in Parliament

The Inter-Parliamentary Union (IPU) tracks monthly rankings of the percentage of women in parliament from January 2019 onwards through Parline, a free resource with over 600 data points provided directly by national parliaments on their structure, composition, working methods, and activities.

Gender Gap

The University of Oxford and Qatar Computing Research Institute (QCRI), with support from Data2X, are collaborating to measure digital gender gaps in real time.

Goal 6:
Clean Water & Sanitation

Water Footprint

The most comprehensive source of international water footprint data including scarcity and pollution issues.

Research Institute

Provides datasets on various issues including flood hazard maps, water risk indicators and water stress projections across the globe.

regional stats

Datasets providing world and regional statistics, data and maps.

Water Quality

The UN Environment Programme (UNEP) works with partners to support the global monitoring of freshwater ecosystems, as reported through the Freshwater Ecosystems Explorer, which provides up-to-date geospatial data on changes to their extent and water quality.

Water Stress

The Falkenmark Water Stress Index is a widely used metric to characterize water stress based on annual renewable water supply per capita.
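
As a rough illustration of how a per-capita water indicator of this kind is read, the sketch below divides annual renewable water supply by population and maps the result to the commonly cited Falkenmark thresholds (1,700, 1,000 and 500 cubic metres per person per year). The function name, category labels and input figures are illustrative assumptions, not part of any dataset listed here.

```python
def falkenmark_category(renewable_water_m3_per_year: float, population: int) -> tuple[float, str]:
    """Return (m3 per person per year, category) using commonly cited thresholds.

    Hypothetical helper for illustration only; the thresholds (1700, 1000, 500)
    are the ones usually quoted for the Falkenmark indicator.
    """
    if population <= 0:
        raise ValueError("population must be positive")

    per_capita = renewable_water_m3_per_year / population

    if per_capita >= 1700:
        category = "no stress"
    elif per_capita >= 1000:
        category = "water stress"
    elif per_capita >= 500:
        category = "water scarcity"
    else:
        category = "absolute scarcity"
    return per_capita, category


if __name__ == "__main__":
    # Made-up example: 1.2 billion m3 per year shared by 1 million people.
    value, label = falkenmark_category(1.2e9, 1_000_000)
    print(f"{value:.0f} m3 per person per year: {label}")
```

Because the indicator only compares supply with headcount, it says nothing about water quality, seasonal variation or actual demand, which is why it is usually read alongside other water indicators.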

Water Anomalies

The ISciences Water Security Indicator Model v2 (WSIMv2) describes places where water availability during the most recent 12-month period is more or less than would be expected based on a 1950-2009 baseline period.

Goal 7:
Affordable & Clean Energy

world

Data on global energy consumption by source, energy production and trade, energy transitions and renewable energy investments.

Energy History

Data on energy consumption and per capita energy consumption of a few countries.

Renewable Energy

Detailed statistics on renewable energy capacity, power generation and renewable energy balances.

energy information

Global data on energy generation and consumption, energy intensity, CO2 emissions as well as import and export statistics.

project

Datasets on primary energy production and consumption, CO2 from fossil fuels, greenhouse gas emissions, renewable energy and electricity.

AI for renewable energy Population

Developed by Fondazione Eni Enrico Mattei (FEEM), a sustainable development think-tank, this dataset measures electricity access in Sub-Saharan Africa.

Goal 8:
Decent Work & Economic Growth

monetary fund

The IMF publishes a range of time series data on IMF lending, exchange rates and other economic and financial indicators.

child labor

Research by the Oxford Martin School on child labor internationally.

Employment

The International Labour Organization (ILO) is tracking the severe impacts of COVID-19 on the world of work.

COVID-19 Fiscal Response

The International Monetary Fund (IMF) compiles a database on fiscal measures announced by 141 different governments in response to the COVID-19 pandemic.

GDP Growth

The OECD's quarterly national accounts (QNA) dataset presents GDP growth data collected from all the OECD member countries and some other major economies on the basis of a standardised questionnaire.

Goal 9:
Industry, Innovation & Infrastructure

globalization

This study investigates the effect of the latest wave of economic globalization on manufacturing employment in developing countries.

UNIDO Industrialization

Gives graphs as well as country highlights relevant to survey results.

Internet

The International Telecommunication Union measures internet access across the globe twice a year using survey data.

Goal 10:
Reduced Inequalities

ITU

A global Artificial Intelligence (AI) repository to identify AI-related projects, research initiatives, think-tanks and organizations that can accelerate progress towards the 17 UN Sustainable Development Goals.

prosperity

The World Bank's Global Database of Shared Prosperity covers 83 countries, home to 75 percent of the world's people, with the most recent estimates available for 2013.

Inequality

The goal of the Standardized World Income Inequality Database (SWIID) is to meet the needs of those engaged in broadly cross-national research by maximizing the comparability of income inequality data.

Migrant Stock

Reported by the UN Department of Economic and Social Affairs (UN DESA), international migrant stocks are estimates of the total number of international migrants present in a given country at a particular time.

Goal 11:
Sustainable Cities & Communities

Settlement Profiling

The Settlement Profiling Tool guides field personnel in creating cross-sectoral settlement profiles intended to help inform future urban development plans and policies in displacement affected contexts.

Sustainable Cities

Mendeley Data Repository is free-to-use and open access. It enables you to deposit any research data (including raw and processed data, video, code, software, algorithms, protocols, and methods) associated with your research manuscript.

European Data

The European Data Portal harvests the metadata of Public Sector Information available on public data portals across European countries. Information regarding the provision of data and the benefits of re-using data is also included.

Smart Cities

A compilation of smart cities around the world that have shared open data in an aggregated data portal.

Air Quality

OpenAQ, a non-profit organization, collects daily air quality information from stations around the world and provides it as free and open data to help better monitor and manage the air we breathe.

Settlement

The database constitutes a comprehensive set of settlement polygons. It is in geodatabase format and consists of three feature classes for built up areas (BUA), small settlement areas (SSA), and hamlets (hamlets).

COVID-19 Community

Google’s Community Mobility Reports chart the geographic movement trends associated with COVID-19 over time and provide the data, aggregated and anonymized, to the public.

Goal 12:
Responsible Consumption & Production

SDG Indicators

This platform provides access to data compiled through the UN System in preparation for the Secretary-General's annual report on "Progress towards the Sustainable Development Goals."

production

SDG Tracker is a free, open-access publication that tracks global progress towards the SDGs and allows people around the world to hold their governments accountable for achieving the agreed goals.

Moldova

A collaborative data platform that integrates different types of data to allow the Moldovan Government access to exhaustive information on land coverage, population density and mobility behaviour.

Renewable Energy

The International Renewable Energy Agency (IRENA), an intergovernmental organization that supports countries in their transition to a sustainable energy future, compiled this dataset by measuring the maximum net generating capacity of renewable and non-renewable energy sources by country.

Goal 13:
Climate Action

NOAA

Provides climate science and information, including news, data, and teaching materials, along with data products and services for tracking the global climate.

Our World in Data provides a complete guide to CO2 and greenhouse gas emission profiles for individual countries, charting how emissions are changing in each country, reduction progress and statistics.

Environmental

NCEI provides the world’s largest collection of weather and climate data, including information that’s “land-based, marine, model, radar, weather balloon, satellite, and paleoclimatic” alongside other datasets.

Arctic Sea Ice

Areas of the ocean that have frozen are considered “sea ice,” and can vary from slushy, barely solid areas to sheets of ice that are meters thick.

Drought

The Climate Hazards Group InfraRed Precipitation with Station Data (CHIRPS) is a joint project between the U.S. Geological Survey and UC Santa Barbara.

Temperature Change

The National Oceanic and Atmospheric Administration (NOAA), the National Aeronautics and Space Administration (NASA), and the UK Meteorological Office (UK Met) have used detailed station data going back to the 1800s to analyze temperature changes and have all confirmed the warming of our planet.

Carbon Project

The Carbon Monitor dataset, led by researchers Zhu Liu, Philippe Ciais and Steven Davis, was created as the first estimate of daily CO2 emissions for six different sectors, including power, ground transportation, industrial production, residential consumption, and maritime and aircraft transportation.

Goal 14:
Life below Water

Plastic Pollution

Includes the lifecycle of plastic in the oceans, plastic hotspots, and other measures.

Marine Debris

Datasets on marine debris and garbage patches in the oceans.

Plastic Pollution

A global dataset of 1571 locations where surface manta tows were conducted.

Sources of ocean plastic organized by river.

Biodiversity

The global spatial distribution of likely or potential Critical Habitat, as defined by the International Finance Corporation’s Performance Standard 6 (IFC PS6) criteria, comprises 20 underlying datasets.

The Ocean Tracking Network is a global aquatic animal tracking, data management, and partnership platform.

Coral reefs are one of the most diverse and ecologically important areas in the world, but many are threatened by rising ocean temperatures.

Fishing

Global Fishing Watch (GFW) is advancing ocean governance through increased transparency of human activity at sea.

Goal 15:
Life on Land

Forest Watch

Provides data about forests including land cover, land use, biodiversity metrics and forest change allowing for the monitoring and management of forests.

Provides data on forest ecosystems including tree cover loss and gain rates, restoration opportunities, forest fires and biodiversity hotspots.

Deforestation Atlases

Allows users to visualize and analyse data on country-specific forest characteristics.

Convivial

The project is grounded in the premise that conservation is critical to transformations to sustainability but that its practices need to change radically.

Nutrition

Aims to improve nutrition through the adoption of agro-biodiversity and improved dietary diversity at the household level in Uganda and Zambia.

WRI

Features environmental conservation and restoration frameworks for policymakers and private-sector initiatives including infographics, datasets, visualization tools, and more.

Deforestation

Global Forest Watch (GFW) provides data and tools for monitoring forests and provides access to near real-time information about where and how forests are changing around the world.

Conservation

The World Database on Protected Areas (WDPA) was established in 1981 after the UN Economic and Social Council called for a list of natural reserves, citing its value for economic, scientific, and conservation purposes.

Wildfires

The Active Fires product, managed by the National Oceanic and Atmospheric Administration (NOAA), is based on the detection and analysis of active wildfires as received by a sensor.

Tropics

Norway's International Climate and Forests Initiative (NICFI) makes high-resolution (<5m per pixel) optical satellite imagery of the tropics freely available to all in the pursuit of helping stop deforestation and combat climate change.

Goal 16:
Peace, Justice & Strong Institutions

Data Initiative

Pulls together data sets in an open format to track SDG16 and provide a snapshot of the current situation, and eventually progress over time.

UCDP

Tracks global conflict and violence.

Human Rights

Tracking human rights abuses over time.

WJP

Covering major topics on law and order by country.

UNICEF

Find data sets on topics including early childhood development, infant mortality, and intimate partner violence.

Displacement

Provides data and analysis, and supports partners to identify and implement solutions to internal displacement.

Armed Conflict

The Armed Conflict Location & Event Data Project (ACLED), a disaggregated data collection, analysis, and crisis mapping project, maintains a database of all forms of human conflict from over 50 developing countries.

Piracy

The National Geospatial-Intelligence Agency, an agency within the United States Department of Defense, records instances of hostile attacks against ships and mariners via its Anti-Shipping Activity Messages (ASAM) database.

Voluntary National Reviews

The Voluntary National Reviews (VNRs) aim to facilitate the sharing of experiences, including successes, challenges, and lessons learned, with the goal of accelerating the implementation of the 2030 Agenda.

Goal 17:
Partnerships for the Goals

Impact

The project looks at the broader ways in which universities can collaborate in support of the SDGs and lists partnerships in a ranking system.

Sustainable Development Goals

Real-time data on ongoing SDG progress.

Human Rights

The Danish Institute for Human Rights developed and trained an algorithm to link human rights recommendations to the corresponding SDG(s).

Remittances

Compiled by the World Bank, this dataset measures officially recorded remittance inflows (remittances received) per country in 2020.

Development Assistance

Official development assistance (ODA) is defined by the OECD Development Assistance Committee as government aid that promotes and targets the economic development and welfare of developing countries.

Further Research and Resources

Interview with Achim Rettinger

Achim Rettinger, AI for Good Board Member and Full Professor at Trier University, discusses his work in natural language processing with the AI for Good Foundation team, and how it can affect progress toward the SDGs. According to Professor Rettinger, AI and machine learning can be used to understand communication better by analyzing huge quantities of data. The data can help the international community uncover insights on the collective progress toward the 2030 deadline.

Join us

The AI for Good Foundation is continually looking for researchers and experts in the machine learning field to pool our collective talent in support of the UN’s Sustainable Development Goals.

The SDG Data Catalogue is structured so that research and data sets can be submitted and shared. Free flow of knowledge and open source data is at the core of our vision.

Contact us to submit your research and to advise on the build out of the search tool.

Impact Network

Association for Computing Machinery

ACM, the world's largest educational and scientific computing society, delivers resources that advance computing as a science and a profession.

INFORMS

The Institute for Operations Research and the Management Sciences (INFORMS) is an international society for practitioners in the fields of operations research, management science, and analytics.

University of California, Berkeley

The University of California, Berkeley is a public research university in Berkeley, California.
