IGF 2019 WS #30 Let there be data – Exploring data as a public good

Organizer 1: Lea Gimpel, Deutsche Gesellschaft für Internationale Zusammenarbeit (GIZ) GmbH
Organizer 2: Irmgarda Kasinskaite, UNESCO
Organizer 3: Ali Nyiringabo, Digital Umuganda

Speaker 1: Renata Avila, Civil Society, Latin American and Caribbean Group (GRULAC)
Speaker 2: Cathleen Berger, Private Sector, Western European and Others Group (WEOG)
Speaker 3: Audace Niyonkuru, Technical Community, African Group
Speaker 4: Mohammed Belkacem, Civil Society, African Group
Speaker 5: Rene Kabalisa, Government, African Group

Policy Question(s): 

• How can we support the development of digital public goods such as common data infrastructures to train artificial intelligences, e.g. for voice recognition technology in underrepresented languages?
• How can we develop sustainable governance models for data commons based on a multi-stakeholder approach?
• Which role can data commons play as an instrument of innovation policy and means to stimulate supply and demand for innovative technological solutions?

Relevance to Theme: Today, applications, which use artificial intelligence or automated decision making, are mostly developed by Western companies and in China. A big part of the world, notably people living in the global South, are excluded both from the development of these applications as well as from being represented in the data used to train artificial intelligences. One example is voice recognition technology: In local languages, this technology has the potential to enable underrepresented groups access to information, services and the diversity of cultural expressions. It is essential for an inclusive and diverse information society, and will play a major role in human-machine-interaction in the future. However, due to economic reasons, corporations are focusing on mainstream languages such as English and Chinese, leaving the majority of people in the global South underserved and excluded.
By discussing means to develop (local) data pools as commons, we are focusing on the open provision of training data as a crucial precondition for (local) developers to build inclusive AI-based applications and thereby close the digital divide we see today in the development and use of artifical intelligence.

Relevance to Internet Governance: The development of inclusive and ethical AI-based applications requires both a normative framework and shared resources, which enable more people to build applications relevant to their local context. For instance, voice recognition technology in local languages is oftentimes lacking a business case to justify investments in collecting data and the training of models, even if the potential for digital inclusion is staggering.
Building data commons thus takes away high investments needed by one stakeholder and bases the development of locally relevant AI applications on a multi-stakeholder model with shared responsibilities. It is these governance models for data commons and the respective roles governments, private sector and civil society can play within it, which we would like to discuss during the session.


Break-out Group Discussions - Flexible Seating - 90 Min

Description: Data is mostly seen as a tool: for decision-making, micro-targeted advertising, surveillance, and in some cases for social good, e.g. to increase transparency. However, data nowadays is also an infrastructure critical to social and economic development. Especially for the training of artificial intelligences, the availability of high quality data is crucial and one of the main barriers for the development of local AI-based solutions, especially in the global South where resources to acquire data are scarce.
Both the availability of training data and AI-based solutions as such can play a major role in addressing current inequalities regarding access to knowledge, services and the diversity of cultural expressions. Exemplary for impact-driven AI-based solutions is voice interaction: it has the potential to enable millions of people access to information and services they do not have yet, preserve cultural heritage, make technology more inclusive and ultimately foster local value creation as well as digital sovereignty.
In this session, we would like to explore different initiatives aiming at creating data commons and digital public goods to learn from their successes and challenges. We will discuss various governance models and ecosystem approaches such as community-governance and multi-stakeholder models with the aim to democratize the potential of artificial intelligence for all.

Expected Outcomes: - shared lessons learned and good practices for the development of digital public goods, especially data commons
- mapping of different governance models for data commons and the respective roles government, private sector and civil society play in such an ecosystem
- discuss the economic impact data commons potentially have as a means to stimulate the development and demand for innovative AI-based solutions amongst stakeholders

Onsite Moderator: 

Lea Gimpel, Government, Western European and Others Group (WEOG)

Online Moderator: 

Irmgarda Kasinskaite, Intergovernmental Organization, Intergovernmental Organization


Ali Nyiringabo, Technical Community, African Group

Discussion Facilitation: 

The session will consist of a short series of initial inputs from each of the speakers (5-7 minutes each), which will be followed by an interactive round of discussions in smaller groups (potentially in a "world café format") of approximately 40 minutes, each of them hosted by one of the speakers. The results of these "breakout sessions" will be brought together in the last 15-20 minutes of the workshop.
At the beginning of the session, we will also use Slido or a similar tool to collect open questions and comments of participants, which then will be addressed during the workshop.

Online Participation: 

At the beginning of the session we will use the tool to collect comments and questions from remote participants, which then will be addressed during the workshop. During the breakout sessions, each discussion group will use a laptop to ensure that remote participants can follow and take part in the discussion. During the wrap-up phase of the workshop, we will use the tool to ensure that remote participants will have the possibility to share their perspective with the bigger group.

Proposed Additional Tools: We will use Slido or a similar tool, which enables polls and the rating of questions and comments from the audience.


GOAL 1: No Poverty
GOAL 8: Decent Work and Economic Growth
GOAL 9: Industry, Innovation and Infrastructure
GOAL 10: Reduced Inequalities
GOAL 11: Sustainable Cities and Communities
GOAL 12: Responsible Production and Consumption
GOAL 17: Partnerships for the Goals