Professional Documents
Culture Documents
Participatory Data Stewardship - Final Report
Participatory Data Stewardship - Final Report
Participatory data
stewardship
Contents
3 Executive summary
6 Foreword
9 Introduction
53 Conclusion
55 Methodology
56 Appendix
58 Acknowledgements
Executive Summary Participatory data stewardship 3
Executive summary
1 Triggle, N. (2014). ‘Care.data: How did it go so wrong?’ BBC News. 19 Feb. Available at: https://www.bbc.co.uk/news/
health-26259101 [Accessed 6 Jul. 2021].
2 Fruchter, N., Yuan, B. and Specter, M. (2018). ‘Facebook/Cambridge Analytica: Privacy lessons and a way forward’. Internet Policy
Research Initiative at MIT. Available at: https://internetpolicy.mit.edu/blog-2018-fb-cambridgeanalytica/.
3 Centre for Data Ethics and Innovation. (2020). Independent report: Addressing trust in public sector data use. GOV.UK. Available
at: https://www.gov.uk/government/publications/cdei-publishes-its-first-report-on-public-sector-data-sharing/addressing-trust-in-
public-sector-data-use [Accessed 15 February 2021].
Executive Summary Participatory data stewardship 4
4 Arnstein, S. (1969). ‘A Ladder of Citizen Participation’. Journal of the American Institute of Planners, 35(4), pp.216-224. Available at:
https://www.tandfonline.com/doi/abs/10.1080/01944366908977225
5 Patel, R. and Gibbon, K. (2018). ‘Why decisions about the economy need you’. The RSA. Available at: https://www.thersa.org/
blog/2017/04/why-decisions-about-the-economy-need-you [Accessed 15 February 2021].
6 Patel, R. and Peppin, A. (2z020). ‘Exploring principles for data stewardship’. Ada Lovelace Institute. Available at: https://www.
adalovelaceinstitute.org/project/exploring-principles-for-data-stewardship/ [Accessed 16 Feb. 2021].
Executive Summary Participatory data stewardship 5
7 Kapoor, Astha and Whitt, Richard S. (2021). Nudging Towards Data Equity: The Role of Stewardship and Fiduciaries
in the Digital Economy. February 22. Available at SSRN: https://ssrn.com/abstract=3791845 or http://dx.doi.org/10.2139/
ssrn.3791845
8 Ada Lovelace Institute. (2021). Exploring legal mechanisms for data stewardship. Available at: https://www.adalovelaceinstitute.org/
report/legal-mechanisms-data-stewardship/
Foreword Participatory data stewardship 6
Foreword
One of the most frustrating experiences for people is when they are
told they will have the power to shape something, but find in fact
that consultation is very limited. To mitigate this the report helpfully
distinguishes between different kinds of involvement, with a spectrum
ranging from ‘inform’ all the way to ‘empower’.
Hetan Shah
Vice-Chair, Ada Lovelace Institute
Chief Executive, The British Academy
How to read this report Participatory data stewardship 7
Introduction
‘Involving beneficiaries The report provides evidence that involving people (‘beneficiaries’) in the
can encourage design, development and deployment of data governance frameworks
responsible innovation can help create the checks and balances that engender greater societal
and improve data quality’ and economic equity, can help to rebalance asymmetries of power, and
can contribute towards increased public confidence in the use of data.11
• the data subjects – the people to whom the data directly relates, for
instance, when processing biometrics data
• people within the wider public – for instance, those who might have an
interest in how data is governed or used ethically, as well as those who
might have lived experience of an issue or disadvantage
• people at risk of being oversurveilled, underrepresented or missing
from the data itself, e.g. migrant populations, members of Indigenous
communities, people from racialised minority groups, people with
mental health conditions and transgender people
• stakeholders working in technology or related organisations, or
members of a global supply chain. e.g. those engaged in collecting and
processing data, or who have an interest in data to secure their own
collective workplace rights.
11 Kapoor, Astha and Whitt, Richard S. (2021). Nudging Towards Data Equity: The Role of Stewardship and Fiduciaries
in the Digital Economy. 22 February. Available at SSRN: https://ssrn.com/abstract=3791845 or http://dx.doi.org/10.2139/
ssrn.3791845
12 The Information Commissioner’s Office defines the data subject as ‘the identified or identifiable living individual to whom personal
data relates’. Available at: https://ico.org.uk/for-organisations/data-protection-fee/legal-definitions-fees/#subject
Introduction Participatory data stewardship 11
More representative,
inclusive and Fairer and more
proportionate uses equitable outcomes
13 The use of stewardship to describe the governance of common resources was foundational to the work of Nobel Prize-winning
economist Elinor Ostrom, who developed design principles for collective governance. Though often focused on shared natural
resources like pastures, forests or fisheries, applying Ostrom’s principles to data can help us to think through thorny challenges
of doing good with data. See Ostrom, E. (2015). Governing the Commons. Cambridge: Cambridge University Press. The Ada
Lovelace Institute has mapped Ostrom’s principles against real-world examples where data is stewarded for social causes, or on
behalf of data subjects, see: https://docs.google.com/spreadsheets/d/1hAN8xMJuxobjARAWprZjtcZgq1lwOiFT7hf2UsiRBYU/
edit#gid=1786878085.
Introduction Participatory data stewardship 12
‘We understand data The Ada Lovelace Institute has developed the following working
stewardship as key to definition of data stewardship:
protecting the data
rights of individuals
and communities’ ‘The responsible use, collection and management
of data in a participatory and rights-preserving
way.’14
In this report, the term ‘data steward’ is used to describe the role of
the individuals and organisations processing and using data on behalf
of beneficiaries. In particular, stewards are responsible for involving
the people who have a stake in and are impacted by data use and
processing. That involvement is based on a relationship of trust and
a social mandate to use data (often described in the legal context as
a trust-based ‘fiduciary’ relationship – where the data steward has a
responsibility to put people and society’s interests ahead of their own
individual or organisational interests).
Data stewardship can operate throughout the data lifecycle (see figure 2
below). In the age of ‘datafication’,15 data stewardship operates not only in
relation to data collection, processing, organising, analysis and use, but
also in the design and development of data-driven systems, including
those that aim to predict, prescribe, observe, evaluate and sometimes
influence human and societal behaviour. This makes data stewardship
a complex task, but also points to its potential to engender better
outcomes for people and society.
14 See Ada’s post, Disambiguating data stewardship, for more on the development of this definition. Available at: https://www.
adalovelaceinstitute.org/blog/disambiguating-data-stewardship/
15 ’Datafication’ is a term that describes the process of rendering human and social behaviour in quantifiable ways, in turn releasing it as
a new form of economic and social value. See the Ada Lovelace Institute’s report The data will see you now for a detailed explanation
of how this operates in relation to health data. Available at: https://www.adalovelaceinstitute.org/report/the-data-will-see-you-now/
Introduction Participatory data stewardship 13
Delete Store
Figure 2: The data lifecycle 3. Introducing a framework for participatory data stewardship
The Ofqual exam results algorithm that made predictions about A-levels
in the UK, or the cookie notices and terms and conditions that ‘nudge’
towards uninformed consent at the expense of individual data rights,
demonstrate how this can disempower people.16
16 Office for Statistics Regulation. (2021). Ensuring statistical models command public confidence. Available at: https://osr.
statisticsauthority.gov.uk/publication/ensuring-statistical-models-command-public-confidence/ [Accessed 19 Aug 2021].
Introduction Participatory data stewardship 14
Inform Collaborate
We will keep you informed about We will look to you for advice
how your data is being governed and innovation in design of
data-governance models, and
incorporate your advice and
recommendations where possible
Involve
We will work with you to ensure your
concerns and aspirations are directly
reflected in data governance
Consult Empower
We will listen to, acknowledge We will advise and assist in line
and provide feedback on concerns with your decisions about your
and aspirations for the governance own data-governance models
of your data
Figure 3: Framework for We propose this framework to support thinking about how different
participation in data modes of participation in and about data governance can enable
stewardship
beneficiaries to have increasing power and agency in the decisions
made about their (and by extension others’) data. Moving through the
spectrum, the level of power afforded to beneficiaries increases from
being the recipients of ‘information’ through to being ‘empowered’ to act
with agency.
17 Arnstein, S. (1969). ‘A Ladder of Citizen Participation’. Journal of the American Institute of Planners, 35(4), pp.216-224. Available at:
https://www.tandfonline.com/doi/abs/10.1080/01944366908977225
18 Patel, R. and Gibbon, K. (2018). ‘Why decisions about the economy need you’. The RSA. Available at: https://www.thersa.org/
blog/2017/04/why-decisions-about-the-economy-need-you [Accessed 15 February 2021].
Introduction Participatory data stewardship 15
Data sheets
Explainability
Reframing data
narratives
Communication and
transparency
UX design
Lived experience
panels
Co-design:
developing the
terms of use
Co-design
Hackathons
Public deliberation
Figure 4: Purposes of
different participatory
mechanisms
Mechanisms Participatory data stewardship 17
Purpose Description What people can expect Relevant participatory Case studies
(drawn from Arnstein’s (drawn from the IAP2 mechanisms, methods
ladder of participation) Participation Spectrum) and activities
INFORMING ‘A one way flow of ‘We will keep you informed • Meaningful and radical • Enabling ‘armchair
information’ on how your data is being transparency audit’ through open
(p. 19)
used’ • Explainability – the AI and algorithm
process of enabling a registers (Helsinki
data-driven system to be and Amsterdam),
explained in human administered by
terms Saidot, 2020
• Mechanisms such as • Making sense of the
model cards and data societal value of
sheets data by reframing
• Rethinking and ‘big-data mining’
reframing how we talk and ‘data as oil’
about data, so it is more narratives
accessible and inclusive
Mechanisms Participatory data stewardship 18
Purpose Description What people can expect Relevant participatory Case studies
(drawn from Arnstein’s (drawn from the IAP2 mechanisms, methods
ladder of participation) Participation Spectrum) and activities
CONSULTING ‘Inviting people’s opinions, ‘We will listen to, • Community networks • Community
through attitude surveys, acknowledge concerns • User experience (UX) engagement and
(p. 28)
neighbourhood meetings and aspirations, and testing and co-design consultation by
and public hearings’ provide feedback on • Surveys and public CityVerve
how public input influenced attitudes research (Manchester),
the data-governance multi-sector
• Community
framework’ consortium led
engagements and
by Manchester
consultations
City Council,
2015
INVOLVING ‘Allow citizens to advise, ‘We will work with you to • One-off and • Institutionalised
but retain for ensure your concerns and institutionalised public public
(p. 32)
powerholders the aspirations are directly deliberation and deliberation, the
continued right to decide’ reflected in data deliberative democracy Danish Board of
governance... we will initiatives Technology,
provide feedback on how • Lived experience panels 1986–2011
public input influenced • Horizon scanning, design
these decisions’ thinking and futures
thinking
COLLABORATING ‘Enables people to ‘We will look to you for • Public deliberation and • Global mental
negotiate and engage in advice and innovation in deliberative democracy health data bank,
(p. 38)
trade-offs with design of data-governance initiatives the Wellcome
powerholders’ frameworks... and • Bottom-up ‘data- Trust with Sage
incorporate your advice governance initiatives’ Bionetworks,
and recommendations managed by an 2020
to the maximum extent independent fiduciary • Large-scale data
possible’ (e.g. data trusts, and donation for
data-sharing contracts research, UK
that build in Biobank,
collaboration) 2006–present
• Participant panels and
data donation
mechanisms
EMPOWERING ‘Citizens obtain the ‘We will provide advice and • Data governance rules • Citizen-driven,
majority of decision- assistance as requested in shaped and routinely collaborative
(p. 44)
making seats or full line with your decisions for reviewed by beneficiaries data
managerial power’ designing/developing your and data donors management
own data-governance • Voting on governance and governance,
framework’ boards of data- access Salus Coop
initiatives (Spain),
• Ownership and/or control 2017–present
of data cooperatives
• Setting terms of data
licensing and access, with
permissions overseen by
citizens
Mechanisms Participatory data stewardship 19
19 Wachter, S. (2021). A right to explanation. Alan Turing Institute. Available at: https://www.turing.ac.uk/research/impact-stories/a-right-
to-explanation [Accessed 15 February 2021].
20 See website from the Alan Turing Institute on Project ExplAIn. Leslie, D. (2019). Project ExplAIn. The Alan Turing Institute. Available at:
https://www.turing.ac.uk/news/project-explain [Accessed 15 February 2021].
Mechanisms Participatory data stewardship 20
21 Ruiz, J. (2018). ‘Machine learning and the right to explanation in GDPR’. Open Rights Group. Available at: https://www.openrightsgroup.
org/blog/machine-learning-and-the-right-to-explanation-in-gdpr/. [Accessed 15 February 2021].
22 ICO. (2020). What goes into an explanation? Available at: https://ico.org.uk/for-organisations/guide-to-data-protection/key-data-
protection-themes/explaining-decisions-made-with-artificial-intelligence/part-1-the-basics-of-explaining-ai/what-goes-into-an-
explanation/ [Accessed 5 Jul. 2021].
Mechanisms Participatory data stewardship 21
23 Ananny M, Crawford K. (2018). Seeing without knowing: Limitations of the transparency ideal and its application to algorithmic
accountability. New Media & Society. 20(3):973-989. doi:10.1177/1461444816676645
24 M. Kaminski. (2020). Understanding Transparency in Algorithmic Accountability. Forthcoming in Cambridge Handbook of the Law
of Algorithms, ed. Woodrow Barfield, Cambridge University Press (2020)., U of Colorado Law Legal Studies Research Paper No. 20-
34, Available at SSRN: https://ssrn.com/abstract=3622657
25 L. Stirton, M. Lodge. (2002). Transparency Mechanisms: Building Publicness into Public Services. Journal of Law and Society. https://
doi.org/10.1111/1467-6478.00199
26 See, for example the model cards from Google Cloud: https://modelcards.withgoogle.com/about
27 Gebru, T., Morgenstern, J., Vecchione, B., Vaughan, J.W., Wallach, H., Daumeé III, Hal and Crawford, K. (2018). Datasheets for Datasets.
Available at: https://arxiv.org/abs/1803.09010.
Mechanisms Participatory data stewardship 22
Helsinki and Amsterdam were among the first cities to announce open artificial
intelligence (AI) and algorithm registers. These were founded on the premise
that the use of AI in public services should adhere to the same principles of
transparency, security and openness as other city activities, such as public
bodies’ approaches to procurement.
Participatory implications:
These AI and algorithm registers are openly available, and any individual is able
to check them. This illuminates the potential for ‘armchair auditing’ of public-
service algorithmic systems. Accessing the register reveals information about
the systems that are reported on, the information that is being provided, and the
specific applications and contexts in which algorithmic systems and AI are being
used.
28 Dimitrova, A., (2020). ‘Helsinki and Amsterdam with first ever AI registries’ TheMAYOR.eu Available at: https://www.themayor.eu/en/a/
view/helsinki-and-amsterdam-with-first-ever-ai-registries-5982 [Accessed 15 Feb. 2021].
Mechanisms Participatory data stewardship 23
29 Dimitrova, A. (2020).
30 Safak, C. and Parker, I. (2020). ‘Meaningful transparency and in(visible) algorithms’. Ada Lovelace Institute. Available at: https://www.
adalovelaceinstitute.org/blog/meaningful-transparency-and-invisible-algorithms/
31 Edwards, L. and Veale, M. (2018). ‘Enslaving the Algorithm: From a “Right to an Explanation” to a “Right to Better Decisions”?’. IEEE
Security & Privacy, 16(3), pp.46–54. Available at: https://arxiv.org/pdf/1803.07540.pdf
Mechanisms Participatory data stewardship 24
Technologists and Others have highlighted that increased transparency about data use and
policymakers frequently management forces policymakers to be more explicit and transparent
reach for analogies and about the trade-offs they are choosing to make. In this context, it has
metaphors, in efforts been argued that explainability doesn’t just pose a technical challenge,
to communicate but also a policy challenge about which priorities, objectives and trade-
with diverse publics offs to choose.32
about data
32 Coyle, D., & Weller, A. (2020). ‘“Explaining” machine learning reveals policy challenges’. Science, 368 (6498), pp.1433-1434. Available
at: https://doi.org/10.1126/science.aba9647
33 Van den Boomen, M. (2014). Transcoding the digital: How metaphors matter in new media. Instituut voor Netwerkcultuur.
Available at: https://dspace.library.uu.nl/handle/1874/289883
34 Puschmann, C. and Burgess, J. (2014). ‘Big Data, Big Questions | Metaphors of Big Data’, International Journal of Communication, 8(0),
p. 20. Available at: https://ijoc.org/index.php/ijoc/article/view/2169
Mechanisms Participatory data stewardship 25
Data has been called everything from ‘the new oil’, to water or radioactive
waste.35 Metaphors like this are used to make the concept of data more
tangible, more open and more accessible to the public, but they ascribe
qualities to data that aren’t necessarily inherent, and so can have the
effect of mystifying or creating disempowering perceptions of data.
Metaphors can frame issues in particular ways, highlighting certain
qualities while obscuring others, and this can shape perspectives and
belief according to the intentions of those setting the narrative.
Such narratives also suggest that practices about data are, like forces
of nature, unfixed and unchangeable – rather than social practices
undertaken by people and power holders. And it’s important to note that
35 Rajan, A. (2017). ‘Data is not the new oil’, BBC News. Available at: https://www.bbc.com/news/entertainment-arts-41559076; Lupton,
D. (2013). ‘Swimming or drowning in the data ocean? Thoughts on the metaphors of big data’, This Sociological Life, Available at:
https://simplysociology.wordpress.com/2013/10/29/swimming-or-drowning-in-the-data-ocean-thoughts-on-the-metaphors-of-
big-data/; Doctorow, C. (2008). ‘Cory Doctorow: why personal data is like nuclear waste.’ The Guardian. Available at: http://www.
theguardian.com/technology/2008/jan/15/data.security (All accessed: 15 February 2021).
36 Coyle D., Diepeveen S., Wdowin J., et al. (2020). The value of data: Policy Implications. Bennett Institute for Public Policy Cambridge,
the Open Data Institute and Nuffield Foundation. Available at: https://www.bennettinstitute.cam.ac.uk/media/uploads/files/Value_of_
data_Policy_Implications_Report_26_Feb_ok4noWn.pdf
37 Lupton D. (2013). ‘Swimming or drowning in the data ocean? Thoughts on the metaphors of big data’, This Sociological Life,
29 October. Available at: https://simplysociology.wordpress.com/2013/10/29/swimming-or-drowning-in-the-data-ocean-thoughts-on-
the-metaphors-of-big-data/ (Accessed: 15 February 2021).
Mechanisms Participatory data stewardship 26
‘data’ itself is not the source of value – but rather actors’ positioning and
abilities to make use of data is what brings value.
But focusing on one single metaphor and its pitfalls reveals only part of
the power of narratives. Researching the metaphors that exist across
a range of prominent data narratives, Puschmann and Burgess identify
two interconnected metaphorical frames for data: data as a natural
force to resist or control; and data as a resource to be consumed. As a
natural force, data ‘exists’ in the world – like water, gravity or forests – just
waiting to be discovered and harnessed by humans. As a resource to be
consumed, data is conceptualised as a commodity that can be owned
and sold, then used – just like oil or coal.
These two metaphorical frames promote the idea that data is a value-
neutral commodity, and so shape data practices to be extractive
and promote commercial and competitive incentives. Many other
sociocultural meanings exist, or could exist, manifested through
metaphor. Recognising and understanding them allows us to interpret
these sociocultural meanings, and address how they shape data policies,
processes and practices.
One of the major metaphorical frames that embodies data as a natural force or
resource to be consumed is that of ‘big-data mining’, a narrative that emerged in
a range of corporate literature in the 1990s. This framed big data as a new ‘gold
mine’ waiting to be tapped, from which companies could gain wealth if they knew
how to extract it.38
Positioning ‘data’ as a resource from which insight can be ‘mined’ perpetuates the
conceptual model that economic value can be extracted from data, leveraging
the culturally embedded analogy that gold ore mined from the earth
38 Kerssens, N. (2019). ‘De-Agentializing Data Practices: The Shifting Power of Metaphor in 1990s Discourses on Data Mining’, Journal
of Cultural Analytics, 1(1), p. 11049. Available at: https://dspace.library.uu.nl/handle/1874/387149
Mechanisms Participatory data stewardship 27
(for example, in the American Midwest) delivered untold wealth for colonialist
settlers and ignores the unequal power dynamics of the extractive practices that
underpin these narratives.
Participatory implications:
The reality is that data does not exist in the natural world, lying dormant until
it is discovered and its value extracted, and the focus this metaphor places on
wealth value obscures the societal value data can have if stewarded well. To
overcome these framings it’s necessary for those affected by data to participate
in rethinking and reframing how data is conceptualised, to bring data narratives
(and therefore practices) in greater alignment with public expectations and
values.
Following existing metaphors like oil and mining, the natural world remains a good
conceptual foundation to develop alternative metaphors. One option is to see
data as a greenhouse gas, where its uses create harmful byproducts that must
be limited.39 Other metaphors might recognise that oil, gold and other natural
commodities are not perceived and treated in the same way as forests, rivers
and sunlight. These more ‘ecological’ metaphors have been posed as alternatives
that could promote concepts and practices where data is stewarded more
sustainably and less extractively.
39 Tisne M. (2019). ‘Data isn’t the new oil, it’s the new CO2’. Luminate Group. Available at: https://luminategroup.com/posts/blog/data-
isnt-the-new-oil-its-the-new-co2 (Accessed: 15 February 2021).
Mechanisms Participatory data stewardship 28
40 Arnstein, S. (1969). ‘A Ladder of Citizen Participation’. Journal of the American Institute of Planners, 35(4), pp.216-224. Available at:
https://www.tandfonline.com/doi/abs/10.1080/01944366908977225
Mechanisms Participatory data stewardship 29
41 The Consultation Institute. (2018). The Gunning Principles – Implications (2018). Available at: https://www.
consultationinstitute.org/the-gunning-principles-implications/ [Accessed 15 Feb. 2021].
Mechanisms Participatory data stewardship 30
42 Patel, R. and Peppin, A. (2020). ‘Making visible the invisible: what public engagement uncovers about privilege and power in data
systems’. Ada Lovelace Institute. Available at: https://www.adalovelaceinstitute.org/blog/public-engagement-uncovers-privilege-and-
power-in-data-systems/ [Accessed 5 Jul. 2021].
43 Wolpert, M. (2020). ‘Global mental health databank pilot launches to discover what helps whom and why in youth anxiety and
depression’. LinkedIn. Available at: https://www.linkedin.com/pulse/global-mental-health-databank-pilot-launches-discover-wolpert/?
trackingId=uh3v1y0AQ3WGMK1Ssvq9JA%3D%3D [Accessed 5 Jul. 2021].
44 Carroll, S.R., Hudson, M., Holbrook, J., Materechera, S. and Anderson, J. (2020). ‘Working with the CARE principles: operationalising
Indigenous data governance.’ Ada Lovelace Institute. Available at: https://www.adalovelaceinstitute.org/blog/care-principles-
operationalising-indigenous-data-governance/ [Accessed 5 Jul. 2021].
45 Carroll, S.R., Garba, I., Figueroa-Rodríguez, O.L., Holbrook, J., Lovett, R., Materechera, S., Parsons, M., Raseroka, K., Rodriguez-
Lonebear, D., Rowe, R., Sara, R., Walker, J.D., Anderson, J. and Hudson, M., (2020). The CARE Principles for Indigenous Data
Governance. Data Science Journal, 19(1), p.43. Available at: https://datascience.codata.org/articles/10.5334/dsj-2020-043/
Mechanisms Participatory data stewardship 31
A key aim of CityVerve was to build a ‘platform of platforms’ to gather and collate
data across Manchester about the needs of a UK city. This was built by telecoms
company Cisco, and the data was stored by BT’s cloud-based ‘Internet of Things
Datahub’.
Participatory implications:
46 University of Manchester Digital Futures. (2015). The CityVerve Project. Available at: http://www.digitalfutures.manchester.ac.uk/
case-studies/the-cityverve-project/ [Accessed 19 Aug. 2021].
Mechanisms Participatory data stewardship 32
Involve
47 Hawkins, A.J. (2020). ‘Alphabet’s Sidewalk Labs shuts down Toronto smart city project’. The Verge. Available at: https://www.
theverge.com/2020/5/7/21250594/alphabet-sidewalk-labs-toronto-quayside-shutting-down.
Mechanisms Participatory data stewardship 33
Doing this well can take time (through a long-form process, convening
and re-convening people over weekends, weeks or months at a time),
considerable resources (human and financial – to support people to
contribute through the process), and the skills to make complex issues
transparent and understandable.
This is particularly important given that questions about how data is used
and governed can be deeply societally relevant, but also opaque for non-
specialists unless considerable time and effort is made to address questions
around how uses of data-driven systems interact with the complex policy
contexts in which they take place. For instance, in a deliberative process
seeking advice on the parameters of health-data sharing, beneficiaries will
need to be informed about the use of data, the proposal for a particular
data governance mechanism, and also about the implications of its use for
health-data outcomes and the impact on patients.49
48 Solomon, S., & Abelson, J. (2012). ‘Why and when should we use public deliberation?’ The Hastings Center report, 42(2), p.17–20.
Available at: https://doi.org/10.1002/hast.27
49 Understanding Patient Data and Ada Lovelace Institute. (2020). Foundations of Fairness: Where Next For NHS Health Data
Partnerships? Available at: https://understandingpatientdata.org.uk/sites/default/files/2020-03/Foundations%20of%20Fairness%20
-%20Summary%20and%20Analysis.pdf
Mechanisms Participatory data stewardship 34
In addition to and beyond the one-off initiatives, calls have grown for the
‘institutionalisation’ of participatory and deliberative approaches to data
and AI governance. A recent OECD report, Catching the Deliberative
Wave defines ‘institutionalisation’ in two ways. The first is ‘incorporating
deliberative activities into the rules of public decision-making structures
and governance arrangements in a way that is legally constituted so as
to establish a basic legal or regulatory framework to ensure continuity
regardless of political change’. The second is ‘regular and repeated
processes that are maintained and sanctioned by social norms, which
are important for ensuring that new institutions are aligned with societal
values’.56
50 Understanding Patient Data and Ada Lovelace Institute. (2020). Foundations of Fairness: https://understandingpatientdata.org.uk/
sites/default/files/2020-03/Foundations%20of%20Fairness%20-%20Summary%20and%20Analysis.pdf
51 TheGovLab. (2020). The Data Assembly. Available at: https://thedataassembly.org/ [Accessed 5 Jul. 2021].
52 Balaram, B. (2017). ‘The role of citizens in developing ethical AI’. The RSA. Available at: https://www.thersa.org/blog/2017/10/the-role-
of-citizens-in-developing-ethical-ai [Accessed 5 Jul. 2021].
53 ICO (2019). Project explAIn Interim report. Available at: https://ico.org.uk/media/2615039/project-explain-20190603.pdf
[Accessed 5 Jul. 2021].
54 The Ada Lovelace Institute. (2021). The Citizens’ Biometrics Council. Available at: https://www.adalovelaceinstitute.org/project/
citizens-biometrics-council/ [Accessed 5 Jul. 2021].
55 Peppin, A. (2021). ‘How charting public perspectives can show the way to unlocking the benefits of location data.’ Ada Lovelace
Institute. Available at: https://www.adalovelaceinstitute.org/blog/public-perspectives-unlocking-benefits-location-data/ [Accessed
5 Jul. 2021].
56 OECD. (2020). Innovative Citizen Participation and New Democratic Institutions: Catching the Deliberative Wave. Available at:
innovative-citizen-participation-new-democratic-institutions-catching-the-deliberative-wave-highlights.pdf
Mechanisms Participatory data stewardship 35
57 Carugati, F. (2020). ‘A Council of Citizens Should Regulate Algorithms’. Wired. Available at: https://www.wired.com/story/opinion-a-
council-of-citizens-should-regulate-algorithms/ [Accessed 15 Feb 2021].
58 Jørgensen, M.S. (2020). ‘A pioneer in trouble: Danish Board of Technology are facing problems’. EASST. Available at: https://easst.
net/easst-review/easst-review-volume-311-march-2012/a-pioneer-in-trouble-danish-board-of-technology-are-facing-problems/
[Accessed 15 Feb 2021].
59 Grundahl, J. (1995). ‘The Danish consensus conference model.’ In Joss, S.& Durant, J. (eds) Public Participation in Science: The
Role of Consensus Conferences in Europe. Science Museum: London. Available at: https://people.ucalgary.ca/~pubconf/Education/
grundahl.htm
Mechanisms Participatory data stewardship 36
Participatory implications:
Not all data governance questions and issues will demand this level of
engagement – on questions where there is a high level of established
public support and mandate, meaningful transparency and consultation-
based approaches may be adequate.
60 GOV.UK, (2021).The Use of Public Engagement for Technological Innovation. Available at: https://assets.publishing.service.gov.uk/
government/uploads/system/uploads/attachment_data/file/955880/use-of-public-engagement-for-technological-innovation.pdf.
Mechanisms Participatory data stewardship 37
Embedding public and engage with the lived experiences of underrepresented groups
deliberation thoughtfully and communities. Examples include the rapid, online Sounding Board
and effectively requires model prototyped in 2017 by UK Government-funded programme
time and resourcing, Sciencewise,61 with a view to informing rapidly moving policy cycles, and
and this can be in tension a recent prototype undertaken during the COVID-19 pandemic crisis
with the imperative by the Ada Lovelace Institute in partnership with engagement agencies
to work rapidly in Traverse, Involve and Bang The Table.62
developing and designing
data-driven systems The Royal Society of Arts (RSA)’s landmark work on building a public
culture of economics is an example of how deliberative processes
have been complemented with community engagement and broader
outreach work with underrepresented communities.63 This initiative
complemented a deliberative Citizens’ Economic Council with an
economic inclusion community outreach programme (‘On the Road’)
that worked in locations identified as high on the index of multiple
deprivation.64
61 Patel, R. (2015). ‘Digital Public Engagement – Lessons from the Sounding Board’. Involve. Available at: https://www.involve.org.uk/
resources/blog/opinion/digital-public-engagement-lessons-sounding-board [Accessed 5 Jul. 2021].
62 Ada Lovelace Institute. (2020). Confidence in a crisis? Building public trust in a contact tracing app. Available at: https://www.
adalovelaceinstitute.org/report/confidence-in-crisis-building-public-trust-contact-tracing-app/ [Accessed 5 Jul. 2021].
63 The RSA. (2017). The Citizens’ Economic Council ‘On The Road’. Available at: https://www.thersa.org/projects/archive/economy/
citizens-economic-council/the-council2
64 The RSA. (2017).
Mechanisms Participatory data stewardship 38
65 Delacroix, S., and Lawrence, N. D. (2019). ‘Bottom-up Data Trusts: disturbing the ‘one size fits all’ approach to data governance’,
International Data Privacy Law, Volume 9, Issue 4, pp. 236–252. Available at: https://doi.org/10.1093/idpl/ipz014
Mechanisms Participatory data stewardship 39
The Ada Lovelace Institute has not identified any pilots that have been
successful, to date, in embedding deliberative approaches specifically in
the governance of data-sharing partnerships. However, there is potential
for developing future pilots drawing on and iterating from models in
related areas, as the following case study – a pilot project initiated by the
Wellcome Trust – demonstrates.
The Wellcome Trust has funded a pilot project with young people across India,
South Africa and the United Kingdom with a view to co-designing, building and
testing a mental-health databank that holds information about mental and
physical health among people from 17 to 24 years old. The goal is to create a
databank that can be used in the future to help improve the ways mental health
problems like anxiety and depression for young people living in high and low-
income countries around the world are treated and prevented. The Wellcome
Trust expects the databank to help researchers, scientists and young people
answer the questions, ‘What kinds of treatments and prevention activities for
anxiety and depression really work, who do they work for, and why do they work
among young people across different settings?’
Participatory implications:
66 Taylor, K. (2020). ‘Developing a mental health databank’. Medium. Available at: https://medium.com/wellcome-digital/developing-a-
mental-health-databank-99c25a96001d [Accessed 15 Feb. 2021].
Mechanisms Participatory data stewardship 40
The Wellcome Trust’s mental-health databank pilot signals the likely direction
of travel for many participatory approaches to data governance. Although it
remains to be seen what its effect or impact will be, its ambitions to embed
deliberative democracy approaches into its design, and involve young people
experiencing mental health conditions in the process, are likely to be replicable.67
67 Sage Bionetworks. (2020). Collaborating with youth is key to studying mental-health management. Available at: https://www.
eurekalert.org/pub_releases/2020-11/sb-cwy111720.php [Accessed 5 Jul. 2021].
68 Lansdell, S. and Bunting, M. (2019). Designing decision making processes for data trusts: lessons from three pilots. The Involve
Foundation (Involve). Available at: http://theodi.org/wp-content/uploads/2019/04/General-decision-making-report-Apr-19.pdf
Mechanisms Participatory data stewardship 41
69 Bietz, M., Patrick, K. and Bloss, C. (2019). ‘Data Donation as a Model for Citizen Science Health Research’. Citizen Science: Theory and
Practice, 4(1), p.6. Available at: http://doi.org/10.5334/cstp.178
Mechanisms Participatory data stewardship 42
serve society, and that knowing the consequences and potential benefits
of donating data was a critical factor influencing people’s decisions to
participate. Here, data governance becomes a collaborative endeavour,
where the legitimation, consent and pro-societal motives of data
donors become central to both the viability and the effectiveness of the
approach.
UK Biobank has blood, urine and saliva samples from 500,000 volunteers
whose health has been tracked over the past decade, as part of a large-scale
‘data donation’ research initiative. This has enabled it to gather longitudinal data
about a large sample size of the population, helping to answer questions about
how diseases such as cancer, stroke and dementia develop. It has also formed
the basis for intervening in response to COVID-19, with data about positive
coronavirus and GP/hospital appointments added, and 20,000 contributors
sharing blood samples for pandemic response purposes.71 A range of third-party
organisations including those in academia and industry can apply for different
layered levels of access, paying a subscription fee to be able to access this data.
Participatory implications:
Data donors are free to opt out and stop sharing their data at any point without
needing to provide a reason. If someone opts out, UK Biobank will no longer
contact the participant or obtain further information, and any information and
samples collected previously would no longer be available to researchers.
They would also destroy samples (although it may not be possible to trace all
distributed sample remnants) and continue to hold information only for archival
audit purposes.
72 Kim P, Milliken EL. (2019). ‘Minority Participation in Biobanks: An Essential Key to Progress: Methods and Protocols’. Methods
in Molecular Biology. 1897: pp.43-50. Available at: https://www.researchgate.net/publication/329590514_Minority_Participation_in_
Biobanks_An_Essential_Key_to_Progress_Methods_and_Protocols
73 Groenwold, R., & Dekkers, O. (2020). ‘Missing data: the impact of what is not there’. European Journal of Endocrinology, 183(4), E7-E9.
Available at: https://eje.bioscientifica.com/view/journals/eje/183/4/EJE-20-0732.xml.
Mechanisms Participatory data stewardship 44
Another potential issue is the lack of clarity about the exact terms on
which the data is ‘donated’, for instance, some people feeling that they
are expected to share their data to access a healthcare service, when in
reality there is no such expectation, or limited awareness about rights to
opt out of data sharing. This is a particular challenge with ‘opt out’, rather
than ‘opt in’ models of data donation (presently widespread practice
in the UK National Health Service), as recent societal discussion and
debate about the UK’s new centralised data-sharing infrastructure, and
opt-out mechanisms under the General Practice Data for Planning and
Research (GPDPR) proposals have highlighted.74
Empower
74 Machirori, M. and Patel, R. (2021). ‘Turning distrust in data sharing into “engage, deliberate, decide”.’ Ada Lovelace Institute. Available
at: https://www.adalovelaceinstitute.org/blog/distrust-data-sharing-engage-deliberate-decide/ [Accessed 5 Jul. 2021].
Mechanisms Participatory data stewardship 45
Examples of these approaches are relatively rare, but they do exist, are
increasingly emerging and include the following:
75 For more on data cooperatives see chapter 2 in Exploring legal mechanisms for data stewardship (Ada Lovelace Institute, 2021)
76 Data Collaboratives. (n.d.). Salus Coop. Available at: https://datacollaboratives.org/cases/salus-coop.html [Accessed 15 Feb. 2021].
Mechanisms Participatory data stewardship 46
Participatory implications:
Together with citizens Salus has developed a ‘common good data license for
health research’ through a crowd-design mechanism, which it describes as
the first health data-sharing license in the world. The Salus common-good
license applies to data that members donate and specifies the conditions that
any research projects seeking to use the member data must adhere to. The
conditions are:
• Health only: The data will only be used for health-related (i.e. treatment of
chronic disease) research.
• Non-commercial: Research projects will be promoted by entities of general
interest such as public institutions, universities and foundations only.
• Shared results: All research results will be accessible at no cost.
• Maximum privacy: All data will be anonymised and unidentified before any
use.
• Total control: Data donors can cancel or change the conditions of access to
their data at any time.
• individuals have the right to know under what conditions the data they’ve
contributed will be used, for what uses, by which institutions, for how long and
with what levels of anonymisation
• individuals have the right to obtain the results of studies carried out with the
use of data they’ve contributed openly and at no cost
• any technological architecture used allows individuals to know about and
manage any data they contribute.
This means that these approaches are rarely appropriate for all
beneficiaries and data subjects, and so cannot claim to be fully inclusive
or representative. Data cooperatives can, therefore, struggle to generate
the scale and level of participation from data subjects that they might
hope for, but they can nevertheless help broaden out the range of
intelligence informing data governance.
When designing a
participatory process Who to involve when developing
to meet a specific
objective, the choice participatory mechanisms
about who to involve
matters as much as which
types of involvement or
participation mechanisms What all these different methods and approaches have in common is that
are used they seek to involve beneficiaries in the use, development and design of
data and AI, and that they involve some element of sharing power with
those beneficiaries in the way that data-driven systems and governance
frameworks themselves are designed.
These three key questions are informed by a recognition that the data
stewards’ responsibility is not just to manage data itself effectively, but
also to recognise that data often relates, either directly, or indirectly to
people (beneficiaries). As well as recognising the rights of data subjects,
and the potential benefits and harms of data use to beneficiaries, data
stewards need to understand that when data omits or excludes people,
it has the potential to have harmful consequences. This can happen, for
instance, by discriminating against or underrepresenting some people’s
interests and concerns. This means that participation can be as much
about including or involving those who do not have a direct relationship
with the data as assembled, as those who do.
Benefits Participatory data stewardship 49
77 ESRC website. (n.d). Why public engagement is important. Available at: https://esrc.ukri.org/public-engagement/public-engagement-
guidance/why-public-engagement-is-important/ [Accessed 15 February 2021]
78 There are various definitions of values-led and instrumental rationales for public engagement including: Stirling, A. (2012). ‘Opening
Up the Politics of Knowledge and Power in Bioscience’. PLoS Biology 10(1). Available at: https://journals.plos.org/plosbiology/
article?id=10.1371/journal.pbio.1001233
79 The Involve Foundation (‘Involve’), (2019). How to stimulate effective public engagement on the ethics of artificial intelligence.
Available at: https://www.involve.org.uk/sites/default/files/field/attachemnt/How%20to%20stimulate%20effective%20public%20
debate%20on%20the%20ethics%20of%20artificial%20intelligence%20.pdf [Accessed 15 February 2021].
Benefits Participatory data stewardship 50
80 Page, S. (2011). Diversity and complexity. Princeton, N.J: Princeton University Press.
81 Clarke, R. (2015). Valuing Dialogue: Economic Benefits and Social Impacts. London: Sciencewise Expert Resource Centre. Available
at: https://sciencewise.org.uk/wp-content/uploads/2018/12/Valuing-dialogue-2015.pdf [Accessed 15 February 2021].
82 Centre for Public Impact. (2018). Finding Legitimacy. Available at: https://www.centreforpublicimpact.org/assets/documents/Finding-
a-more-Human-Government.pdf [Accessed 15 February 2021].
Benefits Participatory data stewardship 51
Participatory approaches We redefine legitimacy in the context of data, therefore, as the broad
to data governance can base of public support that allows companies, designers, public
help engender some servants and others to use data, and to design and develop data-
societally beneficial governance frameworks. The absence of this legitimacy has been
outcomes illustrated in a series of public, controversial events that have taken
place since 2018. Examples include the Cambridge Analytica/Facebook
scandal, in which political microtargeting was misused;83 or the scandal
that emerged when the NHS shared sensitive patient data with private
research company DeepMind, with limited oversight or constraints.84
More recently, governments’ efforts to implement and deploy digital
contact tracing across the world have also met with considerable public
debate and scrutiny.85
83 Fruchter, N., Yuan, B. and Specter, M. (2018). Facebook/Cambridge Analytica: Privacy lessons and a way forward. Internet Policy
Research Initiative at MIT. Available at: https://internetpolicy.mit.edu/blog-2018-fb-cambridgeanalytica/.
84 ICO. (2018). Royal Free - Google DeepMind trial failed to comply with data protection law. Available at: https://ico.org.uk/about-the-
ico/news-and-events/news-and-blogs/2017/07/royal-free-google-deepmind-trial-failed-to-comply-with-data-protection-law/.
85 Ada Lovelace Institute. (2020). Exit through the App Store? Rapid evidence review. Available at: https://www.adalovelaceinstitute.org/
evidence-review/covid-19-rapid-evidence-review-exit-through-the-app-store/ [Accessed 5 Jul. 2021].
86 Ada Lovelace Institute. (2021). Exploring legal mechanisms for data stewardship. Available at: https://www.adalovelaceinstitute.org/
report/legal-mechanisms-data-stewardship/ [Accessed 5 Jul. 2021].
Benefits Participatory data stewardship 52
These are all potentially valid outcomes that emerge from embedding
participatory data stewardship mechanisms. The mechanisms to enable
these outcomes can be different – if they are to be impactful, they are
quite likely to vary depending on context, use of the data, the type of data
being governed, and those who are most likely now and in future to be
impacted by the data use and governance.
Conclusion Participatory data stewardship 53
Conclusion
Effective participation must start with the goal of informing people about
their data – with transparency (the right to be informed and the right
to take meaningful action based on what one knows), where the data
steward shares information about what is happening to people’s data in
ways that enable them to take action.
Methodology
87 Patel, R., Gibbon, K. and Peppin, A. (2020). Exploring principles for data stewardship. [online] Ada Lovelace Institute. Available at:
https://www.adalovelaceinstitute.org/project/exploring-principles-for-data-stewardship/ [Accessed 16 Feb. 2021].
88 The Ada Lovelace Institute. (2021). Exploring legal mechanisms for data stewardship. Available at: https://www.adalovelaceinstitute.
org/report/legal-mechanisms-data-stewardship/
Appendix Participatory data stewardship 56
Appendix
Citizen control
Partnership
Placation
Consultation Degrees of
tokenism
Informing
Therapy
No power
Manipulation
Data steward
Data beneficiary
Acknowledgements
We would like to thank the following colleagues for taking time to review a
draft of this paper, and offering their expertise and feedback:
• Alix Dunn, Computer Says Maybe and Ada Lovelace Institute Board
member
• Dr Allison Gardner, Keele University
• Andrew Strait, Ada Lovelace Institute
• Astha Kapoor, The Aapti Institute
• Carly Kind, Ada Lovelace Institute
• Professor Diane Coyle, Bennett Institute for Public Policy, Cambridge
University
• Hetan Shah, British Academy and Ada Lovelace Institute Board
member
• Jack Hardinges, Open Data Institute
• Dr Jeni Tennison, Open Data Institute
• Kasia Odrozek, Mozilla Foundation
• Katie Taylor, Wellcome Trust
• Professor Lina Dencik, The Cardiff Data Justice Lab
• Lisa Murphy, NHSx
• Dr Mahlet Zimeta, Open Data Institute
• Miranda Marcus, Wellcome Trust
• Dr Miranda Wolpert, Wellcome Trust
We are also grateful to techUK and to the Cardiff Data Justice Lab, who
both organised workshops to facilitate input into this research, and for
attendance at those workshops by a range of technology companies,
policymakers, civil-society organisations and academics.
The mission of the Ada Lovelace Institute is to ensure that data and
AI work for people and society. We believe that a world where data
and AI work for people and society is a world in which the opportunities,
benefits and privileges generated by data and AI are justly and equitably
distributed and experienced.
Website: Adalovelaceinstitute.org
Twitter: @AdaLovelaceInst
Email: hello@adalovelaceinstitute.org
designbysoapbox.com
Permission to share:
This document is published
under a creative commons
licence: CC-BY-4.0 ISBN 978-1-8382567-6-0