The Profession
.............................................................................................................................................................................................................................................................
Alternatives to Social Science One
Margaret Levi, Stanford University
Betsy Rajala, Stanford University
ABSTRACT
This article responds to King and Persily’s(2019) proposal for a new model of
industry–academic partnership using an independent third party to mediate between firms
and academics. We believe this is a reasonable proposal for highly sensitive individual-level
data, but it may not be appropriate for all types of data. We explore alternative options to
their proposal, including Administrative Data Research Facilities, Data Collaboratives at
GovLab, and Tech Data for Social Good Initiative at the Center for Advanced Study in the
Behavioral Sciences. We believe social scientists should continue to explore, evaluate, and
scale a variety of industry–academic data-sharing models.
P
rivate companies possess valuable data that are
largely inaccessible for social science. The incentives
for academics and industry are sufficiently different
to make any scalable collaboration difficult. King and
Persily (2019) offer a solution. They propose a part-
nership model that is based on an independent third party
(i.e., Social Science One) that adjudicates between companies
and academics on issues of data distribution. This is ideal for
collaborations for which protecting the security of fine-grained
individual-level data and a propriety underlying algorithm is a
necessary condition for making the data available to academics. A
third-party adjudicator is essential when a company confronts
external pressures to release data regarding something the world is
desperate to understand. Social Science One is attempting to
leverage its mediation role to produce a mutually beneficial
agreement that ensures data privacy and addresses a company’s
reputational concerns but that also prioritizes data quality. Face-
book’s data could reveal important new forms of political influ-
ence, and King and Persily are working to ensure that the data are
made available and analyzed responsibly.
Of course, this is not a one-size-fits-all model, nor is it intended
to be. There is a variety of data that, although not collected with
scholarly research purposes in mind, turn out to be useful as
evidence in academic claims. For example, Putnam (2000) repur-
posed marketing data for his book Bowling Alone to show how
individuals have become increasingly disconnected from their fam-
ily, community, and democratic structures.
1
In economics, Cohen
et al. (2016), Cook et al. (2018), and Cramer and Krueger (2016) used
Uber data to explore questions of consumer surplus, the gender gap,
and how technology has changed the transportation industry,
respectively. Although these examples are not as institutionalized
as Social Science One, they do provide different types of collabor-
ations that previously worked and could be replicated.
These types of partnerships drastically reduce the costs for
both academics and industry. Researchers are free to explore
questions that can be answered with the data provided; they do
not need to go through the process of submitting an extensive
proposal to a third party, and companies provide only the data
that fit with their business interests. Of course, this may mean
that researchers are not granted access to all of the data that they
want. However, attempts to access a company’s entire data
archive should not delay or prevent access to some of its data.
In many cases, even partial data from private companies can
surpass the quality of alternative data sources. Having access to
entire datasets from a wide variety of companies is the ideal, of
course, but it simply is not realistic—yet.
To obtain access to the largest possible proportion of private
data, social scientists must use a variety of different partnership
models. Fortunately, there are several efforts exploring additional
data-sharing models for academic–industry partnerships, includ-
ing the following:
• Administrative Data Research Facilities (ADRFs) collate gov-
ernment and private data across agencies, companies, and
jurisdictions in a secure yet accessible way.
2
ADRFs act as
both a data-storage facility and an intermediary to assess the
validity of research questions. However, the adjudication
function of ADRFs is not as intensive as Social Science
One. Therefore, this model of collaboration will meet the
needs of a wide range of data producers except those that
have serious reputational concerns requiring a more hands-
on approach to determine acceptable research questions
(e.g., Social Science One’s relationship with Facebook).
• Data Collaboratives at GovLab allows partner firms to
engage in various approaches ranging from reliance on
trusted intermediaries, in the spirit of Social Science One,
to the creation of data cooperatives, in which data are
provided to one organization or researcher.
3
This option is
Margaret Levi is the Sara Miller McCune director at the Center for Advanced Study
in the Behavioral Sciences and professor of political science at Stanford University. She can
be reached at mlevi@stanford.edu.
Betsy Rajala is a program director at the Center for Advanced Study in the Behavioral
Sciences at Stanford University. She can be reached at betsy.rajala@stanford.edu.
710 PS • October 2020 © American Political Science Association 2020 doi:10.1017/S1049096520000438