SSRN Id3293275
SSRN Id3293275
Abstract: Increased computing power and advanced machine learning (ML) techniques have
accelerated the development and use of “artificial intelligence” (AI). Despite the increased
commercial focus on AI, we have limited information about how AI products are developed by
resource constrained startups. We collect survey data from startups producing commercial AI
products to provide a glimpse in how these product impact labor across industries. This paper
chronicles the last five years of AI startup survey data, documenting information about their
business models, product, suppliers, partners, and customers. Our descriptive findings add to
conversations on whether AI products enhance or replace human capabilities; which algorithms,
data protections, and frameworks high-tech startups use to develop their AI products; and which
geographies, customer-types, and industries are sales targets.
*This manuscript is a revised version of the working paper released on November 29, 2018 with
the same title.
intelligence” (AI) development for a wide range of uses. The potential of these new technologies
has prompted numerous assessments that these products will drastically alter the economy
(Acemoglu and Restrepo, 2018; Frey and Osborne, 2017; Furman and Seamans, 2019; Felton et
al., 2021). According to some commentators, we are entering a “Race Against the Machine”
(Brynjolfsson and McAfee, 2011), a “jobless future” (Ford, 2015), a “Fourth Industrial
Revolution” (Schwab 2016) and the “Second Machine Age” (Brynjolfsson and McAfee, 2014).
Results from these studies differ in their estimation of AI’s impact and their tone1, yet there is
consensus that AI development and using big data to train AI not a passing trend.
Much of this new literature on the economic consequences of AI has been based on
estimated capabilities of the new technology, without accounting for possible difficulties in
commercialization and technology adoption. For instance, the Frey and Osborne study (2017) is
based on the assessments of a small panel of machine learning experts who were asked which of
70 occupations were “completely automatable” in 2013. Even if these assessments are correct, just
because a job is automatable does not mean that it will be automated. Moreover, both theoretical
models and historical evidence show that even when most of the labor of an industry is automated,
If AI is likely to drastically change the economy in a decade or so, then we should see
evidence of that at the actual businesses where AI is being applied today. That is, if 47% of jobs
will soon be at risk from AI, then surely jobs will be at risk already at those firms using AI
1
On the more pessimistic side estimates that 47% of jobs in the US will be “at risk of automation” in the next
decade or so (Frey and Osborne 2017). On the less-pessimistic side, Bessen (2019) and Furman and Seamans (2019)
discuss the potential for AI to augment labor’s capabilities.
To glimpse the impact of AI development, our paper reports on a global survey of startups
developing and licensing commercial AI products. The survey questionnaire covers a wide range
of topics, including questions about startups’ markets and customers, their AI technologies and
frameworks used in product development, their use and protection of training data, their product
users and use cases, and their customers’ demographics and industries. These findings paint a vivid
picture of how startups pull together the needed resources to develop and monetize their AI
applications. And that picture is at odds with alarmist predictions about the impact of AI. Rather
than drastically reducing jobs, current data commercial AI applications are much more often about
enhancing human capabilities than about reducing labor costs. While some occupations are
experiencing job losses, others are experiencing job creation. The actual effect of AI on jobs varies
The survey provides a window into other topics in addition to how AI effects labor,
including how AI startups address barriers to entering this nascent industry, how they protect
customer data and respond to new data protection regulations such as the EU General Data
Protection Regime (GDPR), and the extent to which they develop their own machine learning
algorithms or rely on third party providers. While the survey establishes that there is a vibrant
startup community of AI application developers and many new startups enter each year, it is
possible that investments by more established firms may create barriers to entry in some markets.
Respondents answer questions on data, hardware, software and firm size, which have previously
diversity or scale of data collected by larger firms, 80 percent of survey respondents report using
their customers’ data to train AI. Customers proprietary data correlated with increased future
performance, suggesting that the inability for startups to access barriers to scale from having less
imitable data-resources. In the last several years, access to cheaper computing power (i.e., IT
hardware) has made it easier to enter the industry and develop their products, as introductory offers
from cloud provider and venture capital “spray and pray” investing have reduced the cost of IT
(Ewens et al., 2019). Moreover, access to standardized tools and other resources from cloud
suppliers may lead to time saving, enabling startup to focus on differentiating their product instead
of building core app infrastructure (Impink, 2023). These more standardized tools may benefit
performance without necessarily inhibiting innovation (Miric et al., 2022). Due to the availability
of cloud IT and basic development tools, labor (Rock 2022) and proprietary data (Bessen et al,
In addition to basic facts about the business of AI, this study provides evidence that bears
on several major questions that have been raised in the literature, including the impact of AI on
jobs and occupations, and the role of data as a critical resource for AI-enabled startups. The first
question, noted above, concerns the impact of AI on jobs. Frey and Osborne (2017) base their
literature highlights several economic and technical reasons this might not be a sufficient metric
for understanding the impact of AI on jobs. One reason is that automating some or even most tasks
an occupation performs does not necessarily eliminate the occupation. Historically, most
(Rosenberg, 1963). Even though automation increased productivity, workers may still be cogs in
the broader process (Chandler, 1962). Bessen (2016) finds that of 270 detailed occupations listed
in the 1950 Census, only one can be described as having been eliminated due to automation,
namely the job of elevator operator. Furthermore, partially automating a job can increase
Restrepo, 2018). This is because automation tends to decrease prices, driving greater demand.
When demand is elastic enough, greater demand will offset the labor-saving effect of automation.
For example, during the 19th century, the textile industry was heavily automated, yet employment
rose (Bessen, 2019). More recently, the automatic teller machine (ATM) automated some of the
work of bank tellers, yet their employment grew as well (Bessen, 2016). Also, the effect of AI is
not just to automate tasks which replace humans, but also to enhance human capabilities at both
In the recent empirical literature, (Arntz et al., 2017) modify the basic estimates of Frey
and Osborne to account for the partial nature of automation. However, they do not consider the
adoption of the technology or the labor demand effects that might cause automation to increase
employment. The McKinsey Global Institute (Manyika, Chui et al., 2017; Manyika, Lund et al.,
2017) attempts to take a range of hypothetical adoption rates into account in estimating impacts,
and they also consider hypothetical labor demand effects. Accenture (2018) also estimates how
the impact of AI on employment will vary depending on the role of different skills, and also predict
where there will be deficits and surpluses of these skills on a geographic basis.
Several papers have looked at the impact of robots on employment, with differing results
(Acemoglu and Restrepo 2017, Graetz and Michaels 2015, Dauth et al. 2018). Graetz and Michaels
Federation of Robotics (IFR) and find that robots added an estimated 0.4 percentage points of
annual GDP growth between 1993 and 2007 on average for the 17 countries in their sample
(accounting for about one-tenth of GDP growth during this time period) while having a slightly
In contrast, Acemoglu and Restrepo (2017) use the IFR data to examine the impact of the
increase in industrial robot usage on regional U.S. labor markets between 1990 and 2007. These
authors find that industrial robot adoption in the United States was negatively correlated with
employment and wages during this time period—according to their estimates, each additional
robot reduced employment by six workers and that one new robot per thousand workers reduced
wages by 0.5 percent. The authors note that the effects are most pronounced in manufacturing,
particularly in routine manual and blue-collar occupations, and for workers without a college
degree.
Dauth et al. (2017) combines German labor market data with IFR robot shipment data and
finds that while each additional industrial robot leads to the loss of two manufacturing jobs, enough
new jobs are created in the service industry to offset and in some cases over-compensate for the
negative employment effect in manufacturing. This finding echoes a broader study by Autor and
Salomons (2018) which explores the general impact of productivity growth on employment,
including both own industry and cross industry effects, and finds that productivity growth does not
reduce employment in general, thanks in large part to positive spillovers into related sectors.
A second question we address is which occupations will be affected. Frey and Osborne
(2017) argue that lower wage occupations will experience greater job losses than higher wage
occupations. The McKinsey Global Institute projects, somewhat differently, that high wage
is blind to the color of your collar” and many professions will be devastated. Susskind and
Susskind (2015) argue that new technology will lead to the decline of the professions. Our survey
provides some evidence about which occupations are growing and which are losing jobs in
response to AI. Other recent papers that take the task- or ability-based approach include
Brynjolfsson, Mitchell and Rock (2018) and Felten, Raj and Seamans (2021). Moreover, a large
literature has explored the differential impact of technology on different groups of workers. This
includes the literature on skill-biased technical change (Katz, 1999) and on job polarization (Goos
Next, a third question our study addresses are the extent of entry barriers into AI markets.
Investment in AI is currently dominated by large firms, especially a few large tech firms with slack
cloud compute resources – Amazon, Google, and Microsoft. Moreover, these firms also control
PyTorch), and they have tons of customers that provide data to use their services and products. In
particular, some observers, such as Stucke and Grunes (2016), argue that the combination of data
and network effects creates substantial entry barriers in online markets. Others point to scale
barriers stemming the scale, IT infrastructure, and data-resource wielded by the largest technology
firms (Benzina, 2019; Khan 2017). For example, because Google has an enormous amount of
search data, it might be hard for a new competitor to compete in the search engine market.
Others contend that data, by itself, is not likely to pose an entry barrier (Lambrecht and
Tucker 2015, Sokol and Comerford 2016). It may be that Google’s advantage comes not from the
amount of data, but from the results of the product-focused experiments. Varian, Google’s chief
economist, and co-authors (2010) report that Google conducted 6,000 experiments on its search
there may be diminishing returns to the amount of data beyond a certain point. For instance, Bajari,
Amazon’s chief economist, and co-authors (2019) find that increasing the number of online
products that Amazon tracks does not significantly improve machine learning prediction accuracy
after a certain point, implying that data quantity provides only a low barrier to entry. Amazon and
other large technology companies benefit from the breadth and recency (Chiou and Tucker, 2017)
of the data captured. For example, startups are more likely to sell a subset of the broader product
offering of a larger company and to engage with small to medium size customers. In any case, our
survey provides basic information about startups’ access to data and circumstantial evidence on
Lastly, the fourth question this study addresses is how startups use data protections. Data
protection and data privacy has become a flashpoint in the media, due in part to high profile data
breaches such as at Equifax in 2017 and in part to high profile exposure of Facebook user’s
personal data to Cambridge Analytica in 2016 and 2017. Partially in response to these events,
government regulators have instituted tighter rules on data protection. This has most notably
manifested in Europe in the form of General Data Protection Regime (GDPR). There has been
some concern that the increased data protection required under GDPR may constitute a barrier to
entry for startups, and early research suggests that GDPR may in part be contributing to a
slowdown in VC investment in European based startups (Jia et al., 2021). Additional evidence
suggests that GDPR more negatively effects smaller web technologies firms than larger ones. For
instance, Google gains significant share after GDPR was passed at the detriment of smaller firms
protections that AI startups have in place and allow us to compare the use of data protections across
Sample selection
The main portion of our sample comes from the Crunchbase database of firms. Each year
we select startups tagged as AI firms or described with the term ‘artificial intelligence’ or machine
learning’. At the time of selection, these firms are listed as in-business and operational. We focus
on English-language countries, where Crunchbase has greater coverage, and omit China. The list
of startups selected are primarily based in developed countries, which the vast majority of firms
are headquartered in the USA, Canada, UK, or the EU. In an attempt to have a more homogenous
sample, we only include responses from startups that are less than ten years old and less than 500
employees. This enables us to focus on high growth startups, while weeding out mom and pop
In the first round of the survey, we also included startups from three additional sources:
alumni of the Creative Destruction Lab, a startup incubator, who were identified as working with
AI; a list of machine learning companies obtained from Philipp Hartmann and Joachim Henkel
(Hartmann and Henkel, 2018); also, O’Reilly Media ran a notice of the survey in its AI newsletter,
providing a link to the online survey. Once a startup is added to our sample list, we send them a
survey every year. Moreover, we reach out to multiple email addresses at the firm. For instance,
we would email the individual who responded from the firm in the prior year and any other email
our response rate is around 8%. In total, firms responding at least once our response rate is 22%.
Survey rounds
Survey 1 was administered online using Qualtrics from in the summer of 2018. Surveys 2-
4 occurred the first few months of 2020, 2021, and 2023. Survey 5 occurred in the early spring of
2023. They survey consists of around 36 questions in a typical year. These questions (excluding
name and email questions) are pre-tested on academics and on roughly a half dozen firms with
AI development; and in survey 4 we added question on the governance of AI. Survey 5 asks the
same questions as survey 4, with the goal of building a panel of responses throughout time. After
each round of the survey, we receive feedback from startups that helps us adapt the survey in the
next round. However, the core questions on firm demographics, customers, and core product
development that we report on have remained identical in all five survey rounds.
Across all surveys, we have received 1,178 responses from 917 unique startups. Figure 1
shows the number of responses by survey round. More than 160 startups have responded to the
survey more than one time in the last five years; two startups have responded all five times.2 Figure
2 reports then number of times that each startup has responded to the survey. This will be
2
Expect your gold star to be delivered via mail, if you responded all five times.
10
298
300
263
254
225
count of respondents
200
138
100
0
Survey 1 Survey 2 Survey 3 Survey 4 Survey 5
814
800
600
count of respondents
400
200
133
24
4 2
0
1 2 3 4 5
Number of annual survey responses
11
We show the representativeness of subsets of our data in several papers (Bessen et al.,
2022, Bessen et al, 2023). Generally, we find firms based in California are less likely to respond.
In certain subsets of the data, we find that firms that are less than two years old are less likely to
respond, potentially suggestion that more nascent ventures may not have full time employees.
Table 3 reports the basic summary statistics for responding firms in all surveys. The
startups in our sample are quite small with around half of the firms having less than 10 employees
(Figure 4) and existing for three or fewer years (Figure 5). More than half of the responding
startups are located in North America, for multiple reasons. First, there is a higher concentration
of startups in California, New York, Massachusetts, and Toronto. However, we are no blind to the
fact that Crunchbase has coverage issues in the developing and non-English language countries
and the survey was sponsored and run by two US universities. Upcoming survey will be sponsored
by HEC Paris in addition to NYU Stern and Boston University to increase response rates withing
12
250
227
196
200
163
count of respondents
153
150
132
96
100
72
63
50 41
27
0
1 2 3 4 5 6 7 8 9 10
600
541
518
count of respondents
400
200
73
32
6
0
5 25 50 150 250
13
612
600
count of respondents
400
339
188
200
39
0
Americas Asia Europe MEA
Survey respondents disproportionately sell to mid-sized firms. Figure 7 shows the share
of customers in each size class from the survey. Almost half of the startups sell to firms in the
middle category with 51-1000 employees, but these firms make up only 26% of the market as
measured by employment size from the US Census Longitudinal Business Database (LBD). The
responding startups also sell to both smaller and larger customers, but proportionately less than
would be expected given the distribution of firms in developed countries. Most of the firms are
shipping product (71%), but 20% are in beta testing, and 9% are pre-beta (Figure 9).
14
.5
.4
% of respondents
.2 .1
0 .3
Shipping
60
% of respondents
40
Beta
20
Pre-beta
0
15
become a bit older, from 3.7 years old on average in 2018 to 5.3 years on in 2023. The mix of
responses by geography shows a decline in the percent of respondents from Canada and an increase
in the percent of responses in Europe. Our initial list of startups included data from Creative
Destruction Labs (University of Toronto) in Canada, which may explain of this change. However,
we also had specific data from the TUM accelerator program in Germany in the first round of the
The number of respondents with an MBA is also declining over time, suggesting earlier AI
startups had a higher proportion of management education. This potentially suggests entry barriers
were higher initially, required greater credentialling to kickstart the venture. Lastly, about 10% of
16
The survey respondents sell to customers in all major industry sectors as well as to
individual consumers (Figure 10). In this sense, machine learning is a general-purpose technology
that can be used in a variety of applications across a variety of industries (Cockburn et al 2018).
This broad distribution across industries suggests that the startup environment is healthy
and many opportunities for entry exist. However, some industries may have relatively higher entry
barriers than others. If there are no entry barriers, then both large and small firms invest based on
the size of economic potential in an industry. This means that without entry barriers, the
distribution of AI development spending should look the same for smaller firms as it does for
larger ones. If, on the other hand, startups face significant entry barriers in an industry, then large
firms would spend proportionately more than smaller firms in industries with entry barriers,
holding the anticipated economic potential across industries constant. Comparing startup funding
in different industries relative to total investment in AI in those industries helps identify those
Although our survey only includes startups and not large established firms, the McKinsey
Global Institute conducted a survey of top managers at large and small firms (Bughin et al. 2017).
By comparing estimates from the two studies for different industries, we can identify differences
in relative funding that might signal entry barriers. The McKinsey study reports the current
adoption of AI for select industries, presented as a weighted mean by firm size (Bughin et al. 2017:
Exhibit 4). Using this as a proxy for total AI investment in each industry for 2018, Figure 11 shows
the share of startup participating in each industry from our survey compared with the share of total
17
the same in many industries, but in three industries total investment—likely reflecting large firm
and high tech and telecommunications. On the other hand, it appears that startup investments in
AI applications serving retail and finance are disproportionately larger, likely reflecting venture
Figure 10 – Industry
15 10
5 % of respondents
0
Retail Fin Health IT Manuf Comm Util Serv Govt Edu Agr Law
18
proposition”—the survey asked firms to rate various benefits that their products provided to
customers. Each benefit is rated on a 5-point scale from “strongly disagree” to “strongly agree”
(or “not applicable”). Figure 12 shows the share of responses ranked “strongly agree.”
The three most frequent benefits reported are capabilities to make predictions or decisions,
to manage and understand data, and to create new and improved products and services. These
answers provide a gauge for understanding how much AI enhances human capabilities and how
much, instead, it tends to replace them. Of the survey responses, 53% strongly agree that their
19
That is, AI appears to be about enhancing human capabilities, not just replacing them. AI may very
explore further below), but because facets of the technology appear to augment human capabilities,
it might well be associated with increased employment and wages for at least some occupations.
certain sectors of the economy. We find that, for firms that sell to customers in agriculture,
manufacturing, utilities and transportation industries, 74% strongly agree that their products
benefit customers by automating routine tasks and/or reducing labor costs. For firms that do not
sell to those industries, only 46% strongly agree. This difference across industry groups is highly
Figure 13 reports the share of firms reporting whether each occupational group at their
customer firms is a user or not. White collar occupations are listed most frequently, including
professionals (58%), managers (49%), and sales and marketing occupations (35%). Occupations
that include clerical, administrative, manual or service work are less likely to be listed. In this
regard, the distribution of users of AI across occupations is very similar to the distribution of
20
50
40
% of respondents
30
20
10
0
21
skill requirements. But does the use of these products require STEM skills or specialized
training? The survey responses suggest that in most cases specialized computer skills or specific
training demands are modest. Only 10% of firms require users to have expert coding or data
skills. 59% require general familiarity with computers and the remainder require no special skills
at all.
In terms of the training provided to users, 86% of firms report this training takes one
week or less. 12% report that 2 to 4 weeks of training is required and 2% report up to 3 months
of training is required. Fully 44% of the firms provide no training to their customers. Of those
firms that do provide training, 70% provide onsite training classes, 32% provide offsite training
The analysis of customer benefits above suggests that most firms are oriented to enhancing
customer capabilities rather than reducing customer labor costs, especially in the broad service
sector. This suggests that AI might have some job-creating potential, especially for the
professional, managerial, sales and marketing occupations that tend to be the users of these
products. On the other hand, AI also reduces labor costs for some customers. Moreover, these
effects might differ across occupations because use of AI differs dramatically across occupations.
Figure 14 reports the share of firms that expect their products to create jobs or eliminate jobs in
different occupational groups at their customers, listed in the same order as in Figure 13.
22
benefit form augmentation of their capabilities (i.e., increased productivity). At the same time,
many jobs will be eliminated, especially in the three occupational groups that use AI relatively
less. In many cases, jobs will be created in some occupations and jobs will be destroyed in other
occupations at the same affected firms. Of the firms that responded to this question, almost half
(46%) reported that their products both create jobs and destroy jobs at customer firms. 26% report
only creating jobs and 28% report only eliminating jobs. It is possible, of course, that survey
respondents, perhaps sensitive to publicity about job losses, shaded their answers to reflect better
job outcomes than is actually the case. Nevertheless, Figure 13 reveals dramatic relative
differences between occupational groups. Professionals, managers, and quantitative sales workers
may benefit from AI more than administrative, frontline, and manual workers.
23
In building their products, responding startups use many different types of algorithms, AI
development frameworks, and cloud tools to develop their products. The most commonly used
technology is natural language understanding and text analysis (63% of firms), followed by natural
language classification and decision management (both at 56%). These are followed by visual
recognition, including image, face and video (45% of firms) and sentiment/emotion analysis (43%
Overall, many more firms develop their own software for the most commonly used
technologies rather than purchase them from an external vendor. Only in two areas do firms rely
more on outside vendors: speech recognition with 19% using external products while 13% develop
their own, and natural language translation, with 17% using external software and 13% develop
their own.
Regarding the type of algorithm used (Figure 14), 65% of reporting firms use neural
networks including recurrent, convolutional, and generative adversarial neural networks. The next
most common methods are clustering algorithms (46% of firms) and Bayesian or other methods
of probabilistic inference (41% of firms). Other methods are used less frequently. Most startups
use TensorFlow or PyTorch (Figure 15). However, the selection of the AI development framework
depends on the fit of that framework with the algorithm, training data, and product. For instance,
Figure 16 shows that startups using neural networks or ensemble learning, commonly discussed
as more sophisticated algorithms, are more likely to use PyTorch and Google ML than TensorFlow
and Keras.
24
.6
% of respondents
.2
0 .4
25
40
Tensor TensorFlow
PyTorch
30 PyTorch
Keras
% of category
Google ML
Keras
20
Goog ML
10
0
NN or EL Not NN or EL
Almost all the startups report that their products are cloud-based (97%) although 33% of
the firms additionally provide software on premises. Only 3% of the firms provide software on
premises only. Most of the firms (68%) provide a commercial application using AI. Some use the
AI in their own products and services (43%) and 12% provide developer tools for AI applications
Thus, while startups use a wide variety of technologies to perform a variety of functions, a
typical firm uses Amazon AWS cloud platform services, neural networks, and either TensorFlow
26
Cloud
.6
.4
Both
.2
On-prem
0
Amazon
60
% of respondents
40
Goog
20
Msft
0
27
Amazon
60
% of respondents
40
Goog
20
Msft
0
Retail
Fin
Health
No Tech Data Collab.
IT
Manuf
Comm
Util
Serv
0 5 10 15
% of respondents
28
Startups also use a wide variety of data. 57% of the firms use unstructured text, 44% use
transaction data, 38% use images, 37% use administrative data or other structured records. A
smaller share of firms use audio, video, or other types of data. These data are mainly used to train
algorithms. Consequently, the algorithms are re-trained as more data accumulates. Roughly a
quarter of firms report refreshing their models daily, weekly, or monthly each. 13% of firms report
Startup firms generally use other people’s data. The most common source of data is from
customers. 80% of the startup firms report using customer data, including data about their
customer’s customers and users as well as other data. 63% use data from third parties, including
government data, data scraped from the Internet, and public benchmarking data. 51% of the firms
report using their own proprietary data. Most of this use is in combination with data from other
To protect their access to data, startup firms who use customer data retain secondary reuse
rights 52% of the time. To control the use of proprietary data between the firm and its customers,
83% of the firms use legal contracts that specify data uses. Additionally, firms use a variety of
technical means to protect and control data access, including de-identification, encryption,
Only 22% of firms report that the EU’s General Data Protection Regulation (GDPR) has
impacted sales and marketing to non-EU countries. That figure is 27% for firms headquartered in
Europe, excluding the United Kingdom. Given that the GDPR went into effect during our survey
period, these figures might change as firms have more experience with the regulation. The survey
respondents were asked to select all types of data protection used. As reported in Figure 22 , across
29
more intensively than startup firms without customers in the EU. This could reflect the impact of
There are some differences in data protection across firm size. Startup firms with ten or
fewer employees represent close to 40% of the total survey respondents. These small startup firms
are less likely to use legal contracts, de-identification, data encryption and password protected
access, as reported in Figure 8. However small startup firms are equally likely to use logged access
30
.6 Contracts
Encryption
De-indentification
% of respondents
.4
Passwords
Logged access
.2
API
0
Not 0EU 1
EU
The attached appendix includes data for the survey question on ethics. Bessen et al. (2023)
uses responses from the 2021 and 2022 to examine the relationship between AI ethics policies,
pro-ethics actions, and venture funding. This appendix also includes data from the 2023 survey.
5. Discussion
AI and Jobs
The evidence above in Figure 6 suggests that the benefits of AI are more about enhancing
human capabilities than about replacing humans. This suggests that the technology often makes
human workers more valuable rather than less, possibly raising labor demand for professionals,
managers and sales and marketing personnel. The evidence in Figure 10 suggests that this is
exactly what is happening. AI is associated with increasing labor demand for some occupations,
31
eliminating labor overall and more about shifting work from some occupations to others.
This shift is illustrated by an account given by one of the firms interviewed in the survey
pre-test. The firm’s product automated the retrieval of contact information for prospective sales
contacts from text files. This eliminated clerical work for the people who had manually searched
for contact information, but because it was able to generate more sales prospects quickly, it also
created jobs for sales and marketing personnel. This pattern of shifting work appears quite similar
to the pattern observed for computer use where professional and managerial occupations appeared
to grow while clerical and administrative occupations in the same industries shrank (Bessen 2016).
Entry Barriers
Commentators have identified several factors that might make it difficult for startups or for
their customers to compete using AI. Yet the survey shows that these factors do not pose entry
barriers for many startups in many markets. 80% of startups use customer data and 63% use data
available from third parties, including publicly available data. While data might pose a barrier to
entry in some markets, like search, where large amounts of diverse data are needed, there are
Hardware. Almost all startups provide their products over the cloud, and evidence shows
that small firms are able to access and utilize cloud computing effectively (Jin and McElheran
2017). Consequently, access to large computing hardware does not seem to be an issue for startups.
Most of the startups in our sample develop their own software for most applications,
suggesting that skilled developers and basic software tools are available to them. Anecdotally,
many software tools are available for free under Open Source licenses, such as TensorFlow.
32
network effects. In particular, if the fixed costs of developing a new AI application are large, then
big firms can develop their own, but medium and small sized firms may need to rely on commercial
application developers. These developers effectively spread the fixed costs over multiple
customers. The survey finds that, indeed, startups sell disproportionately to mid-sized firms,
suggesting that a) these mid-sized customers can compete using AI purchased from startups, and
b) the startups can provide a more affordable solution than if the customers developed their own
AI applications.
Data protection regulation has been described as a barrier that raises the cost of entry for
startups. We do find that data protections are higher for startups based in Europe, and large firms
appear more likely than small firms to have data protections in place. But on the other hand small
firms and non-European firms do have a range of data protections in place as well.
This evidence of the entry of startups into some markets is encouraging, however, it is also
possible that competition, entry, and innovation could all further be improved. Moreover, Figure
5 suggests that there are some industries that might have a substantial entry barriers. In particular,
in consumer packaged goods, transportation and logistics, and high tech and telecommunications,
large firms appear to invest disproportionately more than startups in AI development. The disparity
in transportation and logistics might be related to the apparently large investments needed to
credibly enter the market for self-driving cars or the competitive power of the largest firms in those
industries. The disparity in high tech might reflect the importance of network effects in markets
for online commerce and advertising or the disproportionate ownership of IP by the larger firms.
33
Frey and Osborne (2017) predict that 47% of US jobs will be at risk of automation in the
next decade or so. If so many jobs were really at risk, we would see evidence of substantial job
loss associated with emerging AI applications today. This survey makes clear that is not the case.
The commercial AI applications offered by startups today are more about enhancing human
capabilities than they are about replacing humans. Only half of the firms strongly agreed that labor
cost reduction was a benefit to customers. And survey respondents replied that their customers are
using AI to create jobs in certain occupations about as often as they use it to eliminate jobs. As a
group, their applications more often prompt customers to increase employment in managerial,
It is quite possible that AI might have different employment effects in the more distant
future. It is also possible that large firms might have a different impact on employment than the
startups in this survey. Nevertheless, the evidence tempers concern about mass unemployment or
occupation to others, meaning some people will lose jobs and other jobs will be created. However,
the new jobs may require major new skills, requiring workers to make major investments and
perhaps to endure difficult transitions. Thus, it is important to consider whether current labor
market policies are adequate to address these potential demands (Furman and Seamans, 2019). The
changing nature of demand for occupations and skills might well be highly disruptive even if there
Finally, the survey documents the broad scope of AI applications being offered by startups,
across a variety of industries. While the business for AI startups appears robust overall, there is
34
these barriers, their nature, and possible policy remedies is a topic for future research. The survey
also suggests that startups use a variety of data protection mechanisms, and that startups with EU
customers are even more likely to engage in data protection. However, it is too early to assess how
GDPR is affecting AI startups, and this is another topic for future research.
35
36
37
38
All data shown here comes from surveys 3-5 as these questions were not included in the
first tow rounds of the survey. These figures need to be viewed in color.
40
30
% of respondents
Minority Hiring
More Diverse Data
20 Expert Advice
Employee Trainings
10
0
Survey 1 Survey 2 Survey 3 Survey 4 Survey 5
39
.8
.6
% of respondents
Minority Hiring
More Diverse Data
.4 Expert Advice
Employee Trainings
.2
0
No Policy Ethical AI Policy
50
40
% of respondents
30
Turned Down Business
Dropped Data
Dismiss Employee
20
10
40
50
40
% of respondents
30
Turned Down Business
Dropped Data
Dismiss Employee
20
10
0
Survey 1 Survey 2 Survey 3 Survey 4 Survey 5
50
40
% of respondents
30
Turned Down Business
Dropped Data
Dismiss Employee
20
10
0
Smaller (<11) Larger
41
50
40
% of respondents
30
Turned Down Business
Dropped Data
Dismiss Employee
20
10
0
Americas Asia Europe
.25
.2
% of respondents
Human-in-loop
.15 Int. Audits
A/B Testing
Oversight Board
Ext. Audits
.1
.05
42