The Kinds of Data Scientist
The Kinds of Data Scientist
The Kinds
by Yael Garten
of Data Scientist
November 06, 2018
Summary. In 2012, HBR dubbed data scientist “the sexiest job of the 21st
century”. It is also, arguably, the vaguest. To hire the right people for the right roles,
it’s important to distinguish between different types of data scientist. There are
plenty of different distinctions that... more
In 2012, HBR dubbed data scientist “the sexiest job of the 21st
century”. It is also, arguably, the vaguest. To hire the right people
for the right roles, it’s important to distinguish between different
types of data scientist. There are plenty of different distinctions
that one can draw, of course, and any attempt to group data
scientists into different buckets is by necessity an
oversimplification. Nonetheless, I find it helpful to distinguish
between the deliverables they create. One type of data scientist
creates output for humans to consume, in the form of product and
strategy recommendations. They are decision scientists. The
other creates output for machines to consume like models,
training data, and algorithms. They are modeling scientists.
How to organize
Much has already been written about how data science functions
should be organized. Perhaps the most important point is that if
data science is a strategic differentiator for the organization, the
head of the data science unit should ideally report into the CEO. If
this is not possible, they should at least report into someone who
understands data strategy and is willing to invest to give it what it
needs. Data science has its own skillset, workflow, tooling,
integration processes, culture; if it is critical to the organization it
is best to not bury it under a part of the organization with a
different culture.
The other big question is whether and how to embed data science
into the different business lines. There are three basic models:
centralized in one data science team, distributed throughout the
business lines, or a hybrid between the two where you have a
centralized team reporting into one head, but physically co-locate
and embed teams of data scientists into business units long term.
Unless your data operation includes several hundreds of
employees, it’s pretty clear at this point that the hybrid model is
most effective. (If you reach this scale, a fully distributed model
can make sense, but very few companies work this way.)
A final piece of advice for those hiring data scientists: Look for
people who are in love with solving problems, not with specific
solutions or methods, and for people who are incredibly
collaborative. No matter what kind of data scientist you are
hiring, to be successful they need to be able to work alongside a
vast variety of other job functions — from engineers to product
managers to marketers to executive teams. Finally, look for people
who have high integrity. As a society, we have a social
responsibility to use data for good, and with respect. Data
scientists hold the responsibility for data stewardship inside and
outside the organization in which they work.
YG
Yael Garten is the Director of Siri Data Science
and Engineering at Apple.
PODCAST
When ‘Best Practices' Backfire