Data science

From 80,000 Hours Research
Jump to: navigation, search

This wiki is in very early stages and is a work in progress.

This page contains all notes on data science as a career path over and above what we put in our career profile. Read the profile first, here.

Profile type

Exploratory

What is this career path?

Introduction to what data science is

Data scientists analyse data (often very large datasets - see big data) using advanced techniques drawn from the disciplines of computer science and statistics in order to deliver actionable insights for organisations. Data science jobs are related to data analyst jobs -- the main difference between these is that data analysts don't use advanced techniques from computer science and statistics in order to perform their analyses. Data scientists tend to hold PhDs in quantitative subjects.
Drew Conway's Venn Diagram of data science

For most problems you can use existing algorithms - 90-95% of the time.

One of our users said "a data scientist is a statistician who knows a big more software engineering than normal; or a software engineer who knows a bit more statistics than normal."

What are the people like?

This varies widely by company, city and team. We've been told that at highly skilled teams in San Francisco the people are highly talented, hold PhD's in quantitative from elite universities and the environment closely resembles academia, with very cerebral discussions.

Role impact

Direct impact potential

Earnings potential

http://www.oreilly.com/data/free/files/2014-data-science-salary-survey.pdf

http://www.payscale.com/research/US/Job=Data_Scientist_%2f_Engineer/Salary (compared to software engineer: http://www.payscale.com/research/US/Job=Software_Engineer/Salary)

Glassdoor - Average Base Salary: $104,476, number of job openings 3,449 (Software engineer - $98,074, Number of Job Openings: 104,828) (Mobile developer - $79,810, Number of job openings $79,810)

http://www.glassdoor.com/blog/jobs-america/

https://datajobs.com/big-data-salary

"There are in fact edge cases of data scientists getting paid over $250,000 in unique situations – e.g. hedge funds, or special cases of advanced algorithm development – but this well above the norm."[1]

Advocacy potential

Career capital

Common exits

The two main routes for progression within data science are:

  • Management roles - data scientists can become valuable product managers because they know the limits of data science and software engineering. Well-rounded individuals can become senior leaders.
  • Individual contributor roles - these are when you are an expert in a specific area

Some fields of academia are easier to go back into after working in industry than others. Computer Science is easier to re-enter. Astrophysics is very difficult to re-enter.

Culture

Exploration value

Personal fit

We've been told that the most important skill for a data scientist is the ability to think like a scientist and is the slowest and hardest to train. Second most important is ability in probability and statistics, and the third is programming.

"If you don't love data for its own sake, then you will find it hard to compete with such candidates. Burtch, however, says everyone should learn to love data, if only for the sake of their career. "Within 10 years, if you're not a data geek, you can forget about being in the C-suite," Burtch says." [2]

Entry requirements.

What does it take to progress?

The two most important qualities for progression suggested to us in an interview with a senior data scientist were:

  1. Being an excellent communicator
  2. Strategic ability - having insight into what will most efficiently get an organisation to meet its high-level goals

Barriers

Job satisfaction

The only downside of the job, Greenberg says, is the time spent "cleaning" data — pruning it to remove irrelevant findings. "That part's not that exciting and you spend a lot of time doing it," he says. (http://mashable.com/2014/12/25/data-scientist/)

Vacation policies tend to be good because of how in demand the skillset is.

Alternatives

Academia

We've been told that in data science what you work on has direct impact on something tangible. Some companies will have data scientists working at the edge of human knowledge and making original contributions, though at some companies and roles data scientists just apply current domain knowledge to achieve organisational goals.

Past experience

Take action

Learn more

Next steps

For getting a job the most important skills to gain are really strong knowledge of statistics and probability, a statistical programming language like R or Python, and the basics of SQL. R is more for high-level research, whereas Python has broader uses and is a larger investment as there is more to learn.

Making a portfolio of projects is important for demonstrating your ability. You should start doing your own projects and write them up on a blog to show off your visualisation and communication skills, and put your code in a GitHub repository.

Self learning resources

http://datasciencemasters.org/

https://github.com/zipfian/data-science-primer

https://www.udacity.com/course/data-analyst-nanodegree--nd002

https://www.coursera.org/specialization/jhudatascience/1?utm_medium=courseDescripTop

https://www.mysliderule.com/learning-paths/data-analysis

Bootcamps

Lists of bootcamps:

http://datascience.community/bootcamps

http://yet-another-data-blog.blogspot.co.uk/2014/04/data-science-bootcamp-landscape-full.html

Should you do a bootcamp? The placement statistics are impressive. One of our members said Insight was the "the best educational experience of my life".

Best resources

Linear Digressions - Udacity's podcast on machine learning and data science

Remaining issues

Research process

Sources of info

http://en.wikipedia.org/wiki/Data_science

BLS

http://www.bls.gov/careeroutlook/2013/fall/art01.pdf

http://www.bls.gov/ooh/computer-and-information-technology/computer-and-information-research-scientists.htm#tab-1

Udacity

Skills checklist for getting your first data analyst job (pdf)

4 types of data science jobs

Hiring Trends in Data and What They Mean for You

Quora

http://www.quora.com/What-is-the-Data-Science-topic-FAQ

http://www.quora.com/How-do-I-become-a-data-scientist

Journal articles

http://www.nature.com/naturejobs/science/articles/10.1038/nj7546-253a

Media

So you wanna be a data scientist? A guide to 2015's hottest profession (Mashable)

A Very Short History Of Data Science (Forbes)

Data Scientist: The Sexiest Job of the 21st Century (HBR)