+ All Categories
Home > Documents > Data Science Recommended Books

Data Science Recommended Books

Date post: 13-Apr-2018
Category:
Upload: anonymous-pkvcsg
View: 226 times
Download: 0 times
Share this document with a friend

of 23

Transcript
  • 7/25/2019 Data Science Recommended Books

    1/23

    Recommended BookIntroduction to Data Science

    Nina Zumel

    John Mount

  • 7/25/2019 Data Science Recommended Books

    2/23

    Lesson Goals

    Present an overview of a number of helpful and re

    books

    (You certainly do not need to buy all, or even any obut it is good to know they are available)

  • 7/25/2019 Data Science Recommended Books

    3/23

    R

  • 7/25/2019 Data Science Recommended Books

    4/23

    Hands-On Programming

    with RGarrett Grolemund

    A basic introduction to programming and R,using a project-oriented approach.

  • 7/25/2019 Data Science Recommended Books

    5/23

    R for EveryoneJared P. Lander

    Well-liked, very popular R book.

    Covers programming basics and basic Rstructures. Good survey of basic statistics and

    modeling in R.

  • 7/25/2019 Data Science Recommended Books

    6/23

    R in Action, Second EditionRobert I. Kabacoff

    Our go-to R reference.

    Covers R programming structures, datamanagement and statistical functions. Less machinelearning than Lander, but comprehensive coverage

    of classical statistical analysis as done in R.

  • 7/25/2019 Data Science Recommended Books

    7/23

    The Art of R ProgrammingNorman Matloff

    Discusses R from a Computer Science(programming) rather than Statistical

    perspective. Includes a chapter on Rs debuggingenvironment very useful!

  • 7/25/2019 Data Science Recommended Books

    8/23

    Advanced RHadley Wickham

    Good deep dive into the internals of R as aprogramming language. A must if you are

    interested in developing R packages.

    Also available online: http://adv-r.had.co.nz

    http://adv-r.had.co.nz/
  • 7/25/2019 Data Science Recommended Books

    9/23

    Reproducible Research

    with R and RStudioChristopher Gandrud

    Covers R markup and knitrwithin the RStudioenvironment.

    Good introduction to the tool ecosystemaround R, and gives good advice on setting up

    an R-based project data science project.

  • 7/25/2019 Data Science Recommended Books

    10/23

    R Graphics CookbookWinston Chang

    Useful collection of recipes for creating graphicsin R, primarily in ggplot2.

  • 7/25/2019 Data Science Recommended Books

    11/23

    SQL

  • 7/25/2019 Data Science Recommended Books

    12/23

    SQL for Smarties, Fourth

    EditionJoe Celko

    Our go-to SQL reference.

  • 7/25/2019 Data Science Recommended Books

    13/23

    Statistics

  • 7/25/2019 Data Science Recommended Books

    14/23

    Statistics, Fourth Edition

    David Freedman, Robert Pisani,

    Roger PurvesGood introduction to Statistics and Probability.

    Intuitive approach that relies on illustrations andexamples, rather than formulas.

  • 7/25/2019 Data Science Recommended Books

    15/23

    A Handbook of Statistical Analyses

    using R

    Torsten Hothorn,

    Brian S. Everitt

    Intermediate level (assumes familiarity with Rand basic statistics).

    Covers several techniques in statistical dataanalysis, including linear and logistic regression,cluster analysis, and multidimensional scaling.

  • 7/25/2019 Data Science Recommended Books

    16/23

    Theory

  • 7/25/2019 Data Science Recommended Books

    17/23

    The Elements of Statistical

    Learning, 2nd edition

    Trevor Hastie, Robert Tibshirani,

    Jerome Friedman

    Our go-to machine learning reference.

    Covers a wide variety of statistical machinelearning algorithms, including their derivation,

    strengths and weaknesses.

  • 7/25/2019 Data Science Recommended Books

    18/23

    Practice

  • 7/25/2019 Data Science Recommended Books

    19/23

    An Introduction to Statistical

    Learning

    Gareth James, Daniela Witten,

    Trevor Hastie, Robert Tibshirani

    Companion text to The Elements of StatisticalLearning.

    More application oriented, with example codein R.

  • 7/25/2019 Data Science Recommended Books

    20/23

    Applied Predictive

    ModelingMax Kuhn, Kjell Johnson

    Practice-oriented approach to predictivemodeling.

    Good combination of theory and practice. Alsocovers data treatment, model evaluation, and

    feature selection. Example code in R.

  • 7/25/2019 Data Science Recommended Books

    21/23

    Practical Data Science with R

    Nina Zumel,John Mount

    Our book!A practitioner-oriented view of data science.

    Covers all aspects of a data science project: projectmanagement, data treatment, modeling and modelevaluation, deployment and reporting. Extensive

    example code in R. M A N N I N G

    Nina Zumel

    John Mount

    FOREWORD BY Jim Porzak

  • 7/25/2019 Data Science Recommended Books

    22/23

    And more

    More good books (our list is not comprehensive)

    Some web resources:

    http://www.r-bloggers.com

    http://www.statmethods.net (Quick-R)

    http://cran.r-project.org

    http://cran.r-project.org/http://www.statmethods.net/http://www.r-bloggers.com/
  • 7/25/2019 Data Science Recommended Books

    23/23

    What you should take aw

    There are a lotof available resources

    Research what you actually need and you may need t

    one or two extra books.


Recommended