+ All Categories
Home > Technology > The History and Use of R

The History and Use of R

Date post: 01-Nov-2014
Category:
Upload: analyticsweek
View: 382 times
Download: 2 times
Share this document with a friend
Description:
A presentation on the history, design, and use of R. The talk will focus on companies that use and support R, use cases, where it is going, competitors, advantages and disadvantages, and resources to learn more about R. Speaker Bio Joseph Kambourakis has been the Lead Data Science Instructor at EMC for over two years. He has taught in eight countries and been interviewed by Japanese and Saudi Arabian media about his expertise in Data Science. He holds a Bachelors in Electrical and Computer Engineering from Worcester Polytechnic Institute and an MBA from Bentley University with a concentration in Business Analytics.
Popular Tags:
79
The History and Use of R Joseph Kambourakis
Transcript
Page 1: The History and Use of R

The History and Use of R

Joseph Kambourakis

Page 2: The History and Use of R

Ground Rules

• Interrupt me

• These are all my opinions and not of EMC or Big Data Analytics, Discovery & Visualization Meetup

• Slides will be available

Page 3: The History and Use of R
Page 4: The History and Use of R
Page 5: The History and Use of R

Joseph Kambourakis @mouthorjoe

Page 6: The History and Use of R
Page 7: The History and Use of R

Taught Around the World

Page 8: The History and Use of R

WPI

Page 9: The History and Use of R

Bentley University

Page 11: The History and Use of R

Sam Woolford & Dominique Haughton

Page 12: The History and Use of R

First Got Exposed to R

Page 13: The History and Use of R

What is

R is a free software environment for statistical computing and graphics

A language plus a run-time environment with graphics, a debugger, access to certain system functions, and the ability to run programs stored in script files

Page 14: The History and Use of R

What is R’s Hashtag?

Page 15: The History and Use of R

Robert Gentleman & Ross Ihaka

• R: A Language for Data Analysis and Graphics

Page 16: The History and Use of R

Starts with S

1976 1988 1991

Page 17: The History and Use of R

Scheme

• Lexical scoping

Page 18: The History and Use of R

Lexical scoping

• Searches through environments

– First global

• Global is your workspace

– Second namespace of packages

• More on packages later

Page 20: The History and Use of R

Under the Hood

Page 21: The History and Use of R

Open Source

• GNU General Public License

• Freedom 0: The freedom to run the program for any purpose.

• Freedom 1: The freedom to study how the program works, and change it to make it do what you wish.

• Freedom 2: The freedom to redistribute copies so you can help your neighbor.

• Freedom 3: The freedom to improve the program, and release your improvements (and modified versions in general) to the public, so that the whole community benefits.

• source: GNU.org

Page 22: The History and Use of R

R Project

• The R Foundation is a not for profit organization working in the public interest. It has been founded by the members of the R Development Core Team in order to – Provide support for the R project and other innovations in

statistical computing. We believe that R has become a mature and valuable tool and we would like to ensure its continued development and the development of future innovations in software for statistical and computational research.

– Provide a reference point for individuals, institutions or commercial enterprises that want to support or interact with the R development community.

– Hold and administer the copyright of R software and documentation.

• source: R Project

Page 23: The History and Use of R

Contributors

Page 24: The History and Use of R

How it Works: Design

Page 25: The History and Use of R

How it Works: Design

• Functional

– mean()

– plot()

Page 26: The History and Use of R

How it Works: Design • Interpreted language

Page 27: The History and Use of R

How it Works: Install

• Hosted on Comprehensive R Archive Network (CRAN)

• 54 megabytes

Page 28: The History and Use of R

http://cran.rstudio.com/

• Download and Install R

• Precompiled binary distributions of the base system and contributed packages, Windows and Mac users most likely want one of these versions of R:

• Download R for Linux

• Download R for (Mac) OS X

• Download R for Windows

• R is part of many Linux distributions, you should check with your Linux package management system in addition to the link above.

Page 29: The History and Use of R

How it works: Command Line

Page 30: The History and Use of R

How it Works: Packages

• Base

– mean()

• Utils

– read.csv()

• Stats

– lm()

– sd()

Page 31: The History and Use of R

Packages

• Mostly hosted on CRAN

• Many others hosted elsewhere

– Github

– RStudio

– Bioconductor

– RevolutionR

Page 32: The History and Use of R

Packages

• source: http://r4stats.com/articles/popularity/

Page 33: The History and Use of R

Top 10 Most Popular Packages

• source: Revolution Analytics Blog

Page 34: The History and Use of R
Page 35: The History and Use of R

Data Frame

Page 36: The History and Use of R

Capabilities

• ANALYTICS – Basic Mathematics – Basic Statistics – Probability Distributions – Machine Learning – Optimization and Mathematical Programming – Signal Processing – Simulation and Random Number Generation – Statistical Modeling – Statistical Tests

• GRAPHICS AND VISUALIZATION – Static Graphics – Dynamic Graphics – Devices and Formats

Page 37: The History and Use of R

Model & Plot

Page 38: The History and Use of R

GUI:RStudio

Page 39: The History and Use of R
Page 40: The History and Use of R
Page 41: The History and Use of R

How Does it Compare?

Page 42: The History and Use of R

How Does it Compare?

Page 43: The History and Use of R

How Does it Compare? R SAS SPSS Professional MATLAB

Cost Free! Very VERY High High - $9,975 High

Documentation Yes Very comprehensive

OK Some examples

Training Course NA Yes Yes Yes

User interface Low Medium Best Medium

Output Separate commands

Automatically produce diagnosis graph and forecast

Totally automated Some automated via GUI, some specific command

Models* Does not STL moving average

Does not have ARCH/GARCH + and other moving average models

Does not have MA & decomposition models

Certification Program

Yes Yes Yes

Page 44: The History and Use of R

Commercial Support

Page 45: The History and Use of R

Commercial Support

Page 46: The History and Use of R

• Version 3.1.1

7/10/2014

• source

Where it’s Now?

Page 47: The History and Use of R

Where it’s Going

Source: Revolution Analytics Blog

Page 48: The History and Use of R

Where it’s Going: Extensions and Interactions

• Rcpp

– Transfer from R to C++, and from C++ to R

• RLLVM

– Creates code

• H2O

– Big data package

Page 49: The History and Use of R

The best thing about R is that it was developed by statisticians. The worst thing about R is that...it was developed by statisticians.

Bo Cowgill

Page 50: The History and Use of R

Good: Open Source

• So many contributors

• Free!

• Community

Page 51: The History and Use of R

Bad: Open Source

• No customer support

• Features

Page 52: The History and Use of R

Good: Frequent Updates

• Always new packages

• New updates and bug fixes

Page 53: The History and Use of R

Bad: Frequent Updates

• Package updates

• R updates

Page 54: The History and Use of R

Bad: Documentation

Page 55: The History and Use of R

Bad:Speed

• 40 year old code

Page 56: The History and Use of R

Bad:Speed

• Interpreted

Page 57: The History and Use of R

Bad:Speed

• Single threaded

Page 58: The History and Use of R

Bad: Memory

• All stored in memory

Page 59: The History and Use of R

Soccer Example

@11tegen11

Page 60: The History and Use of R

Congressional Approval Rating

@adamramey

Page 62: The History and Use of R
Page 63: The History and Use of R

Use Cases 4

Page 64: The History and Use of R

How to Learn:

Page 65: The History and Use of R

How to Learn:

Page 66: The History and Use of R

How to Learn: RStudio How to Learn:

Page 67: The History and Use of R

How to Learn: Data Camp

How to Learn:

Page 68: The History and Use of R

How to Learn: Springer Series

Page 69: The History and Use of R

How to Learn: Art of R

Programming

Page 70: The History and Use of R

How to Learn: Boot Camp Boston Predictive Analytics Meetup

Page 71: The History and Use of R

How to Learn: Online Videos

Page 72: The History and Use of R

Web Resources:

Page 73: The History and Use of R

Web Resources:

Page 74: The History and Use of R

Web Resources:

Page 75: The History and Use of R

UseR Groups & Conferences

Page 76: The History and Use of R

Closing Thoughts

Page 77: The History and Use of R

Thank You

Page 78: The History and Use of R

Thank You

Page 79: The History and Use of R

Questions

?


Recommended