Date post: | 02-Jan-2016 |
Category: |
Documents |
Upload: | alicia-wilkins |
View: | 215 times |
Download: | 2 times |
A Data Scientist Toolkit
• A scripting language (Python, C#, Java, Perl)• A statistical computing language (R, SAS, SPSS)• Database languages/environments (MSSQL, Oracle, Postgres, sqlite)• Distributed computing environment (MapReduce, in many flavors)
Fundamentally we are flipping bits, but this isn’t software development.
Tools for data preparation
• A scripting language (Python, C#, Java)• A statistical computing language (R, SAS, SPSS)• Database languages/environments (MSSQL, Oracle, Postgres, sqlite)• Distributed computing environment (MapReduce)