Date post: | 14-Jul-2015 |
Category: |
Career |
Upload: | sujee-maniyam |
View: | 707 times |
Download: | 0 times |
+Who Invited This Guy? Hi, I am Sujee Maniyam J
n 15 years+ software development experience
n Consulting & Training in Big Data
n Author n “Hadoop illuminated” open source book n “HBase Design Patterns” coming soon
n Open Source contributor (including Hadoop) http://github.com/sujee
n Founder / Organizer of ‘Big Data Guru’ meetup http://www.meetup.com/BigDataGurus/
n http://sujee.net/
+Riding The Technology Wave
Late 1990s :
Web
2004-06 : Social (FB,
Twitter)
2007 : iPhone
2008 : Hadoop
2014 : Big Data in Real Time
2015 : ?
+Big Data Fad Or Real?
n It is very real !
+Big Data Job Trend
+… and $$$
+Audience For This Talk…
Developers
+This Doesn’t Work….
n Quit Job on Friday
n Attend Big Data Bootcamp over the Weekend
n Start as a Big Data Developer on Monday
n J sorry !
+Road Map For Launching Your Big Data Career
n (1) Learn
n (2) Network
n (3) Be Known
n (4) get hired
+
(c) Elephant Scale.com -- 2014
10
+Big Data / Hadoop Jobs
+Data Scientist
(c) Elephant Scale.com -- 2014
12
+Step (1) Learning
n Preferred Option : learn on your current job J n Take a training class
n Do a Project
n If That is not possible (dead end job / employer) n Learn on your own
+Learning 1 : Learn
n Books n So many, n Start with ‘Hadoop’ by Tom White n Also checkout our free, open source book : ‘hadoop illuminated’ J
n Tutorials (Cloudera / HortonWorks)
n Blogs (Major vendors)
n Webinars n Free, watch at your own time n Signup at Cloudera / HortonWorks / DataStax
n Meetups n Plenty to choose from n My meetup : Big Data Gurus in San Jose J
+Learning 2 : Practice
n Hands-on a must !!!!
n Get a Hadoop VM / Sandbox from a vendor n Easiest way to get Hadoop running
n Free !
n Every thing pre-installed and configured… ready to go!
n Use Hadoop version 2.x n Cloudera 5.x series
n HortonWorks 2.x series
+Learning 2 : Practice
n Most VMs come with some tutorials pre-installed.. Do them
n We maintain an open-source Hadoop labs on github n https://github.com/elephantscale/HI-labs
n 50+ labs on HDFS / MapReduce / Pig / Hive
n Where to get data? n http://hadoopilluminated.com/hadoop_illuminated/
Public_Bigdata_Sets.html
n Amazon hosts some big data sets
+Learning 2 : Practice ++ How to stand-out
n Need more than ‘hello world’
n Practice running Hadoop as a cluster n Use cloud providers like Amazon, Rackspace …
n Cost ?
+Challenges In Self Learning
n Takes up a lot of personal time!
n Can loose motivation n Pair-study with some-one
n Motivate / teach each other
+Very quickly….
+Insight Data Engineering Fellowship (I am mentor!)
n 6 weeks, full time, professional fellowship
n Completely free for students ! (yes, really)
n Mentored by top industry experts (Nathan Marz – creator of Storm, Databricks– creators of Spark ..etc)
n Learn cool Data Engineering tools
n Build an awesome project n Motif finder at large scale n Inspect / visualize BitCoin transactions in real-time
n Demo to top companies (Netflix, Yelp, Facebook, Square)
n Get hired !
Insight Data Engineering Fellows Program
InsightDataEngineering.com
+Big Data Skill Chasm
+Do I Need a Certification?
n Certifications are good n if you have no practical experience
n Consultant
n Cloudera / Horton Works
n You don’t need to take their courses
n Just take the certification exams n Reasonably easy with preparation
n Very affordable (few hundred $)
+Do I Need To Take A Pay Cut?
n Don’t hesitate to take a paycut for your first Big Data job
n You will gain valuable practical experience
n And the next job you will make lot more
+Step (2) Networking
n First get your OWN business card J
n Meetups n tons of meetup in this area
n My meetup : http://www.meetup.com/BigDataGurus/ J
n Conferences n Can be expensive (Strata $3k)
n Beg some one for a ‘visitor pass’ J
n Cheap conferences (HbaseConf $400, Hadoop Summit : $500)
n Money well spent… great connections!
+Networking : How To Find Decision Makers?
+Networking : Uber Networking Tips
n Read this book è
n Soft Networking n Become the connector, do intros
n People will REMEMBER you!
n Volunteering n Help to run a meetup / event
n You will get to know people you otherwise wouldn’t meet (Board of directors …etc)
+Step (3) -- Be Known
It is not what you know
It is who you know
It is WHO knows YOU !
+How to Be Known (aka How To Become an Expert!)
n Open Source n Huge boost to your resume
n Write quality blogs, articles n Lot of magazines wants contributors
n Write a Book n We did it on our own – without a publisher
n ‘Hadoop Illuminated’ : http://hadoopilluminated.com/
n Speak at meetups / conferences
+Old Resume
+Your New Resume
+Github activity log (employers check these !)
+Open Source Contributions
n Open Source involvement is a ‘hot skill’
n Just saying ‘I used TomCat’ isn’t enough J
n Open source tells me two things about you: n You are passionate about technology
(not just b/c this gig pays well)
n You dedicate your personal time à initiative
n Best option : Get Paid to work on open source J cloudera, linkedin, netflix….
n If not, you can still do meaningful contributions..
+How to Contribute To Open Source
n Step 1) Get a GitHub account (free)
n Step 2) Look for open source projects to contribute n Hadoop, cassandra, Spark
n Start fixing bugs (‘low hanging fruits’)
n Step 3) Starting your own project n Has to be unique (not another word count example using Spark!)
n Solve some thing you know about
n E.g. : Mark Kerzner – eDiscovery & Hadoop
+Personal Story : Contributing to HBase
n [code] Improve benchmarking tool n Started as a hobby project
n Submitted patch
n Lots help from senior developers
n https://issues.apache.org/jira/browse/HBASE-4440
n [documentation] improve patch submission process n You don’t need to be a code-ninja to contribute !!
n Documentation is badly needed in open source projects
n https://issues.apache.org/jira/browse/HBASE-5577
+JIRA : HBASE-4440
+Tips On Submissions
n Make it easy for committers n Don’t create extra work for them !
+Speaking at Meetups / Conferences
n Start with meetups
n Have a unique topic n “solving XXX using Spark” ..etc
n Having a popular open source project will help J J
n Submit proposals to many conferences n You may not get into Strata first J
n Pay attention to ‘local’ conferences n SVCodeCamp, Dataweek in SF …etc
n Big Data conference calendar http://elephantscale.com/bigdata_conferences
+Acing The Interview
n Interviewer : So , have you used Hadoop at your work? What kind of practical experience you have?
n If no, then usually interview ends here
n You : Ahem, I haven’t had a chance to use Hadoop at work… But let me tell you about the open source project I am working on… * walk to whiteboard, start drawing, explain …* * gets hired ! *
+Step (4) Get Hired
+Thanks !
Sujee Maniyam
http://elephantscale.com
Expert consulting & training in Big Data
© Elephant Scale, 2014
+Credits