Problem Solving Recipes Learned from Supporting Spark Justin Pihony & Stavros Kontopoulos Lightbend 1. OOM Table of Contents 1. OOM1. OOM 2. NoSuchMethod 5. Strategizing…
Recipes for running Spark Streaming in Production Online Learning with Structured Streaming Ram Sriharsha, Vlad Feinberg @halfabrane Spark Summit, Brussels 27 October 2016…
1. Jorge López-Malla Matute INDEX [email protected] Abel Rincón Matarranz [email protected] Kerberos ● Introduction ● Key concepts ● Workflow ● Impersonation…
Spark Your Legacy: How to distribute your 8-year old monolith Moran Tavori, Tzach Zohar // Kenshoo // June 2016 Who’s this talk for? Who are we? Tzach Zohar, System Architect…
PowerPoint Presentation Using Spark @ Conviva Spark Summit 2013 Summary Who are we? What is the problem we needed to solve? How was Spark essential to the solution? What…
Date Spark Job Server Evan Chan and Kelvin Chu Overview Why We Needed a Job Server • Created at Ooyala in 2013 • Our vision for Spark is as a multi-team big data service…
Spark Summit (2014-06-30) http://spark-summit.org/2014/talk/building-a-data-processing-system-for-real-time-auctions What do you do when you need to update your models sooner…
MOBIUS: C# BINDING FOR SPARK MOBIUS: C# BINDING FOR SPARK Kaarthik Sivashanmugam Microsoft @kaarthikss 1 Quick Background Business Scenario: Next-gen near real-time processing…
1. Spark Community Update Matei Zaharia & Patrick Wendell June 15th, 2015 2. A Great Year for Spark Most active open source project in data processing New language: R…
Spark Streaming for Realtime Auctions @russellcardullo Sharethrough Agenda ⢠Sharethrough? ⢠Streaming use cases ⢠How we use Spark ⢠Next steps Sharethrough…
1. @louisdorard #dsb15 2. –Waqar Hasan, Apigee Insights “Predictive is the ‘killer app’ for big data.” 3. –Mike Gualtieri, Principal Analyst at Forrester “Predictive…
1. StratioistheonlyBig Data platformableto combine, in onequery, storeddata withstreamingdata in real-time (in lessthan30 seconds).Weare polyglotsas well: Weuse SparkovertwonoSQLdatabases,…
Next-Generation Genomics Analysis Using Spark and ADAM 1 Timothy Danford AMPLab, Tamr Inc. Bioinformatics today is workflows and files Sequencing: clustered Data size: terabytes-to-petabytes…
New Directions for Spark in 2015 Matei Zaharia March 18, 2015 2014: an Amazing Year for Spark Total contributors: 150 => 500 Lines of code: 190K => 370K 500+ active…
SPARK + FLASHBLADE DELIVERING INSIGHTS FROM 5PB OF PRODUCT LOGS AT PURE STORAGE Brian Gold Pure Storage © 2017 PURE STORAGE INC. 2 ALL-FLASH STORAGE FOR DATA-INTENSIVE COMPUTING…
CONNECTING PYTHON TO THE SPARK ECOSYSTEM Daniel Rodriguez Software developer/data scientist Continuum Analytics Twitter: @danielfrg Github: github.com/danielfrg Spark Summit…
Spark Autotuning Lawrence Spracklen Alpine Data Overview • Motivation • Spark Autotuning • Future enhancements Motivation We use Spark • End-2-end support…