Date Spark Job Server Evan Chan and Kelvin Chu Overview Why We Needed a Job Server • Created at Ooyala in 2013 • Our vision for Spark is as a multi-team big data service…
SPARK + FLASHBLADE DELIVERING INSIGHTS FROM 5PB OF PRODUCT LOGS AT PURE STORAGE Brian Gold Pure Storage © 2017 PURE STORAGE INC. 2 ALL-FLASH STORAGE FOR DATA-INTENSIVE COMPUTING…
Spark Autotuning Lawrence Spracklen Alpine Data Overview • Motivation • Spark Autotuning • Future enhancements Motivation We use Spark • End-2-end support…
Streaming Outlier Analysis for Fun and Scalability Casey Stella 2016 Casey Stella (Hortonworks) Streaming Outlier Analysis for Fun and Scalability 2016 Table of Contents…
Build Your Next Apache Spark Job in .NET Using Mobius Build Your Next Apache Spark Job in .NET Using Mobius Kaarthik Sivashanmugam @kaarthikss 1 Mobius C# API for building…
SPARK SUMMIT EUROPE 2016 SCALING FACTORIZATION MACHINES ON APACHE SPARK WITH PARAMETER SERVERS Nick Pentreath Principal Engineer, IBM About ⢠About me â @MLnick â Principal…
Project Tungsten Phase II Joining a Billion Rows per Second on a Laptop Apache Spark’s Performance Project Tungsten and Beyond Sameer Agarwal Spark Summit| Brussels| Oct…
Data-Aware Spark Zoltán Zvara [email protected] This project has received funding from the European Union’s Horizon 2020 research and innovation program under grant…
SPARK SUMMIT EUROPE 2016 SPARK AND COUCHBASE AUGMENTING THE OPERATIONAL DATABASE WITH SPARK Michael Nitschinger Couchbase WHY SPARK AND COUCHBASE Overview & Use-Cases…
Sparkling Water 2.0: The next generation of machine learning on Apache Spark Jakub Háva [email protected] Spark Summit Europe, Brussels October 26, 2016 mailto:[email protected] Who…
Automatic Checkpointing in Spark Nimbus Goehausen Spark Platform Engineer [email protected] Copyright 2016 Bloomberg L.P. All rights reserved. A Data Pipeline source…
SPARK SUMMIT EUROPE 2016 Mastering Spark Unit Testing Theodore Malaska Blizzard Entertainment, Group Technical Architect About Me ▪ Ted Malaska - Architect at Blizzard…
SPARK SUMMIT EUROPE 2016 Prediction as a service with Ensemble Model trained in SparkML and Python ScikitLearn on 1Bn observed flight prices daily Josef Habdank Lead Data…
SPARK SUMMIT EUROPE 2016 Distributed Time Series Analysis Framework For Spark Larisa Sawyer Two Sigma Larisa Sawyer November 1, 2016 2 $0.0 $500.0 $1,000.0 $1,500.0 $2,000.0…
Apache Spark 2.0 Performance Improvements Investigated With Flame Graphs Luca Canali CERN, Geneva (CH) Speaker Intro ⢠Database engineer and team lead at CERN IT â Hadoop…
SPARK SUMMIT EUROPE 2016 On Premise Spark-as-a-Service on YARN Jim Dowling Associate Prof @ KTH, Stockholm Senior Researcher, SICS Swedish ICT CEO, Logical Clocks AB Twitter:…
1 © Hortonworks Inc. 2011 – 2016. All Rights Reserved Apache Spark and Object Stores —What you need to know Steve Loughran [email protected] @steveloughran October…
SPARK SUMMIT EUROPE 2016 Sparklint a Tool for Identifying and Tuning Inefficient Spark Jobs Across Your Cluster Simon Whitear Principal Engineer @ Groupon Why Sparklint?…
SparkOscope: Enabling Apache Spark Optimization Through Cross-Stack Monitoring and Visualization Yiannis Gkoufas IBM Research Dublin,Ireland High Performance Systems whoami…