


Title: From Idea to Product: Customer Profiling in Apache Zeppelin with PySpark
Duration: 36:06
Viewed: 4,261
Published: 13-10-2018
Source: YouTube

Sarah Sprich
2018.za.pycon.org/talks/61-from-idea-to-product-cu…

Zeppelin is a web-based notebook that enables interactive data analytics on big data. Data can easily be ingested from a variety of databases, and analysis can be performed in Python and PySpark. Visualisations can be built and displayed together with the code, using Zeppelin's built-in tool Helium or Python-specific tools such as Matplotlib and Bokeh. The web-based interface makes it easy to share results and collaborate on projects.

Developing in Zeppelin has changed the way we approach model development. We are able to take a project from an idea to a product all within one tool, using the following process:

1. Come up with an idea. Write some notes in a Zeppelin notebook describing how we would like the idea implemented.
2. Slowly start fleshing out the idea with real code until the solution is built. This is great to demo, as the code is in bite-size chunks and visualisations can be added directly.
3. Take the code into production. It can be scheduled to run directly in Zeppelin with a cron scheduler, or from a tool such as NiFi. Interactive visualisations can be embedded in a web-based frontend.

This talk is aimed at data scientists, particularly those working with big data. We will demonstrate how we have built a catalogue of subscriber attributes based on customer mobile usage and purchase behavior using Zeppelin and PySpark. These attributes can be used to profile subscribers, and are the starting point for individualised customer engagement. Anyone who attends this talk will get an introduction to Zeppelin and PySpark and an overview of what can be achieved with these tools.

pyconza2018 python

