Skip to main content
Talks & Workshops
Schedule: February 16, 2018
February 17, 2018
February 16-17, 2018
Event Speaker Registration
Chaos Tools AJAX Demo
Introduction to Distributed Computing with Spark and Python
This workshop will cover :-
Brief introduction to Spark concepts and its working.
Using Spark with Python.
Read from different data sources/formats (CSV, JSON, JDBC)
Applying transformations using UDFs(User Defined Functions)
Saving dataframes to a filesystem.
Industrial use case examples.
Familiarity with python is good to have but not required. A computer running OS with updated browser.
Request new password
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
5 + 2 =
Solve this simple math problem and enter the result. E.g. for 1+3, enter 4.