Apache Spark

$430.00

Apache Spark with Scala – Hands On with Big Data!

Dive right in with 20+ hands-on examples of analyzing large data sets with Apache Spark, on your desktop or on Hadoop!

Bestseller: Rating: 4.5 out of 5 (11,453 ratings) 58,685 students

Category:

Description

What is Apache Spark?

Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including Spark SQL for SQL and DataFrames, Spark ML for machine learning, GraphX for graph data processing, and Spark Streaming for live data stream processing. With Spark running on Apache Hadoop YARN, developers can create applications to derive actionable insights within a single, shared dataset in Hadoop.

Course Overview

This training course will teach you how to solve Big Data problems using Apache Spark framework. The training will cover a wide range of Big Data use cases such as ETL, DWH, data virtualization, streaming, graph data structure, machine learning. It will also demonstrate how Spark integrates with other well established Hadoop ecosystem products. You will learn the course curriculum through theory lectures, live demonstrations and lab exercises. This course will be taught in primarily Scala programming language.

Duration: 3 Days or 24 Hours

Mode of Delivery: Virtual Training

Prerequisites

Following are the pre-requisites for the course.

  • Basic Knowledge of big data use-cases
  • Basic knowedge of Hadoop, HDFS and Hive
  • Basic knowledge of databases, OLAP/OTLP use cases, SQL
  • Programming knowledhe in python

Reviews

There are no reviews yet.

Be the first to review “Apache Spark”

Your email address will not be published. Required fields are marked *

Title

Go to Top