Professional Course

Machine Learning Foundation: Working With Spark and TensorFlow

5 days

Course description

Apache Spark, a significant component in the Hadoop ecosystem, is a cluster computing engine used in Big Data. Built on top of the Hadoop YARN and HDFS ecosystem, it offers order-of-magnitude faster processing for many in-memory computing tasks compared to MapReduce. It can be programmed in Java, Scala, Python, and R (the favorite languages of data scientists), along with SQL-based front ends.

The first part of the course teaches how to perform Machine Learning at scale using the popular Apache Spark framework. This course is intended for data scientists and software engineers, and assumes attendees have little or no previous experience with Machine Learning. It explores popular machine learning algorithms from the ground up. Students will explore Apache Spark essentials, core machine learning concepts, regression, classification, clustering and more.

The abundance of data and affordable cloud scale has led to an explosion of interest in Deep Learning. Google has released TensorFlow as an open-source library, allowing state-of-the-art machine learning to be done at scale, complete with GPU-based acceleration. The second part of the course introduces students to Deep Learning concepts and how TensorFlow implements them. Students will explore these skills in an active, hands-on manner.

Throughout the program, working in a hands-on learning environment guided by our expert instructor, students will:

  • Learn popular machine learning algorithms, their applicability, and limitations
  • Practice the application of these methods in the Spark machine learning environment
  • Learn practical use cases and limitations of algorithms
  • Explore not just the related APIs, but also the theory behind them
  • Work with real world datasets from Uber, Netflix, Walmart, Prosper, etc.

Who should attend?

This is an intermediate-level course, geared toward Data Scientists, Data Analysts and Developers new to Machine Learning, Spark and TensorFlow.

Prerequisites: Students should arrive with skills equivalent to the following:

  • Strong basic Python skills. Attendees without a Python background may view labs as follow-along exercises or team with others to complete them.
  • Good foundational mathematics in Linear Algebra and Probability
  • Basic Linux skills, including familiarity with command-line commands such as ls, cd, cp, and su

Training content

Part 1: Introduction to Machine Learning

Machine Learning (ML) Overview

  • Machine Learning landscape
  • Machine Learning applications
  • Understanding ML algorithms & models

ML in Python and Spark

  • Spark ML Overview
  • Introduction to Jupyter notebooks
  • Lab: Working with Jupyter + Python + Spark
  • Lab: Spark ML utilities

Machine Learning Concepts

  • Statistics Primer
  • Covariance, Correlation, Covariance Matrix
  • Errors, Residuals
  • Overfitting / Underfitting
  • Cross-validation, bootstrapping
  • Confusion Matrix
  • ROC curve, Area Under Curve (AUC)
  • Lab: Basic stats

Feature Engineering (FE)

  • Preparing data for ML
  • Extracting features, enhancing data
  • Data cleanup
  • Visualizing Data
  • Lab: data cleanup
  • Lab: visualizing data

Linear regression

  • Simple Linear Regression
  • Multiple Linear Regression
  • Running LR
  • Evaluating LR model performance
  • Lab: Use case: house price estimates

Logistic Regression

  • Understanding Logistic Regression
  • Calculating Logistic Regression
  • Evaluating model performance
  • Lab: Use case: credit card application, college admissions

Classification: SVM (Support Vector Machines)

  • SVM concepts and theory
  • SVM with kernel
  • Lab: Use case: Customer churn data

Classification: Decision Trees & Random Forests

  • Theory behind trees
  • Classification and Regression Trees (CART)
  • Random Forest concepts
  • Labs: Use case: predicting loan defaults, estimating election contributions

Classification: Naive Bayes

  • Theory
  • Lab: Use case: spam filtering

Clustering (K-Means)

  • Theory behind K-Means
  • Running K-Means algorithm
  • Estimating the performance
  • Lab: Use case: grouping cars data, grouping shopping data

Principal Component Analysis (PCA)

  • Understanding PCA concepts
  • PCA applications
  • Running a PCA algorithm
  • Evaluating results
  • Lab: Use case: analyzing retail shopping data

Recommendations (Collaborative filtering)

  • Recommender systems overview
  • Collaborative Filtering concepts
  • Lab: Use case: movie recommendations, music recommendations


  • Best practices for scaling and optimizing Apache Spark
  • Memory caching
  • Testing and validation

Part 2: Introduction to Deep Learning with TensorFlow

Machine Learning Quick Review

  • Understanding Machine Learning
  • Supervised versus Unsupervised Learning
  • Regression
  • Classification
  • Clustering

Introducing TensorFlow

  • TensorFlow intro
  • TensorFlow Features
  • TensorFlow Versions
  • GPU and TPU scalability
  • Lab: Setting up and Running TensorFlow

The Tensor: The Basic Unit of TensorFlow

  • Introducing Tensors
  • TensorFlow Execution Model
  • Lab: Learning about Tensors

Single Layer Linear Perceptron Classifier With TensorFlow

  • Introducing Perceptrons
  • Linear Separability and XOR Problem
  • Activation Functions
  • Softmax output
  • Backpropagation, loss functions, and Gradient Descent
  • Lab: Single-Layer Perceptron in TensorFlow

Hidden Layers: Intro to Deep Learning

  • Hidden Layers as a solution to XOR problem
  • Distributed Training with TensorFlow
  • Vanishing Gradient Problem and ReLU
  • Loss Functions
  • Lab: Feedforward Neural Network Classifier in TensorFlow

High-level TensorFlow: tf.learn

  • Using high-level TensorFlow
  • Developing a model with tf.learn
  • Lab: Developing a tf.learn model

Convolutional Neural Networks in TensorFlow

  • Introducing CNNs
  • CNNs in TensorFlow
  • Lab: CNN apps

Introducing Keras

  • What is Keras?
  • Using Keras with a TensorFlow Backend
  • Lab: Example with Keras

Recurrent Neural Networks in TensorFlow

  • Introducing RNNs
  • RNNs in TensorFlow
  • Lab: RNN

Long Short-Term Memory (LSTM) in TensorFlow

  • Introducing LSTMs
  • LSTMs in TensorFlow
  • Lab: LSTM


  • Summarize features and advantages of TensorFlow
  • Summarize Deep Learning and how TensorFlow can help
  • Next steps


  • Price: $2,695.00
  • Discounted Price: $1,751.75

Why choose Trivera Technologies LLC?

Over 25 years of technology training expertise.

Robust portfolio of over 1,000 leading-edge technology courses.

Guaranteed-to-run courses and flexible learning options.

Trivera Technologies LLC
7862 West Irlo Bronson Highway
STE 626
Kissimmee FL 34747

Trivera Technologies

Trivera Technologies is an IT education services and courseware firm that offers a wide range of professional technical education services, including: end-to-end IT training development and delivery, skills-based mentoring programs, new hire training and re-skilling services, courseware licensing and...