Introduction to the Course
Welcome to the Course
What You Will Learn
Why Spark MLlib for Machine Learning Projects
Course Workflow & Project Overview
Tools We’ll Use: Apache Spark, Spark ML, Apache Zeppelin
Overview of House Sale Dataset
Setting Up the Environment
Requirements
(Hands On) Installing Java
Steps for Installing Java
(Hands On) Setting Java environments
Steps for Setting Java environments
(Hands On) Apache Zeppelin Installation Steps on Ubuntu machine
Steps for Installing Apache Zeppelin on Ubuntu machine
(Hands On) Installing Docker Desktop on Windows 10/11
Steps for Installing Docker on Windows
(Hands On) Running Apache Zeppelin on Docker (Windows)
Steps for Running Apache Zeppelin on Docker
(Hands On) Configuring and Connecting to the Spark Interpreter
Steps for Configuring and Connecting to the Spark Interpreter
Download Resources
Download Resources
Data for Project
Code
Importing Zeppelin file in Zeppelin Environment
Zeppelin Basics
What is Apache Zeppelin
Features & Benefits
Notebook UI Overview
Markdown and text formatting
Creating and running paragraphs
Hands On - Creating and Running Paragraphs
Visualization Options (Tables, Bar chart, Pie chart, etc.)
Hands On - Types of Default Chart in Zeppelin
Zeppelin with Apache Spark
Spark interpreter details
Working with RDDs and DataFrames
Spark SQL queries and caching
Visualizing Spark outputs
Job tracking and performance tuning basics
Machine Learning Project
Understanding Spark Imports for ML
Loading Source Data in Spark
Preparing Training Data
Understanding StringIndexer in Spark
Defining the Pipeline in Spark MLlib
Split the Data
Using VectorAssembler to Prepare Training Data
Train a Regression Model in Spark
Prepare the Testing Data
Testing the Regression Model in Spark
Evaluating the Regression Model in Spark
Evaluating Model Performance using RMSE
Introduction
Introduction
Download Resources
Download Data for the Project
Download Source Code for the Project
Project Begins
Introduction to Spark
(Old) Free Account creation in Databricks
(New) Free Account creation in Databricks
Provisioning a Spark Cluster
Introduction to Machine Learning
Basics about notebooks
Dataframes
Regression model
Explanation of a Few Terms Used in the Model
File Content
Project Explanation
Preview - Spark Machine Learning Project (House Sale Price Prediction)