arrow_back
Introduction to the Course
Welcome to the Course
What You Will Learn
Overview of the Olympic Games Dataset
Tools We’ll Use: Apache Spark, Spark SQL, Apache Zeppelin
Olympic Game Deep Dive
Olympic Dataset Overview – Athletes, Events, and Countries
Types of Information in the Dataset (Performance, Demographics, Events)
Understanding the Olympic Dataset Structure (Attributes, Years, Metadata)
Setting Up the Environment
Requirements
(Hands On) Installing JAVA
Steps for Installing JAVA
(Hands On) Setting JAVA environments
Steps for Setting JAVA environments
(Hands On) Apache Zeppelin Installation Steps on Ubuntu machine
Steps for Installing Apache Zeppelin on Ubuntu machine
(Hands On) Installing Docker Desktop on Windows 10/11
Steps for Installing Docker on Windows
(Hands On) Running Apache Zeppelin on Docker (Windows)
Steps for Running Apache Zeppelin on Docker
(Hands On) Configure and Connect to Spark interpreter
Steps for Configure and Connect to Spark Interpreter
Download Resources
Download Resources
athlete_events.csv
noc_regions
Olympic Games Analytics(1)
Importing Zeppelin file in Zeppelin Environment
Zeppelin Basics
What is Zeppelin
Features & Benefits
Notebook UI Overview
Markdown and text formatting
Creating and Running Paragraphs
Hands on Creating and Running paragraphs
Visualization Options (Tables, Bar chart, Pie chart, etc.)
Hands On - Types of Default Chart in Zeppelin
Zeppelin with Apache Spark
Spark interpreter details
Working with RDDs and DataFrames
Spark SQL queries and caching
Visualizing Spark outputs
Job tracking and performance tuning basics
Data Exploration with Spark
Defining the Olympic Athlete Events DataFrame
Transforming Olympic Dataset Columns with Spark Casting
Creating Temporary Views
Loading and Creating SQL View for Olympic NOC Regions Data
Analyzing the Age Distribution of Olympic Gold Medalists
Analyzing Gold Medal Wins by Athletes Over 50
Analyzing Women’s Medal Achievements in the Summer Olympics
Top 5 Countries with the Most Olympic Gold Medals
Disciplines with the Greatest Number of Gold Medals
Variation of Male and Female Athletes Over Time
Age Trends of Male and Female Athletes in the Olympics Over Time
Tracking Weight Trends of Olympic Athletes Over Time
Tracking Height Trends of Male Athletes in the Olympics
Tracking Weight Trends of Male and Female Gymnasts Over Time
Evolution of Olympic Weightlifters’ Body Weight Over Time
Evolution of Height Trends in Olympic Weightlifting Athletes
Analyzing Olympic Gold Medal Distribution by Country
Analyzing Olympic Silver Medal Distribution by Country
Analyzing Olympic Bronze Medal Distribution by Country
Introduction
Introduction
Download Resources
Download Source Code for Olympic Games
Download Data file 1 for Olympic Games
Download Data file 2 for Olympic Games
Project Begins
File level details
(Old) Free Account creation in Databricks
(New) Free Account creation in Databricks
Importing Databricks Notebook
Overview and Project Objective
File Content Explaination
Launch Spark Cluster
Spark Notebook Basics
Loading data into Spark Dataframe
Distribution of the age of gold medalists
Gold Medals for Athletes Over 50 based on Sports
Women medals per edition(Summer Season) of the Games
Top 5 Gold Medal Countries
Disciplines with the greatest number of Gold Medals
Height vs Weight of Olympic Medalists
Variation of Male/Female Athletes over time
Variation of (Age/Weight/Height) for Male/Female Athletes over time
Weight over year for Male/Female Gymnasts
Weight/Height over years for Male/Female Lifters
Gold/Silver/Bronze Medals based on Countries
Publish Notebook to the Web
Preview - Olympic Games Analytics Project in Apache Spark for Beginner
Discuss (
0
)
navigate_before
Previous
Next
navigate_next