Apache Druid for Data Engineers (Hands-On)

Learn everything about Apache Druid, a modern real-time analytics database.

Language: English

Instructors: Bigdata Engineer

$120 90% OFF



Why this course?


Druid is a high-performance, real-time analytics database that delivers sub-second queries on streaming and batch data at scale and under load.

Apache Druid is a real-time analytics database designed for fast slice-and-dice analytics ("OLAP" queries) on large data sets. Most often, Druid powers use cases where real-time ingestion, fast query performance, and high uptime are important.

Druid is commonly used as the database backend for GUIs of analytical applications, or for highly-concurrent APIs that need fast aggregations. Druid works best with event-oriented data.
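
As a rough illustration of the kind of fast-aggregation query an application backend might send to Druid, the sketch below builds (but does not send) a request for Druid's SQL HTTP endpoint in Python. It assumes a local deployment with the Router listening at http://localhost:8888, and it uses a hypothetical datasource named `events` with a hypothetical `channel` column; none of these names come from the course description.

```python
import json
import urllib.request

# Assumption: a local Druid Router is reachable on port 8888.
DRUID_SQL_URL = "http://localhost:8888/druid/v2/sql"


def build_sql_request(sql: str, url: str = DRUID_SQL_URL) -> urllib.request.Request:
    """Build a POST request for Druid's SQL HTTP endpoint without sending it."""
    body = json.dumps({"query": sql}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical datasource ("events") and column ("channel"), for illustration only.
# __time is Druid's built-in timestamp column.
TOP_CHANNELS_SQL = """
SELECT channel, COUNT(*) AS event_count
FROM events
WHERE __time >= CURRENT_TIMESTAMP - INTERVAL '1' HOUR
GROUP BY channel
ORDER BY event_count DESC
LIMIT 10
"""

request = build_sql_request(TOP_CHANNELS_SQL)
# To run against a live cluster, something like:
#   rows = json.loads(urllib.request.urlopen(request).read())
```

The request is only constructed here so the sketch can be read and run without a cluster; sending it requires a running Druid instance that actually contains the datasource.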


Real-time analytics databases handle analytics on large amounts of data by optimizing resources to enable compute-heavy workloads, and knowing how to work with them is one of the most valuable technology skills. This course is specifically designed to bring you up to speed on one of the best technologies for this task, Apache Druid! Top technology companies like Google, Facebook, Netflix, Airbnb, Amazon, NASA, and more are all using Apache Druid!


Apache Druid Essentials: Unleashing Real-time Analytics and Scalable Data Exploration

Unlock the potential of real-time analytics and scalable data exploration with our comprehensive Apache Druid Essentials course. In this dynamic program, participants will delve into the world of Apache Druid, an open-source, high-performance analytics database designed for fast query response and seamless scalability.


Key Learning Objectives:

  • Introduction to Course

  • Real-time Analytics Databases

  • What is Apache Druid?

  • Key Features of Druid

  • Technology

  • Use cases

  • When to use Druid

  • When not to use Druid

  • List of Companies Using Apache Druid

  • Installation of Apache Druid

  • Start up Druid services

  • Open the web console

  • Load data

  • Query data

  • Overview of the Druid Web Console

  • Architecture of Druid

  • Druid Servers

  • External Dependencies

  • Storage Design

  • Datasources and Segments

  • Segment Identifiers

  • Segments

  • Introduction to Segments

  • Segment File Structure

  • Data Loading in Druid

  • Load Data from Local Files

  • Load Data from URI

  • Load Data from Kafka (Prerequisite: Introduction to Kafka)

  • Installing Single Node Kafka Cluster

  • Configuration Changes to Avoid a ZooKeeper Conflict

  • Load Data from Kafka

  • Query Data: Explain Plan

  • Aggregate data with rollup

  • Frequently Asked Questions
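
To give a flavor of the "Aggregate data with rollup" topic listed above, here is a minimal Python sketch of the idea behind rollup: rows that share the same truncated timestamp and the same dimension values are combined into a single row with aggregated metrics. This is a conceptual illustration only, not Druid's implementation; the `rollup` helper, the hourly truncation, and the sample field names (`page`, `bytes`) are all hypothetical.

```python
from collections import defaultdict
from datetime import datetime


def rollup(events, dimensions, metric):
    """Combine rows that share the same hour and dimension values.

    Returns a dict mapping (hour, dim1, dim2, ...) to a row count and metric sum.
    """
    totals = defaultdict(lambda: {"count": 0, "sum": 0})
    for event in events:
        # Truncate the timestamp to the hour (a stand-in for a chosen granularity).
        hour = datetime.fromisoformat(event["timestamp"]).replace(
            minute=0, second=0, microsecond=0
        )
        key = (hour.isoformat(),) + tuple(event[d] for d in dimensions)
        totals[key]["count"] += 1
        totals[key]["sum"] += event[metric]
    return dict(totals)


# Hypothetical sample events, for illustration only.
events = [
    {"timestamp": "2024-01-01T10:05:00", "page": "home", "bytes": 100},
    {"timestamp": "2024-01-01T10:40:00", "page": "home", "bytes": 250},
    {"timestamp": "2024-01-01T11:02:00", "page": "home", "bytes": 75},
]

rolled = rollup(events, dimensions=["page"], metric="bytes")
for key, agg in sorted(rolled.items()):
    print(key, agg)
```

In this sample, the two events in the 10:00 hour collapse into one row, so three input rows become two stored rows. The trade-off is that individual events can no longer be queried once they are rolled up.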


Course Curriculum

How to Use

After a successful purchase, this item will be added to your courses. You can access your courses in the following ways:

  • On a computer, you can access your courses after logging in.
  • On other devices, you can access your library using this web app through the browser on your device.


Learn Bigdata, Spark & Machine Learning | SmartDataCamp 2024