课程目录:Spark Streaming with Python and Kafka培训
4401 人关注
(78637/99817)
课程大纲:

         Spark Streaming with Python and Kafka培训

 

 

 

Introduction

Overview of Spark Streaming Features and Architecture

Supported data sources
Core APIs
Preparing the Environment

Dependencies
Spark and streaming context
Connecting to Kafka
Processing Messages

Parsing inbound messages as JSON
ETL processes
Starting the streaming context
Performing a Windowed Stream Processing

Slide interval
Checkpoint delivery configuration
Launching the environment
Prototyping the Processing Code

Connecting to a Kafka topic
Retrieving JSON from data source using Paw
Variations and additional processing
Streaming the Code

Job control variables
Defining values to match
Functions and conditions
Acquiring Stream Output

Counters
Kafka output (matched and non-matched)
Troubleshooting

Summary and Conclusion