Stanford EE Computer Systems Colloquium

4:15PM, Wednesday, November 12, 2014
NEC Auditorium, Gates Computer Science Building Room B3
http://ee380.stanford.edu


I ♥ Logs: Apache Kafka, Stream Processing, and Real-time Data

Jay Kreps
LinkedIn
About the talk:

This talk will discuss how logs and stream-processing can form a backbone for data flow, ETL, and real-time data processing. It will describe the challenges and lessons learned as LinkedIn built out its real-time data subscription and processing infrastructure. It will also discuss the role of real-time processing and its relationship to offline processing frameworks such as MapReduce.

Slides: Download the slides for this presentation in PDF format.

Videos:

About the speaker:

[speaker photo] Jay Kreps is a Principal Staff Engineer at LinkedIn where he is the lead architect for online data infrastructure. He is among the original authors of several open source projects including a distributed key-value store called Project Voldemort, a messaging system called Kafka, and a stream processing system called Samza.

Contact information:

Jay Kreps
Twitter: @jaykreps