Stanford EE Computer Systems Colloquium

4:30 PM, Wednesday, March 4, 2020
Shriram Center for Bioengineering and Chemical Engineering Room 104
http://ee380.stanford.edu

Data Analytics at the Exascale for Free Electron Lasers Lasers Project

Jana B. Thayer
SLAC National Accelerator Laboratory
Stanford

The increase in velocity, volume, and complexity of the data generated by the upcoming Linac Coherent Light Source upgrade (LCLS-II) at the SLAC National Accelerator Laboratory presents a considerable challenge for data acquisition, data processing, and data management. These systems face formidable challenges due to the extremely high data throughput, hundreds of GB/s to multi-TB/s, generated by the detectors at the experimental facilities and to the intensive computational demand for data processing and scientific interpretation. The LCLS-II Data System is a fast, powerful, and flexible architecture that includes a feature extraction layer designed to reduce the data volumes by at least one order of magnitude while preserving the science content of the data. Innovative architectures are required to implement this reduction with a configurable approach that can adapt to the multiple science areas served by LCLS. In order to increase the likelihood of experiment success and improve the quality of recorded data, a real-time analysis framework provides visualization and graphically-configurable analysis of a selectable subset of the data on the timescale of seconds. A fast feedback layer offers dedicated processing resources to the running experiment in order to provide experimenters feedback about the quality of acquired data within minutes. We will present an overview of the LCLS-II Data System architecture with an emphasis on the Data Reduction Pipeline (DRP) and online monitoring framework.

Video:

To access the live webcast of the talk (active at 16:28 of the day of the presentation) and the archived version of the talk, use the URL SU-EE380-20200304. This is a first class reference and can be transmitted by email, Twitter, etc.

A URL referencing a YouTube view of the lecture will be posted HERE a week or so following the presentation.

About the Speaker:

[speaker photo] Jana is the Division Director for LCLS Data Systems at the SLAC National Accelerator Laboratory, responsible for data acquisition, data management, and data analysis for the LCLS facility and development of the next generation data system to support the LCLS-II upgrade. Jana started at SLAC in 2004 working on the Fermi Gamma-Ray Space Telescope and managed the Flight Software group from 2006 - 2009. Jana has a Ph.D. in Elementary Particle Physics from The Ohio State University and has long nurtured an interest in data acquisition systems and high-performance software in the fields of HEP, astrophysics, and photon science.