Structured State Space Models for Deep Sequence Modeling (Albert Gu, CMU)
Yingzhen Li Yingzhen Li
1.89K subscribers
26,481 views
835

 Published On May 26, 2023

Date: May 26, 2023
(Sorry that the first 2 slides are not recorded, those are motivation slides though.)

Abstract:
This talk will cover recent deep neural networks based on state space models (SSM) starting from S4. I'll go over the core properties and mechanics of SSMs, and discuss the key features of S4 and variants. I'll also focus on discussing the relationship of SSMs with established deep learning models (RNNs, CNNs, Attention) and their corresponding strengths and weaknesses, including potential application areas and promising directions.

Bio:
Albert Gu is an incoming Assistant Professor of Machine Learning at Carnegie Mellon University. His research broadly focuses on theoretical and empirical aspects of deep learning, with a recent focus on new approaches for deep sequence modeling. He completed his PhD at Stanford University under the supervision of Christopher Ré, and is currently working at DeepMind during a gap year.

show more

Share/Embed