Stephen Cheng

I am a first-year CS PhD student at the University of Maryland, co-advised by Prof. Sarah Wiegreffe and Prof. Dinesh Manocha. My research interests lie broadly in developing a causal understanding of foundation models' post-hoc behavior and training methodologies, and in applying these findings to improve model performance and alignment. Currently, I am working on mechanistic interpretability methods for studying reasoning models.

News

Selected Publications