Alok Shah

I'm a student at University of Pennsylvania where I study Computer Science (B.S.E), Mathematics (B.A.), and Electrical Engineering (M.S.E) with a focus on Optimization and Control.

Outside of class I lead MLR@Penn, the undergraduate AI and ML research organization and community at Penn and serve on the Student Advisory Board for the Wharton AI & Analytics Initiative.

In my free time, I enjoy history and geography trivia, traveling, trying new food trucks and restaurants around Philly, and burning off those calories by playing pickup soccer.

I'm open to collaborations or talking about anything related to ML and research -- I especially enjoy mentoring younger students interested in getting involved in the field!

Email  /  CV  /  Twitter  /  Github

profile photo

Research

I'm broadly interested in playing simple tricks on models to improve empirical and theoretical bounds alike on performance across modalities. Currently, I'm developing methods to induce more robust, explainable, and generalizable reasoning capabilities in generative models while ensuring their responsible use in practice.

Investigating Language Model Dynamics using Meta-Tokens
Alok Shah, Keshav Ramji, Khush Gupta, Vedant Gaur
NeurIPS ATTRIB Workshop Paper, 2024

Explored novel meta-attention mechanism using meta-tokens to enhance model reasoning and interpretability in language models.

Weak-to-Strong In-Context Optimization of Language Model Reasoning
Keshav Ramji, Alok Shah, Vedant Gaur, Khush Gupta
NeurIPS ATTIB Workshop Paper, 2024

Developed in-context optimization method leveraging weak learners to improve reasoning in strong large language models without additional finetuning

AlfLLM: Limitations on LLMs as Reward Function Surrogates
Alok Shah, Alexander Kyimpopkin, Dominic Olaguera-Delogu

Investigated the efficacy of using large language models as surrogate reward functions for reinforcement learning in the ALFWorld environment.

Selected Coursework

I've had the privilege of learning from excellent professors, each of whom has influenced my approach to research.

* denotes graduate level; ** denotes doctoral level

  • Honors Multivariate Analysis (Herman Gluck)
  • Discrete Mathematics (Rajiv Gandhi)
  • Data Structures and Algorithms (Rajiv Gandhi)
  • Advanced Linear Algebra* (Julia Hartmann)
  • Probability* (Michael Carchidi)
  • Computer Systems (Benjamin Lee)
  • Machine Learning* (Jake Gardner)
  • Real Analysis (John Green)
  • Topology* (Dennis DeTurck)
  • Differential Geometry* (Dennis DeTurck)
  • Operating System and Design (Travis McGaha)
  • Ethical Algorithm Design* (Michael Kearns)
  • Convex Optimization** (Nikolai Matni)
  • State Estimation, Control, and Reinforcement Learning** (Pratik Chaudhari)
  • Deep Learning* (Pratik Chaudhari)
  • Theoretical Computer Science (Anindya De)
  • Analysis of Algorithms* (Sanjeev Khanna)
  • Uncertainty Quantification** (Aaron Roth)
  • Learning for Dynamics and Control** (Ingvar Ziemann)
  • Hardware/Software Co-Design for Machine Learning* (Jing Li)
  • Stability in Optimization and Statistics** (Jason Altschuler)

Teaching

I've served as Teaching Assistant for multiple core ML courses at Penn

* denotes graduate level; ^ denotes head teaching assistant

  • Machine Learning*^ (Surbhi Goel, Eric Wong, Jake Gardner)
  • Mathematics of Machine Learning (Eric Wong)

Forked from Jon Barron's source code.