Joseph Selvaraaj

My research interests broadly center on advancing multi-agent autonomous systems by designing algorithms and systems that are:

theoretically grounded and efficient under different notions of efficiency,
scalable to complex problems such as sequential social dilemmas, and
simple to understand and implement.

To that end, I am exploring how deep learning, deep RL, and deep MARL can be used as tools and combined with insights from pure and applied mathematics.

Bio: I am currently a software engineer on Lucid Software’s Cards 2 team. Alongside my work, I pursue independent research on multi-agent reinforcement learning. I received a B.S. in computer science, a B.S. in mathematics, and a B.B.A. in finance from UMass Amherst in 2023.

At UMass, my undergraduate honors thesis with Prof. Bruno Castro da Silva studied the application of neural differential equations to modeling episode trajectories in grid environments.

news

Jul 2025	Informal research with Jasmine Aloor on Sequential Social Dilemmas.
Mar 2025	Informal work with Jasmine Aloor on a JAX implementation of Fair MARL.
Feb 2025	JAX implementation of the VMAS simulator that supports Football and MPE environments.
Jan 2025	Informal work with Siddharth Nayak on a JAX implementation of InforMARL.
Jul 2024	Informal research with Andrea Baisero on memory reactive policy.
Jun 2023	Started working as a Software Engineer at Lucid Software.
May 2023	Completed honors thesis on sequential rollouts with neural ordinary differential equations.
Sep 2022	Research Assistant for Prof. Chengbo Ai.