Joseph Selvaraaj
My broad research interests focus on advancing multi agent autonomous systems by designing algorithms and systems that are
- theoretically grounded and efficient under different notions of efficiency,
- scalable to complex problems like sequential social dilemmas,
- simple to understand and implement.
To that end, I am exploring how deep learning, deep RL, deep MARL as a tool can be combined with insights from pure and applied mathematics.
Bio: I am currently a software engineer in Lucid Software’s Cards 2 team. Alongside my work, I also pursue independent research on multi agent reinforcement learning. I got my B.S computer science, B.S mathematics and B.B.A finance from UMass Amherst in 2023.
At UMass, I studied applying neural differential equations for episode trajectory modeling of grid environment as part of my undergraduate honors thesis with Prof. Bruno Castro da Silva.
news
| Jul 2025 | Informal research with Jasmine Aloor on Sequential Social Dilemmas. |
|---|---|
| Mar 2025 | Informal work with Jasmine Aloor on JAX implementation of Fair MARL. |
| Feb 2025 | JAX implementation of VMAS simulator that supports Football and MPE environments. |
| Jan 2025 | Informal work with Siddharth Nayak on JAX implementation of InforMARL. |
| Jul 2024 | Informal research with Andrea Baisero on memory reactive policy. |
| Jun 2023 | Started working as Software Engineer at Lucid Software |
| May 2023 | Completed honors thesis on sequential rollouts with neural ordinary differential equations. |
| Sep 2022 | Research Assistant for Prof. Chengbo Ai |