Alok Raj
Robotics Researcher

I am a researcher focused on Robotic Perception, Manipulation, and Vision-Language Models. My work spans vision-based models for autonomous agents, including visual active search, task-oriented grasping, and multi-agent navigation, aiming to create adaptive robots that integrate advanced perception, learning, and control. I wish to advance robot intelligence through vision transformers, reinforcement learning, and 3D vision techniques that enable autonomous decision-making in dynamic environments.
I am currently a CS undergraduate at IIT Dhanbad, India. Most recently, I have been a research intern at the Multi-Agent Robotic Motion (MARMoT) Lab, NUS under Prof. Guillaume A. Sartoretti, working on embodied vision-language models for visual search with test-time adaptation methods and manipulation policy mobilization. I have also worked at the Center of Intelligent Robotics, IIIT Allahabad under Prof. G.C. Nandi and Andrew Melnik, developing the GRIM framework for generatively conditioned task-oriented grasping, and at Samsung R&D Institute India, building low-compute speaker verification systems. Previously, I interned at Clutterbot Technologies, where I improved robot vision models through self-training, knowledge distillation, and curriculum learning.
News
Aug 04, 2025 | Our work, Search-TTA, on Embodied Visual Active Search got accepted to Conference on Robot Learning (CoRL) 2025! |
---|---|
Jun 17, 2025 | Our work, GRIM, on Task-Oriented Grasping got accepted to ICML workshop Building Physically Plausible World Models 2025! |
May 19, 2025 | Started working as an R&D intern at Samsung Research. |
Feb 18, 2025 | Started working under Prof. Guillaume A Sartoretti as a research intern at MARMoT Lab, National University of Singapore. |
Dec 19, 2024 | Started working under Prof. Gora Chand Nandi as a research intern at Center of Intelligent Robotics, IIIT Allahabad. |