Pranav Malpure

Hello! I'm Pranav, a Graduate Student Researcher at UC San Diego, where I am pursuing my MS in ECE, specializing in Intelligent Systems, Robotics, and Control (ISRC). I completed my undergraduate studies at the Indian Institute of Technology Bombay, earning an Honors degree in Aerospace Engineering and a Minor in Systems and Controls Engineering.

My research focuses on reinforcement learning for robotics, particularly manipulation, and I am currently working on integrating it with language to improve performance. I've also explored multi-robot navigation, sensor-fusion-based SLAM, control systems, and embedded electronics. I am currently a researcher at the Existential Robotics Laboratory, working under Prof. Nikolay Atanasov. My undergraduate thesis, advised by Prof. Mayank Baranwal, applied deep RL to the control of a robotic arm to achieve inverse kinematics.

I am curious to see how robots will shape our everyday lives in the future, and want to be a part of the community that makes that happen. Safe, friendly and useful robots in our homes for the win!

Research Interests: Reinforcement Learning, Dexterous Manipulation, VLMs, Visual Learning

Email / GitHub / Resume / LinkedIn

Robotics Intern

Jul '25 - Present

M.S. in ECE (Robotics)

Sep '24 - Dec '25

Robotics Intern

May '23 - Jul '23

B.Tech (Honors) in Aerospace

Nov '20 - May '24

News

Oct 2024: Excited about my first internship in the USA at Labelbox, as a Robotics Engineering Intern.
Oct 2024: Started working as a Graduate Researcher, advised by Prof. Nikolay Atanasov.
Sep 2024: Excited to start my graduate studies at UC San Diego!
May 2024: Graduated from IIT Bombay.
Apr 2024: Actually in the news! Indian Express article.
May 2023: Joined Flytbase Labs, a drone autonomy software platform startup, as a robotics intern.
Dec 2022: My first internship: joined Jio Platforms Ltd as a software intern on their knowledge graphs team.
Nov 2020: Excited to start a new journey at IIT Bombay.

Research

Graduate Research: Reward Engineering for an Allegro Hand to Grasp a Cube

Guide: Prof. Nikolay Atanasov
Coolest robotics project yet (detailed info): Slides [Code (env), Code (robot)]

Engineered rewards for an Allegro hand attached to an xArm robot, together comprising 21 DoF, to grasp a cube using RGB-D data as the observation. This involved breaking the reward into stages, each covering one segment of the overall task.

Bachelor's Thesis: Deep Reinforcement Learning for Control of a Robotic Arm for Inverse Kinematics

Guide: Prof. Mayank Baranwal
[Code]

Implemented a model-free reinforcement learning approach to train control policies for a robotic arm to navigate to any point in its workspace. Developed a reward structure in dm_control for a Kinova Jaco arm.

Multi-Robot Navigation in Cyclic Graphs

Guide: Prof. Arpita Sinha

In this proprietary work, I found the shortest time required by the MR DFS algorithm to explore unknown cyclic graphs by placing multiple robots at optimal nodes in the graph.

Key Technical Projects/Experiences

Autonomous underwater navigation

[Code]

Programmed RexROV2 to navigate autonomously underwater, using the curvature velocity method to avoid obstacles detected by the onboard sonar sensors. Simulated this in an underwater world using Gazebo in ROS.

The Humanoid Project

Team Lead, THP, an all-student humanoid team at IIT Bombay
[Insta page]

Led the revival and growth of the Humanoid team at IIT Bombay, securing its first funding, recruiting two successive student batches, and establishing a long-term technical and organizational roadmap. Spearheaded cross-subsystem coordination and mentored juniors hands-on, providing strategic and technical direction to ensure sustained development and execution.

Optimizing recalculation of Visibility Graphs

Research work as an intern at Flytbase Labs

Devised a novel technique to optimize the addition of new polygons in existing visibility graphs for drones. Reduced the recalculation time by around 92%.
