Dhruv Sheth

Computer Science & Robotics @ Caltech

dhruv_website_profile.jpg

I’m an undergraduate student at the California Institute of Technology (Caltech) studying Computer Science and Robotics. I’m currently working with Prof. Georgia Gkioxari in collaboration with AMBER Lab on improving 3D Vision Priors in Robotics. At Caltech, I’m fortunate to be advised by Prof. Soon Jo Chung and Prof. Joel Burdick.

I previously spent time as a ML intern at Persona AI, where I worked on robotics infra and explored scaling effects of data and model size on VLA generalization, at Edge Impulse (acqd. by Qualcomm), where I focused on ML model quantization for embedded systems, and at Luxonis, where I worked on the SpatialAI library for on-device computer vision.

My research interests include long-horizon reasoning in robotics and probing generalization in Vision Language Action (VLA) models.

news

Jun 10, 2025 Joined Persona AI as an ML intern this summer to work on VLAs and scale robotics infra.
Jun 09, 2025 Our paper “SYNAPSE: A Multi-Modal Framework for Interpretable Neural Decoding Using Vision-Language Foundation Models” was accepted to ICML 2025, FML4S workshop.
May 20, 2025 Awarded Second Place at Robotics Grasping and Manipulation Competition (RGMC), In-Hand manipulation track at ICRA 2025.
Jun 16, 2024 Awarded Dr. Jane Chen Summer Undergraduate Research Fellowship (SURF) to purse research at Dr. Joel Burdick’s Robotics Lab on the DARPA Learning Introspective Control (LINC) research program
Sep 18, 2023 Moved to Pasadena to start my undergraduate studies at Caltech!

selected publications

  1. SpatialTrace_Pipeline.png
    SpatialTraceGen: High-Fidelity Traces for Efficient VLM Spatial Reasoning Distillation
    Gio Huh*, Dhruv Sheth*, Rayhan Zirvi*, and Frank Xiao*
    Efficient Reasoning Workshop, NeurIPS (Preprint), 2025
  2. synapse_pic.png
    SYNAPSE: A Multi-Modal Framework for Interpretable Neural Decoding Using Vision-Language Foundation Models
    Dhruv Sheth*, Raaghav Malik*, Edward Ju*, Adarsh Kumarappan*, Shrujana Kunnam*, Geeling Chau, and Kevin Le
    FM4LS Workshop, ICML, 2025
  3. elasticl.png
    ElastiCL: Elastic Quantization for Communication Efficient Collaborative Learning in IoT
    Bharath Sudharsan, Dhruv Sheth, Shailesh Arya, Federica Rollo, Piyush Yadav, Pankesh Patel, John G. Breslin, and Muhammad Intizar Ali
    In Proceedings of the 19th ACM Conference on Embedded Networked Sensor Systems, 2021