About Me

I am a Research Scientist at the Dyson Robot Learning Lab. I received my PhD from the University of Washington, where I was advised by Dieter Fox. Previously, I was a researcher at NUS under David Hsu. My interests are in Human-Robot Interaction, Computer Vision, Natural Language Processing, and Machine Learning. I graduated with a Bachelors in Computer Eng. from NUS. During my stint as an undergrad, I spent a year at Stanford, and also interned at a YCombinator AR startup.

See my CV for more details. 

Contact: mshr └[∵┌]└[ ∵ ]┘[┐∵]┘ cs.washington.edu

Publications

Conferences

Screen Shot 2022-09-10 at 1.43.21 PM


Perceiver-Actor: A Multi-Task Transformer
for Robotic Manipulation.
Mohit Shridhar, Lucas Manuelli, Dieter Fox
Conference on Robot Learning (CoRL) 2022.
Website | Abstract | PDF | VideoColab | Code | Talk | BibTex

Screen Shot 2021-09-23 at 8.39.29 PM

CLIPort: What and Where Pathways
for Robotic Manipulation.
Mohit Shridhar, Lucas Manuelli, Dieter Fox
Conference on Robot Learning (CoRL) 2021.
Website | Abstract | PDF | VideoCode | BibTex


Screen Shot 2021-09-23 at 8.16.19 PM
Language Grounding with 3D Objects.
Jesse Thomason*, Mohit Shridhar*, Yonatan Bisk,
Chris Paxton, and Luke Zettlemoyer
Conference on Robot Learning (CoRL) 2021.
Abstract | PDF | VideosCode | BibTex


Screen Shot 2020-10-10 at 3.38.07 PM
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning.
Mohit Shridhar, Xingdi Yuan, Marc-Alexandre Côté,
Yonatan Bisk, Adam Trischler, Matthew Hausknecht
International Conference on Learning Representations (ICLR) 2021.
Website | Abstract | PDF | VideoCode | BibTex

alfred_thumb-e1576915166279.png


ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks.

Mohit Shridhar, Jesse Thomason, Daniel Gordon, Yonatan Bisk,
Winson Han, Roozbeh Mottaghi, Luke Zettlemoyer, Dieter Fox
Computer Vision and Pattern Recognition (CVPR) 2020.
Website | Abstract | PDF | VideoCode | BibTex


exp_setup_v2.jpg
Interactive Visual Grounding of Referring Expressions for Human-Robot Interaction.
Mohit Shridhar, David Hsu
Robotics: Science & Systems (RSS) 2018.
Abstract | PDF | VideoCode | Poster | Slides | BibTeX

rafflesXPose: Reinventing user interaction with flying cameras.
Ziquan Lan, Mohit Shridhar, David Hsu, Shengdong Zhao. 
Robotics: Science & Systems (RSS) 2017.
Best Systems Paper Award
AbstractPDF | Video | Slides | BibTeX

Journals

ijrr_thumb-e1576913763971.png


INGRESS: Interactive Visual Grounding of Referring Expressions.

Mohit Shridhar, Dixant Mittal, David Hsu
International Journal of Robotics Research (IJRR) 2020.
PDF | Video

Theses

liveframe_davidFree-Viewpoint Video Reconstruction for Immersive Telepresence 
Mohit Shridhar
Bachelor of Engineering Thesis, Dept. of Electrical and Computer Engineering 2016.
NUS 30th Annual Faculty Innovation and Research Award
PDF | Video

Experience

Internships

nvidiaNVIDIA Seattle
Seattle Robotics Lab, Research Intern
July 2022 – Sep 2022

microsoft

Microsoft – Redmond
Reinforcement Learning Group, Research Intern
June 2020 – Sep 2020

nvidia

NVIDIA Seattle
Seattle Robotics Lab, Research Intern
Jan 2020 – Mar 2020

meta

Meta San Mateo
Computer Vision & Graphics Team, Software Intern
Jan 2015 – Dec 2015

hopetechnik

HopeTechnik Singapore
Robotics Team, Software Intern
May 2014 – Aug 2014

Projects

Research

shield_slam_small-e1505654738992.jpgShield SLAM (2015)   
Monocular SLAM for Android devices.
Paper | Code

dense_semantic-e1509871384650.pngDense-Semantic SLAM (2017) 
Combining Monocular SLAM with Dense Captioning for object retrieval
Video

dropout_smallHiggs Boson Detection Challenge (2015)   
Deep-learning for classifying Higgs Boson to tau-tau signal events
Paper | PosterCode


Others

multimapMulti-Map Manager 
(2014)   Video | Code
ROS package for managing multiple static maps (e.g: different floors)

oculusOculus-Rift Gazebo Navigator (2014)    Video | Code
Joystick based navigation tool for FPS navigation in Gazebo

quadsTextured Quads (2016)    Code
Rviz plugin for displaying images and videos

tascaTASCA (2013)    Video | Code
Java-based todo-list application

ardo2.jpgArduino Oscilloscope (2013)    Code
Cheap alternative to digital oscilloscopes

Media

Some of my collaborative work has been featured in:

tc-techcrunch-e1508071918323.png             ted-logo-fb.png             IEEE-Spectrum.Horizontal              gazebo_hor             autodesk-university-2016-logo-2-line-color-black.png

Miscellaneous

My Erdos number is at most three (Mohit Shridhar → David Hsu → Maria Klawe → Paul Erdős).
But unfortunately my Erdos–Bacon–Sabbath number is undefined.

I am a fan of films, science fiction, and Oxford commas.