I am a PhD student at the University of Washington, advised by Dieter Fox. Previously, I was a researcher at NUS under David Hsu. My interests are in Human-Robot Interaction, Computer Vision, Natural Language Processing, and Machine Learning. I graduated with a Bachelors in Computer Eng. from NUS. During my stint as an undergrad, I spent a year at Stanford, and also interned at a YCombinator AR startup.
Contact: mshr ( at ) cs.uw.edu
ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks.
Mohit Shridhar, Jesse Thomason, Daniel Gordon, Yonatan Bisk,
Winson Han, Roozbeh Mottaghi, Luke Zettlemoyer, Dieter Fox
Website | Abstract | PDF | Video | Code | BibTex
Interactive Visual Grounding of Referring Expressions for Human-Robot Interaction.
Mohit Shridhar, David Hsu
Robotics: Science & Systems (RSS) 2018.
Abstract | PDF | Video | Code | Poster | Slides | BibTeX
XPose: Reinventing user interaction with flying cameras.
Ziquan Lan, Mohit Shridhar, David Hsu, Shengdong Zhao.
Robotics: Science & Systems (RSS) 2017.
Best Systems Paper Award in Memory of Seth Teller
Abstract | PDF | Video | Slides | BibTeX
INGRESS: Interactive Visual Grounding of Referring Expressions.
Mohit Shridhar, Dixant Mittal, David Hsu
International Journal of Robotics Research (IJRR) 2020.
PDF (coming soon)
Free-Viewpoint Video Reconstruction for Immersive Telepresence
Bachelor of Engineering Thesis, Dept. of Electrical and Computer Engineering 2016.
NUS 30th Annual Faculty Innovation and Research Award
PDF | Video
Computer Vision & Graphics Engineer, Intern
Jan 2015 – Dec 2015
Worked on VI-SLAM and Holographic Skype for AR Headset (Hololens & MagicLeap competitor).
Part of CEO’s ensemble for building demo content during Series B ($50M round)
Robotics Engineer, Intern
May 2014 – Aug 2014
Worked on AGVs for transporting medical supplies inside hospitals
Built a multi-map planner for navigation across multiple floors
May 2013 – Aug 2013
Built a functioning tribometer using just 3D-printed and laser-cut parts.
Autodesk University Panorama 2013 Finalist
Dense-Semantic SLAM (2017)
Combining Monocular SLAM with Dense Captioning for object retrieval
Higgs Boson Detection Challenge (2015)
Deep-learning for classifying Higgs Boson to tau-tau signal events
Paper | Poster | Code
Textured Quads (2016) Code
Rviz plugin for displaying images and videos
Arduino Oscilloscope (2013) Code
Cheap alternative to digital oscilloscopes
Some of my collaborative work has been featured in:
I am a fan of films, science fiction, and Oxford commas.