I am a PhD student at the University of Washington, advised by Dieter Fox. Previously, I was a researcher at NUS under David Hsu. My interests are in Human-Robot Interaction, Computer Vision, Natural Language Processing, and Machine Learning. I graduated with a Bachelors in Computer Eng. from NUS. During my stint as an undergrad, I spent a year at Stanford, and also interned at a YCombinator AR startup.
Contact: mshr ( at ) cs.uw.edu
ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks.
Mohit Shridhar, Jesse Thomason, Daniel Gordon, Yonatan Bisk,
Winson Han, Roozbeh Mottaghi, Luke Zettlemoyer, Dieter Fox
Computer Vision and Pattern Recognition (CVPR) 2020.
Website | Abstract | PDF | Video | Code | BibTex
Interactive Visual Grounding of Referring Expressions for Human-Robot Interaction.
Mohit Shridhar, David Hsu
Robotics: Science & Systems (RSS) 2018.
Abstract | PDF | Video | Code | Poster | Slides | BibTeX
XPose: Reinventing user interaction with flying cameras.
Ziquan Lan, Mohit Shridhar, David Hsu, Shengdong Zhao.
Robotics: Science & Systems (RSS) 2017.
Best Systems Paper Award in Memory of Seth Teller
Abstract | PDF | Video | Slides | BibTeX
INGRESS: Interactive Visual Grounding of Referring Expressions.
Mohit Shridhar, Dixant Mittal, David Hsu
International Journal of Robotics Research (IJRR) 2020.
PDF (coming soon)
Free-Viewpoint Video Reconstruction for Immersive Telepresence
Bachelor of Engineering Thesis, Dept. of Electrical and Computer Engineering 2016.
NUS 30th Annual Faculty Innovation and Research Award
PDF | Video
Computer Vision & Graphics Engineer, Intern
Jan 2015 – Dec 2015
Worked on VI-SLAM and Holographic Skype for AR Headset (Hololens & MagicLeap competitor).
Part of CEO’s ensemble for building demo content during Series B ($50M round)
Robotics Engineer, Intern
May 2014 – Aug 2014
Worked on AGVs for transporting medical supplies inside hospitals
Built a multi-map planner for navigation across multiple floors
May 2013 – Aug 2013
Built a functioning tribometer using just 3D-printed and laser-cut parts.
Autodesk University Panorama 2013 Finalist
Dense-Semantic SLAM (2017)
Combining Monocular SLAM with Dense Captioning for object retrieval
Higgs Boson Detection Challenge (2015)
Deep-learning for classifying Higgs Boson to tau-tau signal events
Paper | Poster | Code
Textured Quads (2016) Code
Rviz plugin for displaying images and videos
Arduino Oscilloscope (2013) Code
Cheap alternative to digital oscilloscopes
Some of my collaborative work has been featured in:
I am a fan of films, science fiction, and Oxford commas.