21 search results found.
Learning from Unlabeled 3D Environments for Vision-and-Language Navigation
Shizhe Chen, Pierre-Louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev
European Conference on Computer Vision, ECCV, 2022
Core Rank : A* Google Rank :206
Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation
Shizhe Chen, Pierre-Louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev
Computer Vision and Pattern Recognition, CVPR, 2022
Core Rank : A* Google Rank :440
Learning Object Manipulation Skills from Video via Approximate Differentiable Physics
Vladimír Petrík, Mohammad Nomaan Qureshi, Josef Sivic, Makarand Tapaswi
International Conference on Intelligent Robots and Systems, IROS, 2022
Core Rank : A Google Rank :86
Instruction-driven history-aware policies for robotic manipulations
Pierre-Louis Guhur, Shizhe Chen, Ricardo Garcia, Makarand Tapaswi, Ivan Laptev, Cordelia Schmid
Conference on Robot Learning, CoRL, 2022
Core Rank : - Google Rank :88
Can we Adopt Self-supervised Pretraining for Chest X-Rays?
Arsh Verma, Makarand Tapaswi
Machine Learning for Health Workshop, ML4H, 2022
Core Rank : - Google Rank :-
Language Conditioned Spatial Relation Reasoning for 3D Object Grounding
Shizhe Chen, Pierre-Louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev
Neural Information Processing Systems, NeurIPS, 2022
Core Rank : A* Google Rank :337
Sonus Texere! Automated Dense Soundtrack Construction for Books using Movie Adaptations
Jaidev Shriram, Makarand Tapaswi, Vinoo A R
International Society for Music Information Retrieval, ISMIR, 2022
Core Rank : - Google Rank :40
Grounded Video Situation Recognition
Zeeshan Khan, Jawahar C V, Makarand Tapaswi
Neural Information Processing Systems, NeurIPS, 2022
Core Rank : A* Google Rank :337
Long term spatio-temporal modeling for action detection
Makarand Tapaswi, Vijay Kumar, Ivan Laptev
Computer Vision and Image Understanding, CVIU, 2021
Core Rank : - Google Rank :48
Airbert: In-domain Pretraining for Vision-and-Language Navigation
Pierre-Louis Guhur, Makarand Tapaswi, Shizhe Chen, Ivan Laptev, Cordelia Schmid
International Conference on Computer Vision, ICCV, 2021
Core Rank : A* Google Rank :291