IIITH

MICap: A Unified Model for Identity-aware Movie Descriptions

Haran S K Raajesh, Naveen Reddy Desanur, Zeeshan Khan, Makarand Tapaswi

Computer Vision and Pattern Recognition, CVPR, 2024

Core Rank : A* Google Rank :440

Abs PDF bibTex

Previously On ... From Recaps to Story Summarization

Aditya Kumar Singh, Dhruv Srivastava, Makarand Tapaswi

Computer Vision and Pattern Recognition, CVPR, 2024

Core Rank : A* Google Rank :440

Abs PDF DOI bibTex

How you feelin? Learning Emotions and Mental States in Movie Scenes

Dhruv Srivastava, Aditya Kumar Singh, Makarand Tapaswi

Computer Vision and Pattern Recognition, CVPR, 2023

Core Rank : A* Google Rank :440

Abs PDF bibTex

GrapeQA: GRaph Augmentation and Pruning to Enhance Question-Answering

Dhaval Taunk, Lakshya Khanna, Kandru Siri Venkata Pavan Kumar, Vasudeva Varma Kalidindi, Charu Sharma, Makarand Tapaswi

WWW Workshop on Natural Language Processing for Knowledge Graph Construction, NLP4KGc, 2023

Core Rank : - Google Rank :-

Abs PDF bibTex

DO VIDEO-LANGUAGE FOUNDATION MODELS HAVE A SENSE OF TIME?

Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek

workshop on International Conference on Learning Representations, ICLR-W, 2023

Core Rank : - Google Rank :-

Abs PDF bibTex

Test of Time: Instilling Video-Language Models with a Sense of Time

Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek

Computer Vision and Pattern Recognition, CVPR, 2023

Core Rank : A* Google Rank :440

Abs PDF bibTex

Unsupervised Audio-Visual Lecture Segmentation

Darshan Singh S, Anchit Gupta, Jawahar C V, Makarand Tapaswi

Winter Conference on Applications of Computer Vision, WACV, 2023

Core Rank : - Google Rank :109

Abs PDF bibTex

Learning from Unlabeled 3D Environments for Vision-and-Language Navigation

Shizhe Chen, Pierre-louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev

European Conference on Computer Vision, ECCV, 2022

Core Rank : A* Google Rank :206

Abs bibTex

Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation

Shizhe Chen, Pierre-louis Guhur, Makarand Tapaswi, Cordelia Schmid, Ivan Laptev

Computer Vision and Pattern Recognition, CVPR, 2022

Core Rank : A* Google Rank :440

Abs PDF bibTex

Learning Object Manipulation Skills from Video via Approximate Differentiable Physics

Vladim´ır Petr´ık, Mohammad Nomaan Qureshi, Josef Sivic, Makarand Tapaswi

International Conference on Intelligent Robots and Systems, IROS, 2022

Core Rank : A Google Rank :86

Abs PDF bibTex