DiscoGraMS: Enhancing Movie Screen-Play Summarization using Movie Character-Aware Discourse Graph
Maitreya Prafulla Chitale, Uday Bindal, Rajakrishnan P Rajkumar, Rahul Mishra
North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT, 2025
Core Rank: A, Google Rank: 132
Prompt-to-Correct: Automated Test-Time Pronunciation Correction with Voice Prompts
Ayan Kashyap, Neilkumar Milankumar Shah, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank: B, Google Rank: 129
LLMs for Generation of Architectural Components: An Exploratory Empirical Study in the Serverless World
Meghana Tedla, Shrikara A, Karthik Vaidhyanathan
IEEE International Conference on Software Architecture Companion, ICSA, 2025
Core Rank: A, Google Rank: 28
Achieving Fair PCA using Joint Eigenvalue Decomposition
Vidhi Rathore, Naresh Manwani
Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD, 2025
Core Rank: B, Google Rank: 30
A Preliminary Analysis of Automatic Word and Syllable Prominence Detection in Non-Native Speech With Text-to-Speech Prosody Embeddings
Anindita Mondal, Rangavajjala Sankara Bharadwaj, Mallela Jhansi, Anil Kumar Vuppala, Chiranjeevi Yarra
Technical Report, arXiv, 2025
Core Rank: -, Google Rank: -
MAPWise: Evaluating Vision-Language Models for Advanced Map Queries
Srija Mukhopadhyay, Abhishek Rajgaria, Prerana Khatiwada, Manish Shrivastava, Dan Roth, Vivek Gupta
North American Chapter of the Association for Computational Linguistics, NAACL, 2025
Core Rank: A
EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues
Girmaji Rohit, Bhav Beri, Ramanathan Subramanian, Vineet Gandhi
International Conference on Intelligent User Interfaces, IUI, 2025
Core Rank: A, Google Rank: 52
MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRI
Neilkumar Milankumar Shah, Ayan Kashyap, Shirish Karande, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank: B, Google Rank: 129
Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset
Neilkumar Milankumar Shah, Shirish Karande, Vineet Gandhi
Technical Report, arXiv, 2025
Core Rank: -, Google Rank: -
Typical vs. Atypical Disfluency Classification: Introducing the IIITH-TISA Corpus and Temporal Context-Based Feature Representations
Parvathi Priyanka Kommagouni, Narasinga Vamshi Raghu Simha, Purva Barche, Sai Akarsh C, Anil Kumar Vuppala
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank: B, Google Rank: 129