IMG

Dissecting errors in machine learning for retrosynthesis: a granular metric framework and a transformer-based model for more informative predictions
Arihanth Srikar Tadanki, H Surya Prakash Rao, Deva Priyakumar U
Digital Discovery, DD, 2025
Core Rank : - Google Rank :20
Prompt-to-Correct: Automated Test-Time Pronunciation Correction with Voice Prompts
Ayan Kashyap, Neilkumar Milankumar Shah, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
A Preliminary Analysis of Automatic Word and Syllable Prominence Detection in Non-Native Speech With Text-to-Speech Prosody Embeddings
Anindita Mondal, Rangavajjala Sankara Bharadwaj, Mallela Jhansi, Anil Kumar Vuppala, Chiranjeevi Yarra
Technical Report, arXiv, 2025
Core Rank : - Google Rank :-
EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues
Girmaji Rohit, Bhav Beri, Ramanathan Subramanian, Vineet Gandhi
International Conference on Intelligent User Interfaces, IUI, 2025
Core Rank : A Google Rank :52
MRI2Speech: Speech Synthesis from Articulatory Movements Recorded by Real-time MRI
Neilkumar Milankumar Shah, Ayan Kashyap, Shirish Karande, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset
Neilkumar Milankumar Shah, Shirish Karande, Vineet Gandhi
Technical Report, arXiv, 2025
Core Rank : - Google Rank :-
Typical vs. Atypical Disfluency Classification: Introducing the IIITH-TISA Corpus and Temporal Context-Based Feature Representations
Parvathi Priyanka Kommagouni, Narasinga Vamshi Raghu Simha, Purva Barche, Sai Akarsh C, Anil Kumar Vuppala
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
The Sound of Water: Inferring Physical Properties from Pouring Liquids
Piyush Bagad, Makarand Tapaswi, Cees G. M. Snoek, Andrew Zisserman
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129
No Detail Left Behind: Revisiting Self-Retrieval for Fine-Grained Image Captioning
Manu Gaur, Darshan Singh S, Makarand Tapaswi
Transactions in Machine Learning Research, TMLR, 2025
Core Rank : - Google Rank :-
Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues
Girmaji Rohit, Siddharth Jain, Bhav Beri, Sarthak Bansal, Vineet Gandhi
International Conference on Acoustics, Speech, and Signal Processing, ICASSP, 2025
Core Rank : B Google Rank :129