IIITH

A Current Injection Based Constant-gm Rail to Rail OTA Achieving Uniform Small And Large Signal Behaviour

International Conference on VLSI Design, VLSID, 2026

Core Rank : - Google Rank :-

@inproceedings{bib_A_Cu_2026, AUTHOR = {Khan, Mohammed Hammad and Zope, Saurabh and Acharyya, Ishan and Srivastava, Abhishek}, TITLE = {A Current Injection Based Constant-gm Rail to Rail OTA Achieving Uniform Small And Large Signal Behaviour}, BOOKTITLE = {International Conference on VLSI Design}, YEAR = {2026}}

Abstract

This paper presents a constant-gm, 1.2 V operational transconductance amplifier (OTA) designed in TSMC 65 nm CMOS technology. The proposed OTA achieves a constant-gm characteristic over a full rail-to-rail input/output range by employing a constant-gm rail-to-rail input stage, a folded cascode summing stage, and a quiescent-current-controlled Class-AB output stage. The input stage comprises complementary NMOS and PMOS differential pairs connected in parallel to achieve rail-to-rail input operation. Conventional parallel input stages suffer from significant gm variation with input common-mode voltage, leading to changes in small and large signal parameters such as open-loop gain, slew rate and unity-gain bandwidth (UGB). In this work, a current-injection mechanism dynamically adjusts the active input pair’s bias, maintaining a nearly constant transconductance across the entire common-mode range. Post-layout simulation shows that the proposed OTA achieves less than 4.8% gm variation across the entire input range while maintaining uniform large signal characteristics. Performance metrics include a DC gain of 80–87 dB, UGB of 9.72 MHz, phase margin of 64°, CMRR of 97.9 dB, and a power consumption of 260 μW when driving a 100 pF capacitive load.

Let Leaders Play Games: Improving Timing in Leader-based Consensus

International Conference on Autonomous Agents and Multiagent Systems, AAMAS, 2026

Core Rank : A* Google Rank :54

@inproceedings{bib_Let__2026, AUTHOR = {Ahmed, Mohammad Rasheed and Desai, Parth Nimish and Gujar, Sujit Prakash}, TITLE = {Let Leaders Play Games: Improving Timing in Leader-based Consensus}, BOOKTITLE = {International Conference on Autonomous Agents and Multiagent Systems}, YEAR = {2026}}

Abstract

Propagation latency is inherent to any distributed network, including blockchains. Typically, blockchain protocols allow some timing buffer for block propagation in the network. In leader-based blockchains, the leader (the block proposer) is known a priori for each slot. A fast (or low-latency) proposer may delay the block proposal in anticipation of more rewards from the transactions that otherwise would have been in the subsequent block. Manipulating the proposal timing in this way is known as playing timing games. It increases the risk of missed blocks due to reduced time for other nodes to vote on the block, affecting the overall efficiency of the blockchain. Additionally, as the proposers who play timing games essentially steal MEV that otherwise would have gone to the next block, it is unfair to the subsequent block proposers. We propose a dual block-proposal mechanism, 2-Prop, to curtail timing games. 2-Prop selects two proposers per slot to propose blocks, out of which one is finalized. To avoid strategic deviations, we design a reward-sharing policy for the proposers based on how fast these blocks are propagated. In the induced game, which we call the Latency Game, we show that it is a Nash Equilibrium for the proposers to propose the block as quickly as possible if both are under the same network conditions. We also study many configurations under disparate network conditions. Our analysis shows that a faster proposer would prefer not to delay unless the other proposer is extremely slow. Thus, the efficacy of 2-Prop in mitigating the effect of timing games is established.

STRinGS: Selective Text Refinement in Gaussian Splatting

Winter Conference on Applications of Computer Vision, WACV, 2026

Core Rank : - Google Rank :109

@inproceedings{bib_STRi_2026, AUTHOR = {Raundhal, Abhinav Digambar and Behera, Gaurav and J, Narayanan P and Sarvadevabhatla, Ravi Kiran and Tapaswi, Makarand}, TITLE = {STRinGS: Selective Text Refinement in Gaussian Splatting}, BOOKTITLE = {Winter Conference on Applications of Computer Vision}, YEAR = {2026}}

Abstract

Text in the form of signs, labels, or instructions is a critical element of real-world scenes, as it can convey important contextual information. 3D representations such as 3D Gaussian Splatting (3DGS) achieve high visual fidelity but struggle to preserve fine-grained text details. Small errors in textual element reconstruction can lead to significant semantic loss. We propose STRinGS, a text-aware, selective refinement framework to address this issue for 3DGS reconstruction. Our method treats text and non-text regions separately, refining text regions first and merging them with non-text regions later for full-scene optimization. STRinGS produces sharp, readable text even in challenging configurations. We introduce a text readability measure, OCR Character Error Rate (CER), to evaluate efficacy on text regions. STRinGS results in a 63.6% relative improvement over 3DGS at just 7K iterations. We also introduce a curated dataset, STRinGS-360, with diverse text scenarios to evaluate text readability in 3D reconstruction. Our method and dataset together push the boundaries of 3D scene understanding in text-rich environments, paving the way for more robust text-aware reconstruction methods.
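For readers unfamiliar with the metric, the following is a minimal sketch of how a character error rate is commonly computed: the Levenshtein edit distance between a reference string and an OCR output, divided by the reference length. The function names `levenshtein` and `cer` are illustrative, and the paper's exact OCR pipeline and normalization are not described in the abstract, so this should not be read as the authors' implementation.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn `hyp` into `ref`."""
    prev = list(range(len(hyp) + 1))  # distances for the empty ref prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hyp prefix
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: edit distance normalized by reference length."""
    if not ref:
        raise ValueError("reference text must be non-empty")
    return levenshtein(ref, hyp) / len(ref)
```

Under this definition a lower CER means more readable text, and the value can exceed 1.0 when the OCR output is much longer than the reference.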

VIZOR: Viewpoint-Invariant Zero-Shot Scene Graph Generation for 3D Scene Reasoning

IEEE Workshop on Applications of Computer Vision, IEEE WACV, 2026

Core Rank : A Google Rank :-

@inproceedings{bib_VIZO_2026, AUTHOR = {Vardhan, Madhavaram Vivek and Sengar, Vartika and De, Arkadipta and Sharma, Charu}, TITLE = {VIZOR: Viewpoint-Invariant Zero-Shot Scene Graph Generation for 3D Scene Reasoning}, BOOKTITLE = {IEEE Workshop on Applications of Computer Vision}, YEAR = {2026}}

Abstract

Scene understanding and reasoning has been a fundamental problem in 3D computer vision, requiring models to identify objects, their properties, and spatial or comparative relationships among the objects. Existing approaches enable this by creating scene graphs using multiple inputs such as 2D images, depth maps, object labels, and annotated relationships from a specific reference view. However, these methods often struggle with generalization and produce inaccurate spatial relationships like "left/right", which become inconsistent across different viewpoints. To address these limitations, we propose Viewpoint-Invariant ZerO-shot scene graph generation for 3D scene Reasoning (VIZOR). VIZOR is a training-free, end-to-end framework that constructs dense, viewpoint-invariant 3D scene graphs directly from raw 3D scenes. The generated scene graph is unambiguous, as spatial relationships are defined relative to each object’s front-facing direction, making them consistent regardless of the reference view. Furthermore, it infers open-vocabulary relationships that describe spatial and proximity relationships among scene objects without requiring annotated training data. We conduct extensive quantitative and qualitative evaluations to assess the effectiveness of VIZOR on scene graph generation and downstream tasks, such as query-based object grounding. VIZOR outperforms state-of-the-art methods, showing clear improvements in scene graph generation and achieving 22% and 4.81% gains in zero-shot grounding accuracy on the Replica and Nr3D datasets, respectively.

InteracTalker: Prompt-Based Human-Object Interaction with Co-Speech Gesture Generation

Winter Conference on Applications of Computer Vision, WACV, 2026

Core Rank : - Google Rank :109

@inproceedings{bib_Inte_2026, AUTHOR = {Rajan, Sreehari and Bhosikar, Kunal Kamalkishor and Sharma, Charu}, TITLE = {InteracTalker: Prompt-Based Human-Object Interaction with Co-Speech Gesture Generation}, BOOKTITLE = {Winter Conference on Applications of Computer Vision}, YEAR = {2026}}

Abstract

Generating realistic human motions that naturally respond to both spoken language and physical objects is crucial for interactive digital experiences. Current methods, however, address speech-driven gestures or object interactions independently, limiting real-world applicability due to a lack of integrated, comprehensive datasets. To overcome this, we introduce InteracTalker, a novel framework that seamlessly integrates prompt-based object-aware interactions with co-speech gesture generation. We achieve this by employing a multi-stage training process to learn a unified motion, speech, and prompt embedding space. To support this, we curate a rich human-object interaction dataset, formed by augmenting an existing text-to-motion dataset with detailed object interaction annotations. Our framework utilizes a Generalized Motion Adaptation Module that enables independent training, adapting to the corresponding motion condition, which is then dynamically combined during inference. To address the imbalance between heterogeneous conditioning signals, we propose an adaptive fusion strategy, which dynamically reweights the conditioning signals during diffusion sampling. InteracTalker successfully unifies these previously separate tasks, outperforming prior methods, including gesture-focused diffusion methods, in both co-speech gesture generation and object-interaction synthesis, and yielding highly realistic, object-aware full-body motions with enhanced flexibility and control. Project page: https://sreeharirajan.github.io/projects/InteracTalker/

SegMango: Early Deep Mango Yield Prediction based on Flower Segmentation and Weather Data

Winter Conference on Applications of Computer Vision, WACV, 2026

Core Rank : - Google Rank :109

@inproceedings{bib_SegM_2026, AUTHOR = {Vanabhai, Ven Janaksinh and Sharma, Charu and Azeemuddin, Syed}, TITLE = {SegMango: Early Deep Mango Yield Prediction based on Flower Segmentation and Weather Data}, BOOKTITLE = {Winter Conference on Applications of Computer Vision}, YEAR = {2026}}

Abstract

Early-stage fruit yield prediction plays a key role in supporting timely agronomic decisions, enhancing market planning, and empowering farmers with data-driven insights. Over the years, most approaches to yield estimation have focused on fruit counting techniques, typically performed just before harvest. While these methods have proven useful, they often come into play late in the cultivation cycle, limiting their impact on early planning and resource optimization. In this work, we introduce a comprehensive baseline framework for predicting mango yield at an earlier stage - during flowering - using image-based learning. Our contributions are twofold. (i) Our approach combines a SegFormer-based segmentation model with a regression pipeline to estimate yield from images, while also exploring the role of contextual features such as weather and scale. (ii) This work introduces a novel benchmark and an enriched dataset, paving the way for scalable, automated tools that can assist farmers and stakeholders in making proactive decisions throughout the mango growing season. Our work demonstrates that for multi-modal yield prediction, integrating features that complement visual representations (like scale) can be more impactful than using features with a stronger standalone linear correlation (like weather). Our single-image model, based on the SegFormer-B1 encoder, achieved a mean absolute error (MAE) of 7.68, R² of 0.76, and mean squared error (MSE) of 115.48. These results highlight the promise of vision-based models for yield estimation from early-stage flowering cues. To the best of our knowledge, this is the first work to address the prediction of mango yield using images from the flowering stage and weather data.
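The three reported metrics follow standard definitions, sketched below in plain Python. The function name `regression_metrics` and the sample values in the usage note are hypothetical and are not taken from the paper's data.

```python
def regression_metrics(y_true, y_pred):
    """Return (MAE, MSE, R^2) for paired observed and predicted values."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n           # mean absolute error
    mse = sum(e * e for e in errors) / n            # mean squared error
    mean_true = sum(y_true) / n
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)  # total variance
    ss_res = sum(e * e for e in errors)                 # residual variance
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all observed values are equal")
    r2 = 1.0 - ss_res / ss_tot                      # coefficient of determination
    return mae, mse, r2
```

For example, `regression_metrics([10, 20, 30, 40], [12, 18, 33, 37])` gives an MAE of 2.5 and an MSE of 6.5. Because MSE squares each error, a few large misses can raise it sharply even when the MAE stays modest.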

LORETTA: A Low Resource Framework To Poison Continuous Time Dynamic Graphs

Association for the Advancement of Artificial Intelligence, AAAI, 2026

Core Rank : A* Google Rank :220

@inproceedings{bib_LORE_2026, AUTHOR = {Pal, Himanshu and Bachina, Venkata Sai Pranav and Gangwal, Ankit and Sharma, Charu}, TITLE = {LORETTA: A Low Resource Framework To Poison Continuous Time Dynamic Graphs}, BOOKTITLE = {Association for the Advancement of Artificial Intelligence}, YEAR = {2026}}

Abstract

Temporal Graph Neural Networks (TGNNs) are increasingly used in high-stakes domains, such as financial forecasting, recommendation systems, and fraud detection. However, their susceptibility to poisoning attacks poses a critical security risk. We introduce LORETTA (Low Resource Two-phase Temporal Attack), a novel adversarial framework for Continuous-Time Dynamic Graphs, which degrades TGNN performance by an average of 29.47% across 4 widely used benchmark datasets and 4 State-of-the-Art (SotA) models. LORETTA operates through a two-stage approach: (1) sparsify the graph by removing high-impact edges using any of the 16 tested temporal importance metrics, and (2) strategically replace removed edges with adversarial negatives via LORETTA’s novel degree-preserving negative sampling algorithm. Our plug-and-play design eliminates the need for expensive surrogate models while adhering to realistic unnoticeability constraints. LORETTA degrades performance by up to 42.0% on MOOC, 31.5% on Wikipedia, 28.8% on UCI, and 15.6% on Enron. LORETTA outperforms 11 attack baselines, remains undetectable to 4 leading anomaly detection systems, and is robust to 4 SotA adversarial defense training methods, establishing its effectiveness, unnoticeability, and robustness.

Evaluation of Forest Fire Susceptibility in Mizoram State utilizing Analytical Hierarchy Processes and Frequency Ratios.

Forest Fire and Climate Change, FFCC, 2025

@inproceedings{bib_Eval_2025, AUTHOR = {KV, Suresh Babu and Pillutla, Rama Chandra Prasad}, TITLE = {Evaluation of Forest Fire Susceptibility in Mizoram State utilizing Analytical Hierarchy Processes and Frequency Ratios.}, BOOKTITLE = {Forest Fire and Climate Change}, YEAR = {2025}}

Abstract

The forest cover in Mizoram State is essential for the region’s socio-economic development and environmental sustainability. This state features abundant natural resources and diverse biodiversity, predominantly harbored within its forests. However, forest fires pose a significant risk, especially during dry seasons. Factors such as human activities, weather conditions, and traditional bamboo and Jhum cultivation practices contribute to the occurrence of forest fires in Mizoram. It’s crucial to map forest fire susceptibility to manage this risk effectively. This proactive approach enables better fire management, reduces risks, and facilitates informed decision-making. By identifying areas vulnerable to wildfires, it’s possible to promote sustainable land use practices, preserve ecosystems, and safeguard people and property. Statistical techniques such as frequency ratio (FR) and analytic hierarchy process (AHP) are used to generate the fire susceptibility maps for Mizoram based on satellite datasets. The MODIS Terra and Aqua active fire points (MCD14) are the basis, divided into training and testing datasets. AHP and FR techniques establish relationships between the training dataset and fourteen key factors, including slope, aspect, curvature, elevation, Normalized Difference Vegetation Index (NDVI), Normalized Multiband Drought Index (NMDI), rainfall, temperature, wind speed, and proximity to settlements and roads. Various datasets such as MODIS Terra (surface reflectance, land surface temperature, vegetation indices), SRTM Digital Elevation Model, ERA-5, and CHRS are utilized in this study. The accuracy of the susceptibility maps generated by FR and AHP is validated against active fire test datasets, ensuring reliability in predicting fire-prone areas.
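As a rough illustration of the frequency ratio technique named above, the sketch below uses the common textbook definition: for each class of a factor (for example, a slope bin), the share of fire points falling in that class divided by the share of the study area the class covers. The function name `frequency_ratio` and the class labels and counts in the usage note are hypothetical. The abstract does not give the study's exact formulation, so this is an assumption rather than the authors' procedure.

```python
def frequency_ratio(class_pixels: dict, class_fires: dict) -> dict:
    """Frequency ratio per class of one factor.

    class_pixels: number of pixels in each class of the factor.
    class_fires:  number of training fire points in each class.
    FR > 1 suggests fires are over-represented in the class,
    FR < 1 suggests they are under-represented.
    """
    total_pixels = sum(class_pixels.values())
    total_fires = sum(class_fires.values())
    if total_pixels == 0 or total_fires == 0:
        raise ValueError("need at least one pixel and one fire point")
    ratios = {}
    for name, pixels in class_pixels.items():
        if pixels == 0:
            continue  # a class with no area has no defined ratio
        fires = class_fires.get(name, 0)
        # (fires / total_fires) / (pixels / total_pixels), rearranged
        ratios[name] = (fires * total_pixels) / (pixels * total_fires)
    return ratios
```

With `class_pixels = {"low": 600, "mid": 300, "high": 100}` and `class_fires = {"low": 10, "mid": 30, "high": 60}`, the "high" class gets a ratio of 6.0, meaning fires occur there six times more often than its area share alone would predict. A susceptibility map is then typically built by summing each pixel's ratios across all factors.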

Island lens approach to historical landscape characterization of the Nicobar Islands using remote sensing

GeoJournal, GJ, 2025

Core Rank : - Google Rank :53

@inproceedings{bib_Isla_2025, AUTHOR = {Sulaiman, Firose and Pillutla, Rama Chandra Prasad}, TITLE = {Island lens approach to historical landscape characterization of the Nicobar Islands using remote sensing}, BOOKTITLE = {GeoJournal}, YEAR = {2025}}

Abstract

Landscapes are shaped by both natural and cultural factors, influencing their management. In the Andaman and Nicobar Islands, rapid land-use changes from tsunamis and human activities threaten both ecological integrity and local communities. While the Andaman Islands have been extensively studied, research on the Nicobar Islands remains limited, prompting this investigation of historical land-use and land-cover changes using geospatial data. Multi-temporal satellite imagery revealed significant changes across selected Nicobar Islands. Trinket Island, heavily impacted by the 2004 tsunami, experienced considerable forest loss, mudflat formation, and fluctuations in grassland and water body areas. Katchal Island saw major forest declines from the tsunami and human activity, with partial recovery hindered by ongoing seawater intrusion and erosion. Camorta Island experienced reduced forest and grassland areas post-tsunami, followed by gradual forest recovery and water body expansion. Nancowry Island, though minimally affected initially, later faced significant forest loss due to seawater intrusion, followed by recovery and settlement growth. Car Nicobar Island suffered severe forest loss from the tsunami but has seen partial vegetation recovery alongside settlement expansion. Additionally, the Nicobar Islands' tropical grasslands, crucial for biodiversity and local livelihoods, are under threat from environmental changes and proposed oil palm plantations, which could lead to further ecological disruption and cultural impacts.

Decoding Speech Perception from EEG: Extending the Broderick Dataset Study

India Joint International Conference on Data Science & Management of Data, COMAD/CODS, 2025

Core Rank : - Google Rank :19

@inproceedings{bib_Deco_2025, AUTHOR = {Kumar, Parvatam Pavan and Ghanathe, Raghav Rao}, TITLE = {Decoding Speech Perception from EEG: Extending the Broderick Dataset Study}, BOOKTITLE = {India Joint International Conference on Data Science \& Management of Data}, YEAR = {2025}}

Abstract

Decoding natural speech from non-invasive brain recordings is an exciting and rapidly evolving area in brain–computer interface (BCI) research. In this study, we build on the contrastive learning approach introduced by Défossez et al. (2023), which maps brain activity to self-supervised audio representations from wav2vec 2.0. Using the Broderick 2019 EEG dataset—comprising high-density recordings of participants listening to continuous, natural speech—we replicate this framework and explore a key extension. Specifically, we replace the original convolutional neural network used to process brain signals with a Transformer-based encoder. Given the Transformer’s strength in capturing long-range temporal patterns, we investigate whether this deeper temporal modeling can improve alignment between EEG signals and speech representations. Our findings show that contrastive learning significantly outperforms standard regression approaches, and that the Transformer-based model yields modest but consistent improvements over convolutional baselines. These results suggest that attention-based models may offer a promising direction for decoding semantic content from EEG, with potential applications in both cognitive neuroscience and non-invasive BCI systems.
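To make the contrastive objective concrete, here is a minimal pure-Python sketch of a one-directional softmax contrastive loss over a batch: each EEG embedding should score highest against its own audio embedding, with the other batch items acting as negatives. The function name `contrastive_loss`, the dot-product similarity, and the `temperature` parameter are assumptions for illustration. The abstract does not specify the loss used in this study, and the actual models operate on learned tensor features rather than Python lists.

```python
import math


def contrastive_loss(eeg, audio, temperature=1.0):
    """Mean cross-entropy over a batch of paired embeddings.

    eeg[i] and audio[i] form the positive pair; audio[j] for j != i
    serve as negatives for eeg[i].
    """
    if len(eeg) != len(audio) or not eeg:
        raise ValueError("need equally sized, non-empty batches")
    n = len(eeg)
    total = 0.0
    for i in range(n):
        # Similarity of EEG sample i to every audio sample in the batch.
        logits = [sum(x * y for x, y in zip(eeg[i], audio[j])) / temperature
                  for j in range(n)]
        # Numerically stable log-sum-exp for the softmax normalizer.
        peak = max(logits)
        log_norm = peak + math.log(sum(math.exp(v - peak) for v in logits))
        total += log_norm - logits[i]  # negative log-probability of the match
    return total / n
```

Correctly aligned pairs give a lower loss than shuffled ones, which is the signal that pushes the brain encoder (convolutional or Transformer-based) toward the audio representation space. A regression objective, by contrast, would penalize the distance to the target embedding directly, with no negatives involved.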