IIITH

Contextual restoration of severely degraded document images

Computer Vision and Pattern Recognition, CVPR, 2009

Core Rank : A* Google Rank :440

@inproceedings{bib_Cont_2009, AUTHOR = {Banerjee, Jyotirmoy and Namboodiri, Anoop and V, Jawahar C}, TITLE = {Contextual restoration of severely degraded document images}, BOOKTITLE = {Computer Vision and Pattern Recognition}, YEAR = {2009}}

Abstract

We propose an approach to restore severely degraded document images using a probabilistic context model. Unlike traditional approaches that use previously learned prior models to restore an image, we are able to learn the text model from the degraded document itself, making the approach independent of script, font, style, etc. We model the contextual relationship using an MRF. The ability to work with larger patch sizes allows us to deal with severe degradations including cuts, blobs, merges and vandalized documents. Our approach can also integrate document restoration and super-resolution into a single framework, thus directly generating high-quality images from degraded documents. Experimental results show significant improvement in image quality on document images collected from various sources including magazines and books, and comprehensively demonstrate the robustness and adaptability of the approach. It works well with document collections such as books, even with severe degradations, and hence is ideally suited for repositories such as digital libraries.

Retrieval of online handwriting by synthesis and matching

Pattern Recognition, PR, 2009

Core Rank : - Google Rank :118

@inproceedings{bib_Retr_2009, AUTHOR = {V, Jawahar C and Subramanian, A Bala and Meshesha, Million and Namboodiri, Anoop}, TITLE = {Retrieval of online handwriting by synthesis and matching}, BOOKTITLE = {Pattern Recognition}, YEAR = {2009}}

Abstract

Search and retrieval is gaining importance in the ink domain due to the increase in the availability of online handwritten data. However, the problem is challenging due to variations in handwriting between various writers, digitizers and writing conditions. In this paper, we propose a retrieval mechanism for online handwriting, which can handle different writing styles, specifically for Indian languages. The proposed approach provides a keyboard-based search interface that enables searching handwritten data from any platform, in addition to pen-based and example-based queries. One of the major advantages of this framework is that information retrieval techniques such as ranking relevance, detecting stopwords and controlling word forms are extended to work with search and retrieval in the ink domain. The framework also allows cross-lingual document retrieval across Indian languages.

Efficient privacy preserving video surveillance

International Conference on Computer Vision, ICCV, 2009

Core Rank : A* Google Rank :291

@inproceedings{bib_Effi_2009, AUTHOR = {Upmanyu, Maneesh and Namboodiri, Anoop and Kannan, Srinathan and V, Jawahar C}, TITLE = {Efficient privacy preserving video surveillance}, BOOKTITLE = {International Conference on Computer Vision}, YEAR = {2009}}

Abstract

The widespread use of surveillance cameras in offices and other business establishments poses a significant threat to the privacy of employees and visitors. The challenge of introducing privacy and security in such a practical surveillance system has been stifled by the enormous computational and communication overhead required by the solutions. In this paper, we propose an efficient framework to carry out privacy-preserving surveillance. We split each frame into a set of random images. Each image by itself does not convey any meaningful information about the original frame, while collectively, they retain all the information. Our solution is derived from a secret sharing scheme based on the Chinese Remainder Theorem, suitably adapted to image data. Our method enables distributed secure processing and storage, while retaining the ability to reconstruct the original data in case of a legal requirement. The system installed in an office-like environment can effectively detect and track people, or solve similar surveillance tasks. Our proposed paradigm is highly efficient compared to Secure Multiparty Computation, making privacy-preserving surveillance practical.
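The arithmetic behind a Chinese Remainder Theorem split can be sketched as follows. This is only an illustration of the split-and-reconstruct step, not the authors' scheme: the moduli (7, 11, 13) and the function names are hypothetical, and the sketch omits whatever randomization or scaling the paper's adaptation to image data uses, so the plain residues here are not secure on their own.

```python
from math import prod

# Hypothetical pairwise-coprime moduli; their product (1001) exceeds 255,
# so every 8-bit pixel value has a unique residue tuple.
MODULI = (7, 11, 13)


def split_pixel(value, moduli=MODULI):
    """Return one residue per modulus; each residue is one share."""
    return [value % m for m in moduli]


def reconstruct_pixel(shares, moduli=MODULI):
    """Recombine residues with the Chinese Remainder Theorem."""
    total = prod(moduli)
    result = 0
    for share, m in zip(shares, moduli):
        partial = total // m
        # pow(x, -1, m) is the modular inverse (Python 3.8+).
        result += share * partial * pow(partial, -1, m)
    return result % total


def split_frame(frame, moduli=MODULI):
    """Split a 2D list of pixels into len(moduli) share images."""
    return [[[p % m for p in row] for row in frame] for m in moduli]
```

For example, `split_pixel(200)` yields the three residues of 200 modulo 7, 11 and 13, and `reconstruct_pixel` recovers 200 only when all three shares are available.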

Efficient graph-based image matching for recognition and retrieval

National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics, NCVPRIPG, 2008

Core Rank : - Google Rank :-

@inproceedings{bib_Effi_2008, AUTHOR = {Dasigi, Praveen and V, Jawahar C}, TITLE = {Efficient graph-based image matching for recognition and retrieval}, BOOKTITLE = {National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics}, YEAR = {2008}}

Abstract

Graphs can be used for effective representation of images for recognition and retrieval purposes. The problem is often to find a proper structure that can efficiently describe an image and can be matched at reasonably low computational expense. The standard solutions to the graph matching problem are computationally expensive since the search space involves all permutations of the node sets. We compare two graphical representations, Nearest-Neighbor Graphs and Collocation Trees, for goodness of fit and the computational expense involved in matching. Various schemes to index the graphical structures are also discussed.

Oxford/IIIT TRECVID 2008-Notebook paper.

Text Retrieval Conference Video Retrieval Evaluation, TRECVID, 2008

Core Rank : - Google Rank :-

@inproceedings{bib_Oxfo_2008, AUTHOR = {Philbin, James and Marin-Jimenez, Manuel and Srinivasan, Siddharth and Zisserman, Andrew and Jain, Mihir and Sreekanth, V and Kompalli, Pramod Sankar and V, Jawahar C}, TITLE = {Oxford/IIIT TRECVID 2008-Notebook paper.}, BOOKTITLE = {Text Retrieval Conference Video Retrieval Evaluation}, YEAR = {2008}}

Abstract

The Oxford/IIIT team participated in the high-level feature extraction and interactive search tasks. A vision-only approach was used for both tasks, with no use of the text or audio information. For the high-level feature extraction task, we used two different approaches, both based on a combination of visual features. One used an SVM classifier using a linear combination of kernels, the other used a random forest classifier. For both methods, we trained all high-level features using publicly available annotations [3]. The advantage of the random forest classifier is the speed of training and testing. In addition, for the people feature, we took a more targeted approach. We used a real-time face detector and an upper body detector, in both cases running on every frame. Our best performing submission, C_OXVGG_1_1, which used a rank fusion of our random forest and SVM approach, achieved an mAP of 0.101 and was above the median for all but one feature. In the interactive search task, our team came third overall with an mAP of 0.158. The system used was identical to last year's, the only change being a source of accurate upper body detections.
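The abstract does not say which rank fusion rule was used. As a rough sketch of the general idea, the following assumes a simple mean-rank rule: each classifier's scores are converted to ranks and items are ordered by their average rank. The function names, the tie-break by item name, and the example scores are all hypothetical.

```python
def ranks(scores):
    """Map each item to its rank under one classifier (1 = highest score)."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return {item: i + 1 for i, item in enumerate(ordered)}


def rank_fusion(*score_dicts):
    """Order items by mean rank across classifiers (lower mean rank first).

    Assumes every classifier scored the same set of items.
    Ties in mean rank are broken by item name so the output is deterministic.
    """
    all_ranks = [ranks(s) for s in score_dicts]
    items = list(all_ranks[0])
    mean_rank = {
        item: sum(r[item] for r in all_ranks) / len(all_ranks) for item in items
    }
    return sorted(items, key=lambda item: (mean_rank[item], item))


# Made-up scores for four shots from two classifiers.
svm_scores = {"a": 0.9, "b": 0.8, "c": 0.3, "d": 0.1}
forest_scores = {"a": 0.5, "b": 0.9, "c": 0.6, "d": 0.1}
fused = rank_fusion(svm_scores, forest_scores)
```

Working on ranks rather than raw scores avoids having to calibrate the two classifiers' outputs against each other, which is one common reason to fuse at the rank level.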

VIDEO FRAME ALIGNMENT IN MULTIPLE VIEWS

International Conference on Image Processing, ICIP, 2008

Core Rank : B Google Rank :66

@inproceedings{bib_VIDE_2008, AUTHOR = {Kuthirummal, Sujit and V, Jawahar C and J, Narayanan P}, TITLE = {VIDEO FRAME ALIGNMENT IN MULTIPLE VIEWS}, BOOKTITLE = {International Conference on Image Processing}, YEAR = {2008}}

Abstract

Many events are captured using multiple cameras today. Frames of each video stream have to be synchronized and aligned to a common time axis before processing them. Synchronization of the video streams necessarily needs a hardware-based solution that is applied while capturing. The alignment problem between the frames of multiple videos can be posed as a search using traditional measures for image similarity. Multiview relations and constraints developed recently in computer vision can provide more elegant solutions to this problem. In this paper, we provide two solutions for the video frame alignment problem using two-view and three-view constraints. We present solutions to this problem for the case when the videos are taken using affine cameras and for general projective cameras. Excellent experimental results are achieved by our algorithms.

Retrieval from Image Datasets with Repetitive Structures

National Conference on Communications, NCC, 2008

Core Rank : - Google Rank :16

@inproceedings{bib_Retr_2008, AUTHOR = {Dasigi, Praveen and V, Jawahar C}, TITLE = {Retrieval from Image Datasets with Repetitive Structures}, BOOKTITLE = {National Conference on Communications}, YEAR = {2008}}

Abstract

This work aims to enhance matching and retrieval performance over image datasets in which similar spatial structures occur very frequently. Instead of treating images as bags of features, we encode the spatial relationships in the representation. This helps to resolve the ambiguity when two classes of images have similar sets of features but in different spatial arrangements. To demonstrate this, a sizeable dataset of license plate images is used. We propose a method that uses graphs to encode the spatial relationships among features. The problem of image matching thus reduces to finding the maximum similarity between labelled graphs. It is shown that the precision of the retrieved results increases with this matching scheme since most of the false matches are eliminated.
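A toy example shows why encoding spatial relationships helps. It assumes each feature is reduced to a character label read left to right, and it uses a simple shared-edge ratio as the similarity; both are illustrative choices, not the graph construction or the similarity measure used in the paper.

```python
from collections import Counter


def bag_of_features(labels):
    """Order-free representation: only the label counts survive."""
    return Counter(labels)


def adjacency_edges(labels):
    """Labelled graph as a multiset of directed left-to-right neighbour edges."""
    return Counter(zip(labels, labels[1:]))


def edge_similarity(a, b):
    """Shared edges (multiset intersection) divided by the larger edge count."""
    ea, eb = adjacency_edges(a), adjacency_edges(b)
    shared = sum((ea & eb).values())
    return shared / max(sum(ea.values()), sum(eb.values()), 1)
```

Two hypothetical plates such as `"AP09"` and `"PA90"` have identical bags of features, yet they share no neighbour edges, so the graph representation separates them where the bag representation cannot.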

Feature Selection for Hand-Geometry based Person Authentication

IEEE Transactions on Image Processing, TIP, 2008

Core Rank : A* Google Rank :113

@inproceedings{bib_Feat_2008, AUTHOR = {Roy, Vandana and V, Jawahar C}, TITLE = {Feature Selection for Hand-Geometry based Person Authentication}, BOOKTITLE = {IEEE Transactions on Image Processing}, YEAR = {2008}}

Abstract

Biometric traits such as fingerprints, hand geometry, face and voice provide a reliable alternative for identity verification and are gaining commercial adoption and high user acceptability. Hand geometry based biometric verification has proven to be the most suitable and acceptable biometric trait for medium and low security applications. Geometric measurements of the human hand have been used for identity authentication in a number of commercial systems. However, not much research has been done on selecting the optimal discriminating features for hand-geometry based authentication systems. In this paper, we argue that the biometric verification problem is best posed as a single-class problem. We propose to apply Biased Discriminant Analysis and Nonparametric Discriminant Analysis in order to transform the features into a new space where the samples are well separated.

Hybrid visual servoing by boosting IBVS and PBVS

International Conference on Information and Communication Technologies and Applications, ICTTA, 2008

Core Rank : - Google Rank :-

@inproceedings{bib_Hybr_2008, AUTHOR = {Hafez, A. H. Abdul and Cervera, Enric and V, Jawahar C}, TITLE = {Hybrid visual servoing by boosting IBVS and PBVS}, BOOKTITLE = {International Conference on Information and Communication Technologies and Applications}, YEAR = {2008}}

Abstract

In this paper, we present a novel boosted robot vision control algorithm. This method utilizes on-line boosting to produce a strong vision-based robot control starting from two weak algorithms. These weak methods are image-based and position-based visual servoing algorithms. The notions of weak and strong algorithms are presented in the context of robot vision control. Appropriate error functions are defined for the weak algorithms to evaluate their suitability for the task. The integrated algorithm has superior performance in both image and Cartesian spaces. Experiments validate this claim.

Real Time L∞-based Solution to Multi-view Problems with Application to Visual Servoing

International Conference on Information and Communication Technologies and Applications, ICTTA, 2008

Core Rank : - Google Rank :-

@inproceedings{bib_Real_2008, AUTHOR = {Hafez, A. H. Abdul and V, Jawahar C}, TITLE = {Real Time L∞-based Solution to Multi-view Problems with Application to Visual Servoing}, BOOKTITLE = {International Conference on Information and Communication Technologies and Applications}, YEAR = {2008}}

Abstract

In this paper, we present a novel real-time algorithm to sequentially solve a class of multi-view geometry problems. The triangulation problem is considered as a case study. The problem concerns the estimation of the coordinates of a 3D point given its images and the projection matrices of the cameras used in the imaging process. The algorithm has direct application to real-time systems like virtual reality, visual SLAM, and visual servoing. Application to visual servoing is considered in detail. Experiments have been carried out for the general triangulation problem as well as the application to visual servoing.
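In the multi-view geometry literature, an L∞ formulation of triangulation usually means choosing the 3D point that minimizes the maximum reprojection error over all views. Assuming that is the cost meant here, the sketch below only evaluates that cost for a candidate point; it does not implement the paper's sequential real-time solver. Cameras are assumed to be 3x4 projection matrices given as lists of rows, and all function names are hypothetical.

```python
def project(P, X):
    """Project 3D point X = (x, y, z) with a 3x4 camera matrix P (list of rows)."""
    xh = (X[0], X[1], X[2], 1.0)
    u, v, w = (sum(p * c for p, c in zip(row, xh)) for row in P)
    return (u / w, v / w)


def reprojection_errors(cameras, observations, X):
    """Euclidean image-plane error of candidate point X in each view."""
    errors = []
    for P, (ox, oy) in zip(cameras, observations):
        px, py = project(P, X)
        errors.append(((px - ox) ** 2 + (py - oy) ** 2) ** 0.5)
    return errors


def linf_cost(cameras, observations, X):
    """L-infinity cost: the worst (maximum) reprojection error over all views."""
    return max(reprojection_errors(cameras, observations, X))


# Two made-up cameras: one at the origin, one shifted by one unit along x.
P1 = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0]]
P2 = [[1, 0, 0, -1], [0, 1, 0, 0], [0, 0, 1, 0]]
true_point = (0.0, 0.0, 2.0)
observations = [project(P1, true_point), project(P2, true_point)]
```

With noise-free observations the cost at `true_point` is zero, while a candidate at the wrong depth is penalized by its single worst view rather than by the sum over views.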