Abstract
Image search using the bag-of-words image representation is investigated further in this paper. This approach has shown promising results for large scale image collections making it relevant for Internet applications. The steps involved in the bag-of-words approach are feature extraction, vocabulary building, and searching with a query image. It is important to keep the computational cost low through all steps. In this paper we focus on the efficiency of the technique. To do that we substantially reduce the dimensionality of the features by the use of PCA and addition of color. Building of the visual vocabulary is typically done using k-means. We investigate a clustering algorithm based on the leader follower principle (LF-clustering), in which the number of clusters is not fixed. The adaptive nature of LF-clustering is shown to improve the quality of the visual vocabulary using this. In the query step, features from the query image are assigned to the visual vocabulary. The dimensionality reduction enables us to do exact feature labeling using kD-tree, instead of approximate approaches normally used. Despite the dimensionality reduction to between 6 and 15 dimensions we obtain improved results compared to the traditional bag-of-words approach based on 128 dimensional SIFT feature and k-means clustering.
Original language | English |
---|---|
Title of host publication | 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops |
Place of Publication | Anchorage, Alaska |
Publisher | IEEE |
Publication date | 2008 |
Pages | 1-6 |
ISBN (Print) | 14-24-42339-2 |
DOIs | |
Publication status | Published - 2008 |
Event | 2008 IEEE Conference on Computer Vision and Pattern Recognition - Anchorage, United States Duration: 23 Jun 2008 → 28 Jun 2008 http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=4558014 |
Conference
Conference | 2008 IEEE Conference on Computer Vision and Pattern Recognition |
---|---|
Country/Territory | United States |
City | Anchorage |
Period | 23/06/2008 → 28/06/2008 |
Internet address |
Bibliographical note
Copyright: 2008 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEEKeywords
- object recognition
- bag-of-words model
- color SIFT features