- John R. Smith
-
- Dept. of Electrical Engineering
- Columbia University
- New York, N.Y. 10027 USA
- (212) 939-7153
- jrsmith@ctr.columbia.edu
- http://www.ctr.columbia.edu/~jrsmith
- Shih-Fu Chang
- Dept. of Electrical Engineering
- Columbia University
- New York, N.Y. 10027 USA
- (212) 854-6894
- sfchang@ctr.columbia.edu
- http://www.ctr.columbia.edu/~sfchang
Abstract
We present a system for querying for images by the spatial and feature attributes of regions (SaFe). The system enables the user to find the images that contain an arrangement of regions similar to that diagrammed in a query image. We propose a general framework which allows for different types of features (e.g., color, texture, shape, motion) to be integrated with spatial information in the query process. We demonstrate that integrated spatial and feature querying improves image search capabilities over previous content-based image retrieval methods.
In this paper, we present the general framework and a prototype system for querying for images by spatial and feature attributes. The spatial and feature (SaFe) system integrates content-based techniques with spatial query methods in order to search for images by arrangements of regions. SaFe has been deployed on-line in an application for querying in a large collection of unconstrained images (more than 650,000 images). Our contribution is the use of fully automated region and feature extraction and indexing, and the integration of spatial and feature image querying. These capabilities of SaFe distinguish it from other recent image retrieval systems (Virage [1], QBIC [2] and Photobook [3]) which do not provide this enhanced functionality.
The spatial and feature image query paradigm provides a powerful method for image retrieval. However, it is extremely complex in that it requires that several disparate image query techniques be combined. First, the feature query component requires the assessment of the feature similarities of regions. Second, the spatial query component requires the assessment of the similarities in spatial locations and sizes of regions. Third, the system requires the comparison of images consisting of multiple regions. Last, the system requires that the spatial relationships, such as ``above,'' ``below,'' ``near,'' and so forth, be resolved. As depicted in Figure 1, the overall comparison of images utilizes both the feature and spatial attributes of the regions in computing their similarity.
Figure 1: Integrated spatial and feature query. Images are compared by comparing the spatial arrangements of regions. (Click on the figure to see a larger version)
The region attributes are stored in a region relational table (REGION), as depicted in Table 1. The spatial attributes of a region are determined by the spatial centroid of the region (x, y), the area of the region, and the width and height (w, h) of the minimum bounding rectangle (MBR) that encapsulates the region. The feature attributes (
) of the region correspond to the visual features of the region, such as its color, texture, shape, motion and so forth.
| | | x | y | area | w | h |
| 0001 | 0001 | | 18 | 63 | 430 | 30 | 15 |
|
0001 | 0002 | | 34 | 45 | 968 | 65 | 32 |
|
0002 | 0001 | | 76 | 54 | 780 | 53 | 42 |
|
0003 | 0001 | | 55 | 12 | 654 | 43 | 55 |
Table 1: The REGION relation stores region attributes.
The region attribute queries are performed using a parallel strategy. For example, given the query region:
, the query is processed by computing queries individually for each attribute. The region attribute match lists are then Joined to obtain the best set of regions. We have also explored a pipeline strategy for the region attribute queries. The query is processed by first computing the query on the feature attribute. This output is then filtered by location, then by size and spatial extent. The pipeline strategy avoids the computation of the attribute Join required in the parallel strategy. However, a special indexing structure may only be used one attribute, i.e., feature value. The pipeline strategy cannot effectively utilize special indexes for the other attributes that follow in the pipeline, i.e., R-tree for region MBRs, or quad-tree for region centroids.
The overall image query strategy consists of Joining the results of the queries on the individual regions in the query image. The Join of the region match results identifies the candidate target images. For these images, the image match score is computed by adding the weighted region scores. In the final stage, the relative spatial locations that may have been specified in the query are evaluated using query-time 2-D string projection. A 2-D string comparison determines whether candidate target images satisfy the constraints of the relative region placement. The image match process is illustrated in Fig. 2.
Figure 2: Overall strategy for computing image matches by Joining the individual regions queries and resolving spatial relationships at the last stage. (Click on the figure to see a larger version)
Querying by absolute locations is not easily extended to include relative locations of regions. We solve the problem of the combinatorial explosion by delaying projection and evaluation of relative spatial relationships until the final stage of the query. The query-time 2-D string projection is generated from the image match lists as follows: the region centroids (x, y) are first quantized. The quantization step size, which is the size of the grid in the query formulation area, is set by the user. The regions are then projected in the x- and y- directions and are sorted to produce the 2-D string. Given the 2-D strings for the query image Q and target images
, the spatial relation constraints in Q are evaluated in the
's by 2-D string comparison. The 2-D string comparison is carried out using a straight-forward implementation of the 2-D string compare function described in [4].
The integrated spatial and color queries are formulated graphically using the interface depicted in Figure 3. The user selects regions, positions them on the query grid and assigns them properties of color, size and absolute location. The user assigns boundaries for region location and size. The user also chooses absolute and/or relative spatial querying. In the following query we illustrate the power and flexibility of the system over non-spatial techniques.
Figure 3: SaFe user interface and returned image matches. (Click on the figure to see a larger version)
In order to extract the color regions from images, we developed a color set back-projection technique. We briefly describe the technique here (see [5] for more details). The back-projection of a color set is accomplished as follows: given image I[x,y] and color set
, let k be the index of the color at image point I[x,y], then generate image B[x,y] by
where
measures the similarity of colors j and k. After back-projecting the model color set, image B[x,y] is filtered and analyzed to reveal a set of spatially localized color regions that match color set c.
We illustrate an example image query to find images that depict blue skies and greenery. Prior to the trials, the test-set of 3,100 images (see [5]) was inspected and each image was manually assigned a relevance of 0, 0.5 or 1. The queries and results are depicted in Fig. 4. The SaFe query (denoted by
) specifies two regions (upper is blue and lower is dark green) and their spatial layout (top left of Figures 4). The best matches (illustrated on top from left to right) have a similar arrangement of similarly colored regions. For the color histogram (denoted by
) and color set (denoted by
) queries, the best match image from
is used as the seed image. Query
is computed using the histogram quadratic distance metric from [2]. Query
is computed using the binary set quadratic distance metric from [5]. The plot in Fig. 4 shows that the retrieval effectiveness for the SaFe query
improves substantially over the global color feature methods given by
and
.
Figure 4: Example color photographic image queries, Q = query images. Best four matches are listed from left to right, where
= SaFe query,
= color histograms,
= color sets. (Click on the figures to see a larger versions)
We described a new paradigm and prototype system for image searching which integrates spatial (region absolute and relative locations, and size) and feature querying (visual features, i.e., color). Content-based image query systems ([1, 2, 3]) do not provide both types of querying. Since the discrimination of images is only partially provided by global features, the system utilizes image regions and their features, sizes, spatial locations, and relationships in order to compare images. The integration of content-based and spatial querying provides for a highly functional query system which allows for wide variety of complex spatial and feature queries. We have recently combined the SaFe image query system with the WebSEEk (WebSEEk) image and video search engine [6] to allow for integrated spatial and feature querying of images from the World-Wide Web.
References
- 1
-
J. R. Bach, C. Fuller, A. Gupta, A. Hampapur, B. Horowitz, R. Humphrey, R. C.
Jain, and C. Shu.
Virage image search engine: an open framework for image management.
In Symposium on Electronic Imaging: Science and Technology -
Storage & Retrieval for Image and Video Databases IV, volume 2670, pages
76 - 87. IS&T/SPIE, 1996.
- 2
-
M. Flickner, H. Sawhney, W. Niblack, J. Ashley, Q. Huang, B. Dom, M. Gorkani,
J. Hafner, D. Lee, D. Petkovic, D. Steele, and P. Yanker.
Query by image and video content: The QBIC system.
IEEE Computer, 28(9):23 - 32, September 1995.
- 3
-
A. Pentland, R. W. Picard, and S. Sclaroff.
Photobook: Tools for content-based manipulation of image databases.
In Proceedings of the SPIE Storage and Retrieval Image and Video
Databases II, February 1994.
- 4
-
S.-K. Chang.
Principles of Pictorial Information Systems Design.
Prentice-Hall, Inc, Englewood Cliffs, NJ, 1989.
- 5
-
J. R. Smith. Integrated Spatial and Feature Image Systems: Retrieval, Analysis and Compression.
PhD thesis, Columbia University, New York, NY, February 1997.
- 6
-
J. R. Smith and S.-F. Chang.
WebSEEk: a content-based image and video search engine for the World-Wide Web.
IEEE Multimedia, Summer 1997.
We provide several example queries that further demonstrate the SaFe system. We also provide links to the prototype image search systems and a link to the java source code for the SaFe system.
We illustrate six "live" example image queries using the SaFe system. In each query, the query grid is illustrated in the icon on the left. By clicking the icon, the query is issued "live" to the SaFe system. For best results, turn-off all caching at the client browser.
| 1. |  | Symbolic image query (absolute locations). Demonstrates spatial querying with three symbols. The best symbolic image matches have three regions with the same symbols in similar locations to those depicted in the query image. |
| 2. |  | Symbolic image query (relative locations). Demonstrates spatial querying with two symbols. The best symbolic image matches have two regions with the same symbols in the same spatial arrangement as that depicted in the query image. |
| 3. |  | Synthetic image query 1.Demonstrates spatial and feature querying with two color regions. The best synthetic image matches have two regions that are similarly colored and located as the regions depticted in the query image. |
| 4. |  | Synthetic image query 2.Demonstrates spatial and feature querying with three color regions. The best synthetic image matches have three regions that are similarly colored and located as the regions depicted in the query image. |
| 5. |  | Color photographic image query (Sunsets images).Demonstrates spatial and feature querying with color photographic images. The goal of the query is to retrieve images of sunsets. The best photographic image matches have two regions that are similarly colored and located as the regions depicted in the query image. |
| 6. |  | Color photographic image query (Nature images)Demonstrates spatial and feature querying with color photographic images. The goal of the query is to retrieve images of nature. The best photographic image matches have two regions that are similarly colored and located to the regions depicted in the query image. |
We provide links to the SaFe integrated spatial and feature image query system and to other systems. The SaFe system implements all of the functions described in this manuscript. The SaFe provides several image test-collections (symbolic images, synthetic images and color photographic images) to illustrate the types of querying provided. The WebSEEk system extends the image search paradigm to produce an image and video search engine for the WWW. VisualSEEk provides additional content-based image query functions.
 | SaFe integrated spatial and feature image query system |
 | WebSEEk content-based image and video search engine |
 | VisualSEEk content-based image search system |
The Java source code for the SaFe query interface is provided by Columbia's content-based visual query group directed by Prof. Shih-Fu Chang. It was developed by John R. Smith.
Document and demos prepared by John R. Smith, March, 1997