Digital Imaging & Data Compression
IMAGE INDEXING AND RETRIEVAL IN COMPRESSED DOMAIN

Overview

Present research in content-based image indexing and retrieval is based on pixels[1-2,54-56,58], which means that the imaging information system is constructed by storing original images. As the amount of storage required grows, data compression technology becomes essential. In such a system, the only way to use data compression to reduce the size of the database is to compress each image separately after the original has been analysed and indexed. Decompression is then required before any automatic retrieval can be performed. Data compression, however, is developed not only to save storage space but, more importantly, to improve the efficiency of information retrieval and processing, since unnecessary redundancy has been removed from the compressed data. Our research therefore focuses on investigating a number of original ideas to develop an image indexing and compression algorithm through which automatic image retrieval can operate directly in the compressed data domain.

Present research in automatic image indexing and retrieval is based on two approaches[38,41]: attribute-based retrieval and feature-extraction/object-recognition based retrieval. The former manually models the contents of the image as a set of attributes, which can then be managed and processed by conventional database management systems. The latter represents the latest efforts by the image processing community to extract automatically a set of features such as texture, colour, shapes and edges, and so to interpret the images stored in a database. Retrieval is carried out by similarity measurements, which produce a ranked order of similar images. A typical example of such a system is QBIC[9], developed by IBM researchers, which semi-automatically extracts object motion (for video images) and colour and texture patterns (for still images) as the main features.
Users may then use these features to construct sketches and example images with which to query the database. Research on image indexing and retrieval in the pixel domain is very active both in the UK and around the world. Fruitful research in the UK is represented by groups at the Universities of Glasgow, City, Bristol, Northumbria and Sheffield, among others[52-58]. Further details about the UK efforts in this area are available from the Library & Information Commission, UK.

In this context, our approach will first decompose images into a number of features such as colour, texture, shape and spatial relationship in the domain of orthonormal transforms[44]. The analysis of all these features will be integrated by a technique called visual activity analysis, which will act as the major parameter controlling both data compression and key construction for image indexing. On the one hand, a data compression algorithm can be designed by exploiting human visual perception[50,51,55] of different features to determine the appropriate approach for the best possible performance. On the other hand, a new compressed image indexing technique will be developed by using these compressed features as multiple keys to retrieve images. Hence, only compressed codes are stored inside the image database, and the quality of the compressed images can be controlled by providing three categories of data compression: (i) lossy compression, (ii) near lossless compression and (iii) lossless compression. As a result, both image indexing and data compression are achieved using one algorithm.

Our Research Programme

The research programme consists of the following phases.

Phase 1. Conventional Image Database Design

In this phase, the primary goal is to design a conventional image database to provide a research platform and establish a benchmarking system for the proposed research.
This work will also enable the investigators to build a simple yet demonstrable working model, upon which further improvements can be made in areas such as user-friendly access and convenience, retrieval speed, on-line editing and browsing. Apart from those images publicly available on the Internet (such as QBIC), some special images will also be added to test both the data compression performance and image retrieval in the compressed domain.

Phase 2. Algorithm Development for Image Indexing in Compressed Data Domain (month 5-30)

In this phase, effort is concentrated on developing algorithms for image indexing using multiple keys in the compressed data domain. The research will be carried out in three stages.

Data compression for image indexing: The data compression technology to be developed in this stage of the project is specifically aimed at providing transparent compression that works seamlessly with automatic image indexing and retrieval. This will be achieved by developing a quality-adaptive, content-based and rate-controllable image compression technique. The term "quality adaptive" means that the compression performance of the algorithm can be controlled automatically by a quality requirement for each individual reconstructed image, with compression ranging from lossy through near lossless to lossless. In response to the quality requirement specified by end users, the algorithm determines which of these three compression techniques is appropriate. For lossy compression, DCT-based JPEG is a good candidate to work on, especially its quality index controlling mechanism. For near lossless and lossless compression, however, it will be necessary to investigate: (a) small quantization, (b) statistics modelling for entropy coding and (c) non-linear prediction based on local feature analysis.
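The "small quantization" idea for near lossless compression can be sketched as follows. This is a minimal illustration, not the project's algorithm: it assumes a uniform quantizer applied to prediction residuals with bin width 2*near + 1, in the style of the JPEG-LS near-lossless mode. The function names and the error bound `near` are hypothetical. With near = 0 the mapping is the identity, which corresponds to the lossless case.

```python
def quantize_residual(residual: int, near: int) -> int:
    """Map a prediction residual to a quantization index (bin width 2*near + 1)."""
    step = 2 * near + 1
    if residual >= 0:
        return (residual + near) // step
    return -((near - residual) // step)


def dequantize_residual(index: int, near: int) -> int:
    """Reconstruct the residual at the centre of its quantization bin."""
    return index * (2 * near + 1)


def max_reconstruction_error(near: int, lo: int = -255, hi: int = 255) -> int:
    """Largest absolute reconstruction error over all residuals in [lo, hi]."""
    return max(
        abs(r - dequantize_residual(quantize_residual(r, near), near))
        for r in range(lo, hi + 1)
    )
```

Under these assumptions the reconstruction error of every residual is bounded by `near`, so a single parameter moves the coder between lossless and near lossless operation.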
To integrate DCT-based lossy compression with the prediction-based near lossless and lossless compression, further investigation will be required to streamline the operations for all three categories and hence make the overall algorithm cost-effective and easy to implement on various computing platforms.

For indexing-based compression, established object identification and recognition techniques will be exploited. We will not, however, carry out any development work in object recognition; instead, we will use well-established techniques, such as bit-mapped alpha planes, to reflect the features of images in data compression. The emerging world-wide standard MPEG-4[51], for instance, is a successful example in this context. It is envisaged that indexing-based compression will provide better linkage with automatic retrieval, especially when manual editing and highlighting of areas of interest are added.

Rate-controllable compression means that the compressed bit rate is held close to a fixed value, independent of the variety of images inside the database. This will provide better management of the storage space for the whole image population. For example, users often expect the storage space to remain the same when one image is deleted and replaced by another. Hence, to provide such concealed data compression for image database management, rate control is essential. It will be implemented by adaptively controlling the quantization step in the local feature analysis of the surrounding pixels as each pixel is encoded[35]. In addition, texture analysis can be embedded in the non-linear prediction process. JPEG-LS is a good example of such non-linear prediction, in which a horizontal or vertical edge is detected first, before the predicted value is selected. The rate-controllable technique, however, is only applicable to lossy and near lossless compression.
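The edge-detecting prediction mentioned above can be sketched with the median edge detector used in JPEG-LS. This is a minimal illustration rather than the project's predictor: the function names are hypothetical, and treating neighbours outside the image as 0 is a simplifying assumption that differs from the border rules of the standard.

```python
def med_predict(left: int, above: int, upper_left: int) -> int:
    """Median edge detector: pick a neighbour when an edge is detected."""
    if upper_left >= max(left, above):
        return min(left, above)
    if upper_left <= min(left, above):
        return max(left, above)
    # No edge detected: planar prediction from the three neighbours.
    return left + above - upper_left


def prediction_residuals(image):
    """Residuals for a 2-D list of ints; out-of-image neighbours count as 0."""
    residuals = []
    for y, row in enumerate(image):
        out = []
        for x, value in enumerate(row):
            left = row[x - 1] if x > 0 else 0
            above = image[y - 1][x] if y > 0 else 0
            upper_left = image[y - 1][x - 1] if x > 0 and y > 0 else 0
            out.append(value - med_predict(left, above, upper_left))
        residuals.append(out)
    return residuals
```

For an image containing a sharp vertical edge, the predictor follows the edge from the row above, so the residuals on the second and later rows are zero and cost very little to entropy-code.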
Construction of multiple keys: Following the success of our previous work[26], we will use histograms as the starting point from which to build multiple keys. To reflect the content of each image, the histograms will be constructed using various mature feature analysis techniques, including colour mapping[37], texture analysis[38,39] and pattern recognition[41]. To improve the indexing performance, manual editing and highlighting will be added so that areas of the query image deemed to be important can be marked before the database population is searched. To combine image indexing and data compression in one single algorithm, it is required in principle that the multiple keys or histograms be constructed from the compressed codes or from the compression process. A number of approaches will be investigated, depending on which techniques are adopted for the analysis and identification of features such as colour, shape, spatial relationship and texture. Specific ideas to be investigated will include:
The activity analysis can be performed using mature feature analysis techniques developed in the image processing community, such as segmentation[37,44], edge detection[37-39,41], contour and texture analysis[7,37] and spatial relationship construction[37-39,41]. Correspondingly, a number of histograms can be built up during the process to reflect various aspects of the image features for more efficient image indexing and retrieval.
Various distance measurements need to be extensively investigated in order to provide the best possible performance for indexing and retrieval in various contexts.
For query images with manually highlighted regions, adaptive measurement and search will be developed in the algorithm by adding weighting factors and priorities to these highlighted regions. For the database population, however, it is sufficient to develop the search process in terms of the shape and the size of these highlighted regions.
Co-ordination of multiple keys is required to optimise the overall performance, not only for image indexing but also for data compression.
Browsing of targeted images will be further designed to enable users to achieve accurate retrieval, exhaustive search and flexibility in working procedures.

Feature analysis in the transformed coefficients domain: A number of possible options will be investigated to see whether mature feature analysis techniques can be implemented in terms of transformed coefficients rather than pixels. Doing so would enable us to construct histograms directly from these coefficients rather than from the activity values used to determine the encoding of each pixel or coefficient. In this stage, wavelet transforms[43,45,49], DCT[51] and K-L transforms implemented by various neural networks[28,51] will be the candidates for investigation.
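One way to picture a histogram built directly from transform coefficients is sketched below. It is an assumption-laden illustration, not the project's method: it uses the DC coefficient of each 8 x 8 orthonormal DCT block as the only feature, and the names `dct2` and `dc_histogram` and the bin count are hypothetical. The DCT is recomputed from pixels here only to keep the example self-contained; in a compressed-domain system the coefficients would be taken from the decoded bitstream instead.

```python
import math

N = 8  # block size used by DCT-based JPEG


def dct2(block):
    """Orthonormal 2-D DCT-II of an N x N block given as a list of lists."""
    def alpha(k):
        return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)

    cos = [[math.cos((2 * n + 1) * k * math.pi / (2 * N)) for n in range(N)]
           for k in range(N)]
    return [[alpha(u) * alpha(v) * sum(block[y][x] * cos[u][y] * cos[v][x]
                                        for y in range(N) for x in range(N))
             for v in range(N)]
            for u in range(N)]


def dc_histogram(image, bins=8, max_value=255):
    """Histogram of block DC coefficients; image sides must be multiples of N."""
    hist = [0] * bins
    dc_max = N * max_value  # orthonormal DC coefficient = N * block mean
    for by in range(0, len(image), N):
        for bx in range(0, len(image[0]), N):
            block = [row[bx:bx + N] for row in image[by:by + N]]
            dc = dct2(block)[0][0]
            index = min(int(dc / dc_max * bins), bins - 1)
            hist[index] += 1
    return hist
```

Because the DC coefficient is proportional to the block mean, this histogram behaves like a coarse intensity histogram, yet it needs only one coefficient per block.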
The development work carried out at this stage will only be useful for lossy compression based indexing and retrieval, due to the fact that, in this project, lossy compression will be developed from established transform based compression techniques such as DCT-based JPEG. Figure 1 illustrates an overall structure of our image indexing and retrieval system.
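Retrieval with multiple histogram keys can be sketched as a weighted similarity ranking. This is a minimal sketch under stated assumptions: histogram intersection is only one candidate among the distance measurements to be investigated, and the key names, weights and function names below are hypothetical.

```python
def normalise(hist):
    """Scale a histogram so that its bins sum to 1 (all zeros if it is empty)."""
    total = sum(hist)
    return [h / total for h in hist] if total else [0.0] * len(hist)


def intersection(h1, h2):
    """Histogram intersection of two normalised histograms; 1.0 means identical."""
    return sum(min(a, b) for a, b in zip(h1, h2))


def rank(query_keys, database, weights):
    """Return image names sorted by decreasing weighted similarity to the query.

    query_keys and each database entry map a key name (e.g. "colour")
    to a histogram; weights maps the same key names to their importance.
    """
    scores = {}
    for name, keys in database.items():
        scores[name] = sum(
            weights[k] * intersection(normalise(query_keys[k]), normalise(keys[k]))
            for k in weights
        )
    return sorted(scores, key=scores.get, reverse=True)
```

Raising the weight of one key, for instance colour over texture, is a simple way to model the priorities a user attaches to a query.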
Phase 3. Database Management Systems

Image databases capable of supporting large visual information systems could contain millions of images and video clips. To improve the searching speed, and so achieve effective on-line retrieval, a high-performance algorithm will be designed to classify and map the database population into an appropriate structure that supports comprehensive and efficient multi-key access. To enable users to explore the database interactively, the possibility of giving users facilities to analyse and modify the query image will be investigated. The starting points for our investigation are the quad-tree[46] and the R*-tree[2].

http://www.lic.gov.uk/research/information_retrieval/ir-calla.html

This project is funded by the Library & Information Commission in the UK.
References