Title page for ETD etd-12162005-124025
( Browse | Search ) All Available ETDs
Type of Document Dissertation
Author Zheng, Lei
URN etd-12162005-124025
Title AUTOMATED FEATURE EXTRACTION AND CONTENT-BASED RETRIEVAL OF PATHOLOGY MICROSCOPIC IMAGES USING K-MEANS CLUSTERING AND CODE RUN-LENGTH PROBABILITY DISTRIBUTION
Degree Doctor of Philosophy
Program Information Science
School School of Information Sciences
Advisory Committee
Advisor Name Title
Paul Munro Committee Chair
David Foran Committee Member
Hassan Karimi Committee Member
John Gilbertson Committee Member
Michael Becich Committee Member
Keywords
  • information retrieval
  • color quantization
  • medical imaging
  • image retrieval
Date of Defense 2005-10-31
Availability unrestricted
Abstract
The dissertation starts with an extensive literature survey on the current issues in content-based image retrieval (CBIR) research, the state-of-the-art theories, methodologies, and implementations, covering topics such as general information retrieval theories, imaging, image feature identification and extraction, feature indexing and multimedia database search, user-system interaction, relevance feedback, and performance evaluation. A general CBIR framework has been proposed with three layers: image document space, feature space, and concept space. The framework emphasizes that while the projection from the image document space to the feature space is algorithmic and unrestricted, the connection between the feature space and the concept space is based on statistics instead of semantics. The scheme favors image features that do not rely on excessive assumptions about image content

As an attempt to design a new CBIR methodology following the above framework, k-means clustering color quantization is applied to pathology microscopic images, followed by code run-length probability distribution feature extraction. Kulback-Liebler divergence is used as distance measure for feature comparison. For content-based retrieval, the distance between two images is defined as a function of all individual features. The process is highly automated and the system is capable of working effectively across different tissues without human interference. Possible improvements and future directions have been discussed.

Files
  Filename       Size       Approximate Download Time (Hours:Minutes:Seconds) 
 
 28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access 
  lzheng-ETD-2005-2.pdf 2.54 Mb 00:11:44 00:06:02 00:05:17 00:02:38 00:00:13
If you have questions or comments please send mail to ETD-Feedback.