Title page for ETD etd-11082006-011004
Type of Document Dissertation
Author Liu, Xiong
URN etd-11082006-011004
Title Integrating Protein Data Resources through Semantic Web Services
Degree Doctor of Philosophy
Program Information Science
School School of Information Sciences
Advisory Committee
Advisor Name Title
Hassan Karimi Committee Chair
Ivet Bahar Committee Member
John Vries Committee Member
Michael Lewis Committee Member
Vladimir Zadorozhny Committee Member
Keywords
  • bioinformatics
  • user-oriented integration
  • semantic matching
  • data integration
  • protein data
  • semantic web services
Date of Defense 2006-11-07
Availability unrestricted
Abstract
Understanding the function of every protein is a major objective of bioinformatics. A large amount of information related to protein function (e.g., sequence, structure, and dynamics) is currently being produced by experiments and predictions. Integrating these diverse data about protein sequence, structure, dynamics, and other features makes it possible to explore and establish the relationships among sequence, structure, dynamics, and function, and thereby to control the function of target proteins. However, integrating information across protein data resources faces challenges at the technology level, in interfacing heterogeneous data formats and standards, and at the application level, in semantically interpreting dissimilar data and queries.

This research proposes a semantic web services infrastructure, called Web Services for Protein data resources (WSP), for flexible and user-oriented integration of protein data resources. The infrastructure includes a method for modeling protein web services, a service publication algorithm, an efficient service discovery (matching) algorithm, and an optimal service chaining algorithm. Rather than relying on syntactic matching, the matching algorithm discovers services based on their semantic similarity to the requested service. Users can therefore locate services that semantically match their data requirements even when the services are syntactically different. WSP also supports a workflow-based approach to service integration: the chaining algorithm selects and chains services according to service accuracy and data interoperability, and generates a web services workflow that automatically integrates the results from the individual services.
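As a rough illustration of similarity-based (rather than syntactic) service discovery, the sketch below ranks advertised services against a request using a toy concept hierarchy. It is not the WSP matching algorithm: the ontology, the concept names, the service names, and the Jaccard-style similarity measure are all assumptions made for this example.

```python
# Hypothetical toy ontology: child concept -> parent concept.
# Concept names are illustrative only and are not taken from the dissertation.
PARENT = {
    "ProteinSequence": "ProteinData",
    "ProteinStructure": "ProteinData",
    "ProteinDynamics": "ProteinData",
    "FastaSequence": "ProteinSequence",
    "PdbStructure": "ProteinStructure",
    "ProteinData": None,
}


def ancestors(concept):
    """Return the concept together with all of its ancestors."""
    found = set()
    while concept is not None:
        found.add(concept)
        concept = PARENT.get(concept)
    return found


def similarity(a, b):
    """Jaccard overlap of the two ancestor sets (an assumed measure).

    Identical concepts score 1.0; concepts that share only the root score low.
    """
    sa, sb = ancestors(a), ancestors(b)
    return len(sa & sb) / len(sa | sb)


def match(request, services):
    """Rank services by the mean of input and output concept similarity."""
    scored = []
    for name, (inp, out) in services.items():
        score = (similarity(request[0], inp) + similarity(request[1], out)) / 2
        scored.append((score, name))
    # Highest score first; ties broken alphabetically by service name.
    return [name for score, name in sorted(scored, key=lambda p: (-p[0], p[1]))]


# Hypothetical advertised services: name -> (input concept, output concept).
SERVICES = {
    "fetch_pdb": ("FastaSequence", "PdbStructure"),
    "predict_dynamics": ("PdbStructure", "ProteinDynamics"),
}

# The request names neither service's concepts exactly, so a purely
# syntactic comparison of concept names would find no match at all.
ranking = match(("ProteinSequence", "ProteinStructure"), SERVICES)
print(ranking)
```

In this toy setting the request is expressed with more general concepts than either advertisement, yet the hierarchy still lets the closer service be ranked first.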

A number of experiments are conducted to evaluate the performance of the matching algorithm. The results show that the algorithm can discover services with reasonable performance. In addition, a composite service that integrates protein dynamics and conservation is tested using the WSP infrastructure.

Files
  Filename                      Size
  XiongLiu_Thesis_11_2006.pdf   1.86 MB

  Approximate Download Time (Hours:Minutes:Seconds)
  28.8 Modem   56K Modem   ISDN (64 Kb)   ISDN (128 Kb)   Higher-speed Access
  00:08:35     00:04:25    00:03:52       00:01:56        00:00:09
If you have questions or comments, please send mail to ETD-Feedback or view the University of Pittsburgh Electronic Theses and Dissertations (ETD) Project page.