Computational Method for Annotation of Protein Sequence According to Gene Ontology Terms

Annotation of a protein sequence is pivotal for the understanding of its function. Accuracy of manual annotation provided by curators is still questionable by having lesser evidence strength and yet a hard task and time consuming. A number of computational methods including tools have been developed to tackle this challenging task. However, they require high-cost hardware, are difficult to be setup by the bioscientists, or depend on time intensive and blind sequence similarity search like Basic Local Alignment Search Tool. This paper introduces a new method of assigning highly correlated Gene Ontology terms of annotated protein sequences to partially annotated or newly discovered protein sequences. This method is fully based on Gene Ontology data and annotations. Two problems had been identified to achieve this method. The first problem relates to splitting the single monolithic Gene Ontology RDF/XML file into a set of smaller files that can be easy to assess and process. Thus, these files can be enriched with protein sequences and Inferred from Electronic Annotation evidence associations. The second problem involves searching for a set of semantically similar Gene Ontology terms to a given query. The details of macro and micro problems involved and their solutions including objective of this study are described. This paper also describes the protein sequence annotation and the Gene Ontology. The methodology of this study and Gene Ontology based protein sequence annotation tool namely extended UTMGO is presented. Furthermore, its basic version which is a Gene Ontology browser that is based on semantic similarity search is also introduced.





References:
[1] A. Chinnasamy, A. Mittal, and W.K. Sung, "Probabilistic prediction of
protein-protein interactions from the protein sequences," Computers in
Biology & Medicine, vol. 36, no. 10, pp. 1143-1154, Oct. 2006.[2] L. Pireddu, D. Szafron, P. Lu, and R. Greiner, "The Path-A metabolic
pathway prediction web server," Nucleic Acids Research, vol. 34, no. 1,
pp. W714-W719, Jul. 2006.
[3] G.K. Acquaah-Mensah, S.M. Leach, and C. Guda, "Predicting the
subcellular localization of human proteins using machine learning and
exploratory data analysis," Genomics Proteomics Bioinformatics, vol. 4,
no. 2, pp. 120-133, May 2006.
[4] E.J. Whitfield, M. Pruess, and R. Apweiler, "Bioinformatics database
infrastructure for biotechnology research," J. Biotechnology, vol. 124,
no. 4, pp. 629-639, Aug. 2006.
[5] C. Brooksbank, G. Cameron, and J. Thornton, "The European
Bioinformatics Institute-s data resources: towards systems biology,"
Nucleic Acids Research, vol. 33, no. 1, pp. D46-D53, Jan. 2005.
[6] R. Apweiler, A. Bairoch, and C.H. Wu, "Protein sequence databases,"
Current Opinion in Chemical Biology, vol. 8, no. 1, pp. 76-80, Feb.
2004.
[7] E. Kretschmann, W. Fleischmann, and R. Apweiler, "Automatic rule
generation for protein annotation with the C4.5 data mining algorithm
applied on SWISS-PROT," Bioinformatics, vol. 17, no. 10, pp. 920-926,
Oct. 2001.
[8] C.H. Wu, H. Huang, L.S. Yeh, and W.C. Barker, "Protein family
classification and functional annotation," Computational Biology &
Chemistry, vol. 27, no. 1, pp. 37-47, Feb. 2003.
[9] A. Gattiker, K. Michoud, C. Rivoire, A.H. Auchincloss, E. Coudert, T.
Lima, P. Kersey, M. Pagni, C.J. Sigrist, C. Lachaize, A.L. Veuthey, E.
Gasteiger, and A. Bairoch, "Automated annotation of microbial
proteomes in SWISS-PROT," Computational Biology & Chemistry, vol.
27, no. 1, pp. 49-58, Feb. 2003.
[10] W. Fleischmann, S. Moller, A. Gateau, and R. Apweiler, "A novel
method for automatic functional annotation of proteins," Bioinformatics,
vol. 15, no. 3, pp. 228-233, Mar. 1999.
[11] R. Apweiler, A. Bairoch, C.H. Wu, W.C. Barker, B. Boeckmann, S.
Ferro, E. Gasteiger, H. Huang, R. Lopez, M. Magrane, M.J. Martin, D.A.
Natale, C. O-Donovan, N. Redaschi, and L.S. Yeh, "UniProt: the
Universal Protein knowledgebase," Nucleic Acids Research, vol. 32, no.
1, pp. D115-D119, Jan. 2004.
[12] C.H. Wu, R. Apweiler, A. Bairoch, D.A. Natale, W.C. Barker, B.
Boeckmann, S. Ferro, E. Gasteiger, H. Huang, R. Lopez, M. Magrane,
M.J. Martin, R. Mazumder, C. O-Donovan, N. Redaschi, and B. Suzek,
"The Universal Protein Resource (UniProt): an expanding universe of
protein information," Nucleic Acids Research, vol. 34, no. 1, pp. D187-
D191, Jan. 2006.
[13] A. Bairoch, R. Apweiler, C.H. Wu, W.C. Barker, B. Boeckmann, S.
Ferro, E. Gasteiger, H. Huang, R. Lopez, M. Magrane, M.J. Martin, D.A.
Natale, C. O-Donovan, N. Redaschi, and L.S. Yeh, "The Universal
Protein Resource (UniProt)," Nucleic Acids Research, vol. 33, no. 1, pp.
D154-D159, Jan. 2005.
[14] K.A. Snyder, H.J. Feldman, M. Dumontier, J.J. Salama, and C.W.
Hogue, "Domain-based small molecule binding site annotation,"
Bioinformatics, vol. 22, no. 3, pp. 291-296, Feb. 2006.
[15] L.B. Koski, M.W. Gray, B.F. Lang, and G. Burger, "AutoFACT: an
automatic functional annotation and classification tool," BMC
Bioinformatics, vol. 6, no. 1, rec. 151, Jun. 2005.
[16] C.E. Jones, U. Baumann, and A.L. Brown, "Automated methods of
predicting the function of biological sequences using GO and BLAST,"
BMC Bioinformatics, vol. 6, no. 1, rec. 272, Nov. 2005.
[17] A. Prlic, F.S. Domingues, P. Lackner, and M.J. Sippl, "WILMAautomated
annotation of protein sequences," Bioinformatics, vol. 20, no.
1, pp. 127-128, Jan. 2004.
[18] X. Yuan, Z.Z. Hu, H.T. Wu, M. Torii, M. Narayanaswamy, K.E.
Ravikumar, K. Vijay-Shanker, and C.H. Wu, "An online literature
mining tool for protein phosphorylation," Bioinformatics, vol. 22, no.
13, pp. 1668-1669, Jul. 2006.
[19] J.H. Chiang and H.C. Yu, "Literature extraction of protein functions
using sentence pattern mining," IEEE Trans. Knowledge & Data
Engineering, vol. 17, no. 8, pp. 1088-1098, Aug. 2005.
[20] C.J.A. Sigrist, E.D. Castro, P.S. Langendijk-Genevaux, V.L. Saux, A.
Bairoch, and N. Hulo, "ProRule: a new database containing functional
and structural information on PROSITE profiles," Bioinformatics, vol.
21, no. 21, pp. 4060-4066, Aug. 2005.
[21] G.X. Yu, "Ruleminer: a knowledge system for supporting highthroughput
protein function annotations," J. Bioinformatics &
Computational Biology, vol. 2, no. 4, pp. 595-617, Dec. 2004.[22] J. Morbach, A. Yang, and W. Marquardt, "OntoCAPEÔÇöa large-scale
ontology for chemical process engineering," Engineering Applications
Artificial Intelligence, vol. 20, no. 2, pp. 147-161, Mar. 2007.
[23] R.J. Williams, N.D. Martinez, and J. Golbeck, "Ontologies for
ecoinformatics," Web Semantics: Science, Services & Agents on the
World Wide Web, vol. 4, no. 4, pp. 237-242, Dec. 2006.
[24] M. Naphade, J.R. Smith, J. Tesic, S. Chang, W. Hsu, L. Kennedy, A.
Hauptmann, and J. Curtis, "Large-scale concept ontology for
multimedia," IEEE Multimedia, vol. 13, no. 3, pp. 86-91, Jul.-Sep. 2006.
[25] J. Köhler, S. Philippi, M. Specht, and A. R├╝egg, "Ontology based text
indexing and querying for the semantic web," Knowledge-Based
Systems, vol. 19, no. 8, pp. 744-754, Dec. 2006.
[26] C. Hess and C. Schlieder, "Ontology-based verification of core model
conformity in conceptual modeling," Computers, Environment & Urban
Systems, vol. 30, no. 5, pp. 543-561, Sep. 2006.
[27] D. Pérez-Rey, V. Maojo, M. Garc├¡a-Remesal, R. Alonso-Calvo, H.
Billhardt, F. Martin-Sánchez, and A. Sousa, "ONTOFUSION: ontologybased
integration of genomic and clinical databases," Computers in
Biology & Medicine, vol. 36, no. 7-8, pp. 712-730, Jul.-Aug. 2006.
[28] E. Camon, M. Magrane, D. Barrell, V. Lee, E. Dimmer, J. Maslen, D.
Binns, N. Harte, R. Lopez, and R. Apweiler, "The Gene Ontology
Annotation (GOA) database: sharing knowledge in Uniprot with gene
ontology," Nucleic Acids Research, vol. 32, no. 1, pp. D262-266, Jan.
2004.
[29] A. Lewin and I.C. Grieve, "Grouping gene ontology terms to improve
the assessment of gene set enrichment in microarray data," BMC
Bioinformatics, vol. 7, no. 1, rec. 426, Oct. 2006.
[30] X. Wu, L. Zhu, J. Guo, D.Y. Zhang, and K. Lin, "Prediction of yeast
protein-protein interaction network: insights from the gene ontology and
annotations," Nucleic Acids Research, vol. 34, no. 7, pp. 2137-2150,
Apr. 2006.
[31] Z. Cai, X. Mao, S. Li, and L. Wei, "Genome comparison using Gene
Ontology (GO) with statistical testing," BMC Bioinformatics, vol. 7, no.
1, rec. 374, Aug. 2006.
[32] B. Zheng, D.C. McLean, and X. Lu, "Identifying biological concepts
from a protein-related corpus with a probabilistic topic model," BMC
Bioinformatics, vol. 7, no. 1, rec. 58, Feb. 2006.
[33] The Gene Ontology Consortium, "The Gene Ontology (GO) project in
2006," Nucleic Acids Research, vol. 34, no. 1, pp. D322-D326, Jan.
2006.
[34] J. Lomax, "Get ready to GO! A biologist-s guide to the gene ontology,"
Briefings in Bioinformatics, vol. 6, no. 3, pp. 298-304, Sep. 2005.
[35] M.A. Harris, J. Lomax, A. Ireland, and J.I. Clark, "The gene ontology
project," in Encyclopedia Genetics, Genomics, Proteomics &
Bioinformatics, part 4, S. Subramaniam, Ed. New York: John Wiley &
Sons, 2005.
[36] M. Bada, R. Stevens, C. Goble, Y. Gil, M. Ashburner, J.A. Blake, J.M.
Cherry, M.A. Harris, and S. Lewis, "A short study on the success of the
gene ontology," J. Web Semantics, vol. 1, no. 2, pp. 235-240, Feb. 2004.
[37] The Gene Ontology Consortium, "The Gene Ontology (GO) database
and informatics resource," Nucleic Acids Research, vol. 32, no. 1, pp.
D258-D261, Jan. 2004.
[38] The Gene Ontology Consortium, "Creating the gene ontology resource:
design and implementation," Genome Research, vol. 11, no. 8, pp. 1425-
1433, Aug. 2001.
[39] The Gene Ontology Consortium, "Gene ontology: tool for the
unification of biology," Nature Genetics, vol. 25, no. 1, pp. 25-29, May
2000.
[40] D. Beckett. (2004, Feb. 10). RDF/XML syntax specification (revised)
[Online]. Available: http://www.w3.org/TR/rdf-syntax-grammar.
[41] J. Ye, L. Fang, H. Zheng, Y. Zhang, J. Chen, Z. Zhang, J. Wang, S. Li,
R. Li, L. Bolund, and J. Wang, "WEGO: a web tool for plotting GO
annotations," Nucleic Acids Research, vol. 34, no. 1, pp. W293-W297,
Jul. 2006.
[42] H.K. Lee, W. Braynen, K. Keshav, and P. Pavlidis, "ErmineJ: tool for
functional analysis of gene expression data sets," BMC Bioinformatics,
vol. 6, no. 1, rec. 269, Nov. 2005.
[43] H. Liu, Z.Z. Hu, and C.H. Wu, "DynGO: a tool for visualizing and
mining of gene ontology and its associations," BMC Bioinformatics, vol.
6, no. 1, rec. 201, Aug. 2005.
[44] S. Aitken, R. Korf, B. Webber, and J. Bard, "COBrA: a bio-ontology
editor," Bioinformatics, vol. 21, no. 6, pp. 825-826, Mar. 2005.[45] W. Cai, S. Chen, and D. Zhang, "Fast and robust fuzzy c-means
clustering algorithms incorporating local information for image
segmentation," Pattern Recognition, vol. 40, no. 3, pp. 825-838, Mar.
2007.
[46] M. Friedman, M. Last, Y. Makover, and A. Kandel, "Anomaly detection
in web documents using crisp and fuzzy-based cosine clustering
methodology," Information Sciences, vol. 177, no. 2, pp. 467-475, Jan.
2007.
[47] J.J. Huang, G.H. Tzeng, and C.S. Ong, "Marketing segmentation using
support vector clustering," Expert Systems with Applications, vol. 32, no.
2, pp. 313-317, Feb. 2007.
[48] W. Zhong, J. He, R. Harrison, P.C. Tai, and Y. Pan, "Clustering support
vector machines for protein local structure prediction," Expert Systems
with Applications, vol. 32, no. 2, pp. 518-526, Feb. 2007.
[49] G.P. Papamichail and D.P. Papamichail, "The k-means range algorithm
for personalized data clustering in e-commerce," European J.
Operational Research, vol. 177, no. 3, pp. 1400-1408, Mar. 2007.
[50] Z.S.H. Chan, L. Collins, and N. Kasabov, "An efficient greedy k-means
algorithm for global gene trajectory clustering," Expert Systems with
Applications, vol. 30, no. 1, pp. 137-141, Jan. 2006.
[51] A.C. Martinez-Estudillo, C. Hervas-Martinez, F.J. Martinez-Estudillo,
and N. Garcia-Pedrajas, "Hybridization of evolutionary algorithms and
local search by means of a clustering method," IEEE Trans. Systems,
Man & Cybernetics, Part B, vol. 36, no. 3, pp. 534-545, Jun. 2006.
[52] D.F. Rogers and S.S. Kulkarni, "Optimal bivariate clustering and a
genetic algorithm with an application in cellular manufacturing,"
European J. Operational Research, vol. 160, no. 2, pp. 423-444, Jan.
2005.
[53] J.Y. Hwang and W. Kuo, "Model-based clustering for integrated circuit
yield enhancement," European J. Operational Research, vol. 178, no. 1,
pp. 143-153, Apr. 2007.
[54] Y. Zeng and J. Garcia-Frias, "A novel HMM-based clustering algorithm
for the analysis of gene expression time-course data," Computational
Statistics & Data Analysis, vol. 50, no. 9, pp. 2472-2494, May 2006.
[55] H.M. Torres, J.A. Gurlekian, H.L. Rufiner, and M.E. Torres, "Selforganizing
map clustering based on continuous multiresolution entropy,"
Physica A: Statistical Mechanics & its Applications, vol. 361, no. 1, pp.
337-354, Feb. 2006.
[56] S. Mitra, H. Banka, and W. Pedrycz, "Rough-fuzzy collaborative
clustering," IEEE Trans. Systems, Man & Cybernetics, Part B, vol. 36,
no. 4, pp. 795-805, Aug. 2006.
[57] G. Peters, "Some refinements of rough k-means clustering," Pattern
Recognition, vol. 39, no. 8, pp. 1481-1491, Aug. 2006.
[58] E. Zio and P. Baraldi, "Evolutionary fuzzy clustering for the
classification of transients in nuclear components," Progress in Nuclear
Energy, vol. 46, no. 3-4, pp. 282-296, Apr. 2005.
[59] S.A. Mingoti and J.O. Lima, "Comparing SOM neural network with
fuzzy c-means, k-means and traditional hierarchical clustering
algorithms," European J. Operational Research, vol. 174, no. 3, pp.
1742-1759, Nov. 2006.
[60] G. Chicco, R. Napoli, and F. Piglione, "Comparisons among clustering
techniques for electricity customer classification," IEEE Trans. Power
Systems, vol. 21, no. 2, pp. 933-940, May 2006.
[61] H. Guldem─▒r and A. Sengur, "Comparison of clustering algorithms for
analog modulation classification," Expert Systems with Applications,
vol. 30, no. 4, pp. 642-649, May 2006.
[62] P.C.H. Ma, K.C.C. Chan, X. Yao, and D.K.Y. Chiu, "An evolutionary
clustering algorithm for gene expression microarray data analysis," IEEE
Trans. Evolutionary Computation, vol. 10, no. 3, pp. 296-314, Jun.
2006.
[63] M. Laszlo and S. Mukherjee, "A genetic algorithm using hyperquadtrees
for low-dimensional k-means clustering," IEEE Trans. Pattern
Analysis & Machine Intelligence, vol. 28, no. 4, pp. 533-543, Apr. 2006.
[64] W. Sheng, W. Swift, L. Zhang, and X. Liu, "A weighted sum validity
function for clustering with a hybrid niching genetic algorithm," IEEE
Trans. Systems, Man & Cybernetics, Part B, vol. 35, no. 6, pp. 1156-
1167, Dec. 2005.
[65] S. Bandyopadhyay, "Simulated annealing using a reversible jump
Markov chain Monte Carlo algorithm for fuzzy clustering," IEEE Trans.
Knowledge & Data Engineering, vol. 17, no. 4, pp. 479-490, Apr. 2005.
[66] C. Aykanat, B.B. Cambazoglu, F. Findik, and T. Kurc, "Adaptive
decomposition and remapping algorithms for object-space-parallel direct volume rendering of unstructured grids," J. Parallel & Distributed
Computing, vol. 67, no. 1, pp. 77-99, Jan. 2007.
[67] A. Duarte, Á. Sánchez, F. Fernández, and A.S. Montemayor, "Improving
image segmentation quality through effective region merging using a
hierarchical social metaheuristic," Pattern Recognition Letters, vol. 27,
no. 11, pp. 1239-1251, Aug. 2006.
[68] M. Boulif and K. Atif, "A new branch-&-bound-enhanced genetic
algorithm for the manufacturing cell formation problem," Computers &
Operations Research, vol. 33, no. 8, pp. 2219-2245, Aug. 2006.
[69] B.S. Mitchell and S. Mancoridis, "On the automatic modularization of
software systems using the Bunch tool," IEEE Trans. Software
Engineering, vol. 32, no. 3, pp. 193-208, Mar. 2006.
[70] J. Grabowski and J. Pempera, "The permutation flow shop problem with
blocking. A tabu search approach," Omega, vol. 35, no. 3, pp. 302-311,
Jun. 2007.
[71] M. Sun, "Solving the uncapacitated facility location problem using tabu
search," Computers & Operations Research, vol. 33, no. 9, pp. 2563-
2589, Sep. 2006.
[72] M.K. Tiwari, S. Kumar, Prakash, and R. Shankar, "Solving part-type
selection and operation allocation problems in an FMS: an approach
using constraints-based fast simulated annealing algorithm," IEEE
Trans. Systems, Man & Cybernetics, Part A, vol. 36, no. 6, pp. 1170-
1184, Nov. 2006.
[73] G. Attiya and Y. Hamam, "Task allocation for maximizing reliability of
distributed systems: a simulated annealing approach," J. Parallel &
Distributed Computing, vol. 66, no. 10, pp. 1259-1266, Oct. 2006.
[74] M. Moz and M.V. Pato, "A genetic algorithm approach to a nurse
rerostering problem," Computers & Operations Research, vol. 34, no. 3,
pp. 667-691, Mar. 2007.
[75] I.H. Toroslu and Y. Arslanoglu, "Genetic algorithm for the personnel
assignment problem with multiple objectives," Information Sciences,
vol. 177, no. 3, pp. 787-803, Feb. 2007.
[76] A.C. Zecchin, A.R. Simpson, H.R. Maier, M. Leonard, A.J. Roberts, and
M.J. Berrisford, "Application of two ant colony optimisation algorithms
to water distribution system optimization," Mathematical & Computer
Modelling, vol. 44, no. 5-6, pp. 451-468, Sep. 2006.
[77] P.Y. Yin and J.Y. Wang, "Ant colony optimization for the nonlinear
resource allocation problem," Applied Mathematics & Computation, vol.
174, no. 2, pp. 1438-1453, Mar. 2006.
[78] J.S. Heo, K.Y. Lee, and R. Garduno-Ramirez, "Multiobjective control of
power plants using particle swarm optimization techniques," IEEE
Trans. Energy Conversion, vol. 21, no. 2, pp. 552-561, Jun. 2006.
[79] S.H. Jacobson, L.A. McLay, S.N. Hall, D. Henderson, and D.E.
Vaughan, "Optimal search strategies using simultaneous generalized hill
climbing algorithms," Mathematical & Computer Modelling, vol. 43, no.
9-10, pp. 1061-1073, May 2006.
[80] L. You and S. Wood, "Assessing the spatial distribution of crop areas
using a cross-entropy method," Int-l J. Applied Earth Observation &
Geoinformation, vol. 7, no. 4, pp. 310-323, Dec. 2005.
[81] M.A. Arostegui Jr., S.N. Kadipasaoglu, and B.M. Khumawala, "An
empirical comparison of tabu search, simulated annealing, and genetic
algorithms for facilities location problems," Int-l J. Production
Economics, vol. 103, no. 2, pp. 742-754, Oct. 2006.
[82] S. Kannan, S.M.R. Slochanal, and N.P. Padhy, "Application and
comparison of metaheuristic techniques to generation expansion
planning problem," IEEE Trans. Power Systems, vol. 20, no. 1, pp. 466-
475, Feb. 2005.
[83] E. Elbeltagi, T. Hegazy, and D. Grierson, "Comparison among five
evolutionary-based optimization algorithms," Advanced Engineering
Informatics, vol. 19, no. 1, pp. 43-53, Jan. 2005.
[84] S. Kumar, S.H. Ong, S. Ranganath, and F.T. Chew, "A luminance- and
contrast-invariant edge-similarity measure," IEEE Trans. Pattern
Analysis & Machine Intelligence, vol. 28, no. 12, pp. 2042-2048, Dec.
2006.
[85] J.D. Clayden, M.E. Bastin, and A.J. Storkey, "Improved segmentation
reproducibility in group tractography using a quantitative tract similarity
measure," NeuroImage, vol. 33, no. 2, pp. 482-492, Nov. 2006.
[86] P. Paclik, J. Novovicova, and R.P.W. Duin, "Building road-sign
classifiers using a trainable similarity measure," IEEE Trans. Intelligent
Transportation Systems, vol. 7, no. 3, pp. 309-321, Sep. 2006.
[87] Y. Peng and C.W. Ngo, "Clip-based similarity measure for querydependent
clip retrieval and video summarization," IEEE Trans. Circuits
& Systems for Video Technology, vol. 16, no. 5, pp. 612-627, May 2006.[88] F. van der Meer, "The effectiveness of spectral similarity measures for
the analysis of hyperspectral imagery," Int-l J. Applied Earth
Observation & Geoinformation, vol. 8, no. 1, pp. 3-17, Jan. 2006.
[89] M. Popescu, J.M. Keller, and J.A. Mitchell, "Fuzzy measures on the
gene ontology for gene product similarity," IEEE/ACM Trans.
Computational Biology & Bioinformatics, vol. 3, no. 3, pp. 263-274,
Jul.-Sep. 2006.
[90] X. Chen, J. Tian, and X. Yang, "A new algorithm for distorted
fingerprints matching based on normalized fuzzy similarity measure,"
IEEE Trans. Image Processing, vol. 15, no. 3, pp. 767-776, Mar. 2006.
[91] S. Lee and M.M. Crawford, "Unsupervised multistage image
classification using hierarchical clustering with a bayesian similarity
measure," IEEE Trans. Image Processing, vol. 14, no. 3, pp. 312-320,
Mar. 2005.
[92] B. Moghaddam, C. Nastar, and A. Pentland, "A bayesian similarity
measure for deformable image matching," Image & Vision Computing,
vol. 19, no. 5, pp. 235-244, Apr. 2001.
[93] D. Skerl, B. Likar, and F. Pernus, "A protocol for evaluation of
similarity measures for rigid registration," IEEE Trans. Medical
Imaging, vol. 25, no. 6, pp. 779-791, Jun. 2006.
[94] H. N├║├▒ez, M. S├ánchez-Marrè, U. Cortés, J. Comas, M. Mart├¡nez, I.
Rodríguez-Roda, and M. Poch, "A comparative study on the use of
similarity measures in case-based reasoning to improve the classification
of environmental system situations," Environmental Modelling &
Software, vol. 19, no. 9, pp. 809-819, Sep. 2004.
[95] M. Kirac, G. Ozsoyoglu, and J. Yang., "Annotating proteins by mining
protein interaction networks," Bioinformatics, vol. 22, no. 14, pp. e260-
e270, Jul. 2006.
[96] S. Ray and M. Craven, "Learning statistical models for annotating
proteins with function information using biomedical text," BMC
Bioinformatics, vol. 6, no. 1, pp. rec. S18, May 2005.
[97] J.V. Ponomarenko, P.E. Bourne, and I.N. Shindyalov, "Assigning new
GO annotations to protein data bank sequences by combining structure
and sequence homology," Proteins, vol. 58, no. 4, pp. 855-865, Mar.
2005.
[98] R.M. Othman, S. Deris, R.M. Illias, Z. Zakaria, and S.M. Mohamad,
"Automatic clustering of gene ontology by genetic algorithm," Int'l J.
Information Technology, vol. 3, no. 1, pp. 37-46, Apr. 2006.
[99] R.M. Othman, S. Deris, R.M. Illias, H.T. Alashwal, R. Hassan, and F.
Mohamed, "Incorporating semantic similarity measure in genetic
algorithm: an approach for searching the gene ontology terms," Int'l J.
Computational Intelligence, vol. 3, no. 3, pp. 257-266, May 2006.
[100] R.M. Othman, S. Deris, and R.M. Illias, "UTMGO: a tool for searching a
group of semantically related gene ontology terms and application to
annotation of anonymous protein sequence," Int'l J. Biomedical
Sciences, vol. 1, no. 2, pp. 111-119, Jul. 2006.
[101] E. Takashima, Y. Murata, N. Shibata, and M. Ito, "Techniques to
improve exploration efficiency of parallel self-adaptive genetic
algorithms by dispensing with iteration and synchronization," Systems &
Computers in Japan, vol. 37, no. 14, pp. 25-33, Dec. 2006.
[102] Rahul, D. Chakraborty, and A. Dutta, "Optimization of FRP composites
against impact induced failure using island model parallel genetic
algorithm," Composites Science & Technology, vol. 65, no. 13, pp.
2003-2013, Oct. 2005.
[103] K. Katayama, H. Hirabayashi, and H. Narihisa, "Analysis of crossovers
and selections in a coarse-grained parallel genetic algorithm,"
Mathematical & Computer Modelling, vol. 38, no. 11-13, pp. 1275-
1282, Dec. 2003.