Cluster Algorithm for Genetic Diversity

With the hardware technology advancing, the cost of storing is decreasing. Thus there is an urgent need for new techniques and tools that can intelligently and automatically assist us in transferring this data into useful knowledge. Different techniques of data mining are developed which are helpful for handling these large size databases [7]. Data mining is also finding its role in the field of biotechnology. Pedigree means the associated ancestry of a crop variety. Genetic diversity is the variation in the genetic composition of individuals within or among species. Genetic diversity depends upon the pedigree information of the varieties. Parents at lower hierarchic levels have more weightage for predicting genetic diversity as compared to the upper hierarchic levels. The weightage decreases as the level increases. For crossbreeding, the two varieties should be more and more genetically diverse so as to incorporate the useful characters of the two varieties in the newly developed variety. This paper discusses the searching and analyzing of different possible pairs of varieties selected on the basis of morphological characters, Climatic conditions and Nutrients so as to obtain the most optimal pair that can produce the required crossbreed variety. An algorithm was developed to determine the genetic diversity between the selected wheat varieties. Cluster analysis technique is used for retrieving the results.




References:
[1] Dias, Picoli, Rocha and Alfenas "A priori choice of hybrid parents in
plants", Genetics and Molecular Research. Vol. 12, 2004, pp 116-130.
[2] Fan, Jianhua and Li, Deyi "Overview of data mining and knowledge
discovery" Journal of Computer Science and Technology. 13 (4), 1998,
pp 348-368.
[3] Fayyad, Usama; Stolorz and Paul "Data mining and KDD: Promise and
challenges" Generation Computer Systems. 13 (2-3), 1997, pp. 99-115
[4] Jagdeep Singh "Development of Biotechnology Information System
using a Web Server" M.Tech Thesis PAU, Ludhiana, Punjab, India,
2002, pp 1-40.
[5] Manpreet Singh "Development of Data Mining model for bioinformatics
system" M.Tech Thesis PAU, Ludhiana, Punjab, India, 2003, pp 1-30.
[6] Raghavan, Vijay V.; Deogun, Jitender S. and Server, Hary "Introduction
to Data Mining" Journal of the American Society for Information
Science 49 (5), 1998, pp 397-402.