Vector Space of the Extended Base-triplets over the Galois Field of five DNA Bases Alphabet

A plausible architecture of an ancient genetic code is derived from an extended base triplet vector space over the Galois field of the extended base alphabet {D, G, A, U, C}, where the letter D represent one or more hypothetical bases with unspecific pairing. We hypothesized that the high degeneration of a primeval genetic code with five bases and the gradual origin and improvements of a primitive DNA repair system could make possible the transition from the ancient to the modern genetic code. Our results suggest that the Watson-Crick base pairing and the non-specific base pairing of the hypothetical ancestral base D used to define the sum and product operations are enough features to determine the coding constraints of the primeval and the modern genetic code, as well as the transition from the former to the later. Geometrical and algebraic properties of this vector space reveal that the present codon assignment of the standard genetic code could be induced from a primeval codon assignment. Besides, the Fourier spectrum of the extended DNA genome sequences derived from the multiple sequence alignment suggests that the called period-3 property of the present coding DNA sequences could also exist in the ancient coding DNA sequences.





References:
[1] L. E. Orgel, Prebiotic Chemistry and the Origin of the RNA World.
Critical Reviews in Biochemistry and Molecular Biology 39 (2004) 99-
123,.
[2] J. A. Piccirilli, Krauch T., Moroney, S. E. & Benner, S. A. Nature 343
(1990) 33-37.
[3] C. Switzer, S. E. Moroney. and S. A. Benner, Enzymatic incorporation
of a new base pair into DNA and RNA. J. Am. Chem. Sot. I I I (1989)
8322 -8323.
[4] A. Rich, in Horizons in Biochemistry, eds Kasha, M. & Pullman, B.
(Academic, New York) (1962) 103-126.
[5] V. M. Kolb, Dworkin, J. P. & Miller, S. L. J. Mol. Evol. 38 (1994) 549-
5573.
[6] M. Levy and M. L. Stanley, The stability of the RNA bases: Implications
for the origin of life. Proc. Natl. Acad. Sci. USA 95 (1998) 7933-7938
[7] R.. Sánchez, E. Morgado and R. Grau, Gene algebra from a genetic code
algebraic structure. J. Math. Biol. 51 (2005) 431 - 457
[8] Sánchez R., Grau R., Morgado E. A Novel Lie Algebra of the Genetic
Code over the Galois Field of Four DNA Bases. Mathematical
Biosciences 202 (2006) 156-174
[9] R. Sánchez, R. Grau, A Novel Algebraic Structure of the Genetic Code
over the Galois Field of Four DNA Bases. Acta Biotheoretica 54 (2006)
27-42
[10] F. Crick, Codon-anticodon pairings: the wobble hypothesis, J. Mol. Biol.
19 (1966) 548.
[11] C. Guthrie and J. Abelson, Organization and expression of tRNA genes
in Saccharomyces cerevisiae, in The Molecular Biology of the Yeast
Saccharomyces: Metabolism and Gene Expression (eds. Strathern J., et
al. ), New York: Cold Spring Harbor Laboratory Press, (1982) 487.
[12] W. Xiyin, S. Xiaoli and H. Bailin, The transfer RNA genes in Oryza
sativa L. ssp. Indica. Science in China C. 45 (2006) 504-511.
[13] S. Ohno and J. T. Epplen, The primitive code and repeats of base
oligomers as the primordial protein-encoding sequence. Proc. Natd
Acad. Sci. USA 80 (1983) 3391-3395
[14] R. Shapiro, Prebiotic cytosine synthesis: A critical analysis and
implications for the origin of life. Proc. Natl. Acad. Sci. USA 96 (1999)
4396-4401,
[15] C.R. Woese and G. E. Fox. The concept of cellular evolution. J. Mol.
Evol. 10 (1962) 1-6.
[16] H.. Hartman and A. Fedorov, The origin of the eukaryotic cell: A
genomic investigation. Proc. Natl. Acad. Sci. USA, 99 (2002) 1420-
1425
[17] J.W. Fickett. Recognition of protein coding regions in DNA sequences.
Nucleic Acids Res. 10 (1982) 5303-5318
[18] A. A. Tsonis, J. B. Elsner and P.A. Tsonis,.Periodicity in DNA coding
sequences: implications in gene evolution. J. Theor. Biol. 151 (1991)
323
[19] T. Shrish, S. Ramachandran, A. Bhattacharya, S. Bhattacharya and R.
Ramaswamy, Prediction of probable genes by Fourier analysis of
genomic sequences. CABIOS, 13 (1997) 263-270,