The concept of the genetic code was initially proposed by George Gamow, suggesting a fundamental relationship between nucleotide sequences and the amino acids they encode. The actual deciphering of the genetic code was achieved through the groundbreaking work of Marshall Nirenberg, Heinrich Matthaei, and Har Gobind Khorana. Their research illuminated how sequences of amino acids in proteins are determined by the sequences of nucleotides in DNA or its RNA transcript.
Proteins are composed of 20 different amino acids, each specified by the genetic information encoded in DNA. During transcription, the genetic instructions in DNA are transcribed to mRNA, transferring the information through complementary nitrogenous base pairing. Each amino acid is represented by a specific sequence of nucleotides on the mRNA, known as a codon. Thus, a codon is a triplet of nucleotides on mRNA that specifies a particular amino acid. The entirety of these codons, comprising the mRNA molecule, forms the genetic code, which encompasses all necessary information for polypeptide chain synthesis.
The central challenge in understanding the genetic code involved determining the precise number of nucleotides within a codon required to specify each of the 20 different amino acids. Messenger RNA (mRNA) is composed of four types of nitrogenous bases: adenine (A), uracil (U), guanine (G), and cytosine (C). The complexity arose in figuring out how combinations of these four bases could encode 20 distinct amino acids.
A singlet code, where each codon is composed of a single nitrogen base, would produce only four possible codons (A, U, G, C). This arrangement is inadequate because it allows for only four amino acids to be specified, far less than the 20 needed.
Calculation: 41 =4 codons
A doublet code, with codons made up of two nitrogen bases, increases the number of possible codons to 16 (by considering all possible base pairs).
Calculation: 42=16 codons
However, 16 codons are still insufficient to represent all 20 amino acids.
Recognizing these limitations, George Gamow, in 1954, proposed the concept of a triplet code. In this model, each codon consists of three nitrogen bases, which significantly expands the coding capacity.
Triplet code calculation: 43=64 codons
With 64 possible codons, this system is more than adequate to encode all 20 amino acids. This realization was pivotal in understanding the genetic code's structure, providing a sufficient number of codons to not only represent each amino acid but also include start and stop signals necessary for protein synthesis.
There are three amino acids encoded by six different codons: serine, leucine, arginine. Only two amino acids are specified by a single codon; one of these is the amino-acid methionine, specified by the codon AUG, which also specifies the start of translation; the other is tryptophan, specified by the codon UGG. The degeneracy of the genetic code is what accounts for the existence of silent mutations.
Degeneracy results because a triplet code designates 20 amino acids and a stop codon. Because there are four bases, triplet codons are required to produce at least 21 different codes. For example, if there were two bases per codon, then only 16 amino acids could be coded for (4²=16). Because at least 21 codes are required, then 4³ gives 64 possible codons, meaning that some degeneracy must exist.
According to this hypothesis, the base in the first position of anticodon on tRNA is usually an abnormal base, like inosine, pseudouridine, tyrosine, etc. These abnormal bases are able to pair with more than one type of nitrogenous base in the third position of the codon on mRNA.
eg. Inosine (I) can pair with A, C, or U. This base is called a Wobble base or fluctuating base. Wobble occurs at position 1 of the anticodon and position 3 of the codon.
The wobble hypothesis states that the third position (3') of the codon on mRNA and the first position (5') of the anticodon on tRNA are bound less tightly than the other pair and therefore, offer unusual base combinations.
A single amino acid may be specified by many codons, i.e. called degeneracy. Degeneracy is due to the last base in a codon, which is known as a wobble base. Thus the first two codons are more important in determining the amino acid and the one differs without affecting the coding known as the Wobble hypothesis.
Thus according to this, in codon-anticodon pairing, the third base of the tRNA anticodon does not have to pair with a complementary codon. It is called a wobble base and this position is called the wobble position. The actual base pairing occurs in the first two positions only.
(Session 2025 - 26)