The 20 Amino Acids and Their Role in Protein Structures

Each of the 20 most common amino acids has its specific chemical characteristics and its unique role in a protein structure and function. For example, based on the propensity of the side chain to be in contact with water, amino acids are classified as hydrophobic (low propensity to be in contact with water), polar and charged (energetically favorable contact with water). The charged amino acids include two basic, lysine and arginine (+ charge), and two acidic, aspartate and glutamate (- charge). Polar amino acids include serine and threonine (contain a hydroxyl group), asparagine and glutamine (contain amide group). Histidine is also a polar residue, although its behavior depends on the polarity of its environment. It has two –NH group with a pKa value of around 6. When both groups are protonated, the side chain has a charge of +1. However, the pKa may be modulated by the environment inside the protein, and when raised the side chain may give away a proton, loosing its positive charge and becoming neutral. By other words, histidine may easily give away and accept a proton, making it especially useful within enzyme active sites. The aromatic residues tyrosine and tryptophan and the non-aromatic methionine are often called amphipathic due to their ability to have both polar and non-polar character. These residues are often found close to the surface of proteins. The –OH group of tyrosine is able to both donate and accept a hydrogen bond. The side chains of histidine, tyrosine, phenylalanine and tryptophan are also able to form weak hydrogen bonds of the types, OH-π and CH-O, by other words using electron clouds within their ring structures. For discussion of OH-π, and CH-O type of hydrogen bonds see Scheiner et al., 2002.

The hydrophobic amino acids include alanine, valine, leucine, isoleucine, proline, phenylalanine, cysteine and methionine. These residues participate in van der Waals type of interactions. The classification above is based on the type of the amino acid side chain. However, glycine, being one of the common amino acids, does not have a side chain and for this reason it is not straightforward to assign it to one of the above classes. Generally, glycine is often found at the surface of proteins, within loop- or coil (without defined secondary structure) regions, providing high flexibility to the polypeptide chain at these locations. This suggests that it is rather hydrophilic. Proline, on the other hand, is generally non-polar and is mostly found buried inside the protein, although similarly to glycine, it is often found in loop regions. In contrast to glycine, proline provides rigidity to the polypeptide chain by imposing certain torsion angles on the segment of the structure. The reason for this is discussed in the section on torsion angles. Glycine and proline are often highly conserved within a protein family since they are essential for the conservation of a particular protein fold.
Below the 20 most common amino acids in proteins are listed with their three-letter and one-letter codes:

Charged (side chains often form salt bridges):
• Arginine - Arg - R
• Lysine - Lys - K
• Aspartic acid - Asp - D
• Glutamic acid - Glu - E

Polar (form hydrogen bonds as proton donors or acceptors):
• Glutamine - Gln - Q
• Asparagine - Asn - N
• Histidine - His - H
• Serine - Ser - S
• Threonine - Thr - T
• Tyrosine - Tyr - Y
• Cysteine - Cys - C

Amphipathic (often found at the surface of proteins or lipid membranes, sometimes also classified as polar):
• Tryptophan - Trp - W
• Tyrosine - Tyr - Y
• Methionine - Met - M (may function as a ligand to metal ions)

Hydrophobic (normally buried inside the protein core):
• Alanine - Ala - A
• Isoleucine - Ile - I
• Leucine - Leu - L
• Methionine - Met - M
• Phenylalanine - Phe - F
• Valine - Val - V
• Proline - Pro - P
• Glycine - Gly - G

Most protein molecules have a hydrophobic core, which is not accessible to solvent and a polar surface in contact with the environment (although membrane proteins follow a different pattern).
For a hydrogen bond to be formed, two electronegative atoms (for example in the case of an alpha-helix the amide N, and the carbonyl O) have to interact with the same hydrogen. The hydrogen is covalently attached to one of the atoms (called the hydrogen-bond donor), but interacts electrostatically with the other atom (the hydrogen bond acceptor, O). In proteins essentially all groups capable of forming H-bonds (both main chain and side chain, independently of whether the residues are within a secondary structure or some other type of structure) are usually H-bonded to each other or to water molecules. Due to their electronic structure, water molecules may accept 2 hydrogen bonds, and donate 2, thus being simultaneously engaged in a total of 4 hydrogen bonds. Water molecules may also be involved in the stabilization of protein structures by making hydrogen bonds with the main chain and side chain groups in proteins and even linking different protein groups to each other. In addition, water is often found to be involved in ligand binding to proteins, mediating ligand interactions with polar or charged side chain- or main chain atoms. It is useful to remember that the energy of a hydrogen bond, depending on the distance between the donor and the acceptor and the angle between them, is in the range of 2-10 kcal/mol.
The salt bridges formed by positively and negatively charged amino acids have also been found to be important for the stabilization of protein three-dimensional structure - for example proteins from thermophilic organisms (organisms that live at elevated temperatures, up to 80-90 C, or even higher) often have an extensive network of salt bridges on their surface, which contributes to the thermostability of these proteins, preventing their denaturation at high temperatures.

The preferred location of different amino acids in protein molecules can be quantitatively characterized by calculating the extent by which an amino acid is buried in the structure or exposed to solvent. Below you can see a figure showing the distribution of the different amino acids within protein molecules. While hydrophobic amino acids are mostly buried within the core, a smaller fraction of polar groups are found to be buried and charged residues are exposed to solvent to a much higher degree.

Fraction of buried amino acids

The vertical axis shows the fraction of highly buried residues, while the horizontal axis shows the amino acid names in one-letter code. Image from the tutorial by J.E. Wampler,