The primary structure of a protein is fundamentally defined by the composition and sequence of amino acids linked together in a polypeptide chain through peptide bonds. When free amino acids are covalently bonded, they transform into amino acid residues, which are the building blocks of proteins. The composition refers to the types and quantities of amino acids present, while the sequence denotes the specific order of these residues from the N-terminal end to the C-terminal end of the protein.
For instance, if a protein consists of six amino acids, including one each of valine, alanine, methionine, and phenylalanine, along with two glycines, the composition provides insight into the types and counts of these amino acids but does not reveal their arrangement. The sequence, however, is crucial as it dictates the protein's structure and function, akin to how the order of words in a sentence conveys meaning.
In terms of structure, the N-terminal end of a protein features a free amino group, typically represented as NH3+ at physiological pH, while the C-terminal end has a free carboxylate anion, denoted as COO-. Internal amino acid residues, however, do not possess free amino or carboxyl groups due to the formation of peptide bonds through dehydration synthesis, which involves the interaction of the amino group of one amino acid with the carboxyl group of another.
It is essential to understand that any alteration in the composition or sequence of amino acids will result in a change to the primary protein structure. This foundational knowledge of protein structure is critical for further exploration of protein function and interactions in biological systems.