Direct protein sequencing is a crucial process in biochemistry that allows scientists to determine the amino acid sequence of proteins, which is essential for understanding their structure and function. The primary structure of a protein, which consists of its amino acid composition and sequence, dictates its higher-order structures and biological roles. To sequence proteins effectively, especially large ones that can contain hundreds to thousands of amino acids, it is necessary to cleave them into smaller peptide fragments.
The process begins with protein extraction from cells through cell lysis, leading to a mixture of proteins. Various purification techniques, such as ion exchange chromatography, size exclusion chromatography, and affinity chromatography, are employed to isolate the protein of interest. Once isolated, the protein can be subjected to different sequencing paths, depending on the available techniques.
One common method involves treating the protein with a chemical called FDNB (fluorenylmethyloxycarbonyl chloride) to identify the N-terminal amino acid residue. Following this, amino acid hydrolysis can be performed, which non-specifically cleaves all peptide bonds, resulting in free amino acids. This step provides information about the amino acid composition, which is the quantity and types of amino acids present, but does not reveal their sequence.
To obtain the actual sequence, further steps are necessary. The isolated protein can be subjected to chemical cleavage, where specific chemicals cleave particular peptide bonds, generating distinct peptide fragments. Alternatively, peptidases, which are enzymes that cleave specific peptide bonds, can be used to produce additional fragments. After generating these peptide fragments, they are separated using techniques like high-performance liquid chromatography (HPLC).
Once separated, the individual peptide fragments can be sequenced using methods such as Edman degradation, which requires the peptides to be cleaved into smaller fragments for accurate sequencing. This process ultimately reveals the one-letter amino acid codes for the residues, allowing for the complete sequencing of the protein.
As the course progresses, each of these techniques—FDNB treatment, amino acid hydrolysis, chemical cleavage, and peptidase action—will be explored in greater detail, providing a comprehensive understanding of direct protein sequencing. This overview serves as a roadmap for the upcoming lessons, guiding students through the intricate steps involved in determining protein sequences.