Exploring Phosphoproteomics: Methods and Applications

Author: Iva Colter

The proteome is the sequence identification and content analysis of all proteins in a living organism, but the information of PTM (post-translational modification) cannot be ignored. Phosphorylation proteins account for about 1/3 of all proteins in organisms, so phosphorylation has become one of the types of protein PTMs that has received the most attention.

Common phosphorylated protein identification methods

Western-blot

The proteins that are separated by SDS-PAGE gel electrophoresis are moved from the PAGE gel to the solid phase carrier. The primary antibody is used as a probe and is mixed with a labeled secondary antibody. Then, a chromogenic reagent is used to develop the image. Based on how specific the binding between antigen and antibody is, only the phosphorylated protein will show up as a band at the correct molecular mass.

Features: Strong specificity and high resolution, but only semi-quantitative, and now mostly used to verify mass spectrometry results.

Disadvantages: It is difficult to identify new sites and difficult to detect multiple phosphorylation sites of proteins due to the difficulty of preparing specific phosphorylated antibodies.

Isotope labeling

Principle: Using radioisotope 32P-labeled phosphate as the phosphate group donor, the phosphate group with 32P labeling is transferred to the substrate protein under kinase catalysis and then separated by gel electrophoresis, and the phosphorylated protein is detected by radioactive autoradiography.

Features: It can detect protein phosphorylation sites and single protein phosphorylation.

Disadvantages: Highly radioactive and dangerous.

Mass Spectrometry

Principle: The mass spectrometer can ionize the analyzed sample into charged ions. These ions are separated in space or time under the action of an electric or magnetic field. After detection, the mass-to-charge ratio (m/z) and relative intensity of the mass spectrum are obtained. To calculate the mass of the molecules in the analyte, the mass spectrogram can be used for qualitative analysis of the analyzed substances, and quantitative analysis can be done according to the ionic strength. Since the molecular weight of the phosphorylated polypeptide will increase by 79.9663 Da, the sequence-level phosphorylation modification site can be inferred from other fragment ion signals.

Features: High throughput; thousands of proteins can be obtained simultaneously for qualitative and quantitative information, and multiple phosphorylation modification sites and levels of the same protein can be determined.

Disadvantages: High technical difficulty, relying on a professional platform, and rich knowledge of mass spectrometry.

Phosphorylated Protein Identification Method

Since protein phosphorylation is a dynamic process, it is not easy to detect low abundances of phosphorylated proteins at low levels in cells, and high levels of phosphorylated peptides not only inhibit phosphorylated peptide ionization but also obscure phosphorylated peptide signals. Therefore, effective enrichment of purified phosphorylated proteins becomes critical before performing mass spectrometry analysis.

Phosphorylated protein enrichment method

Based on how much they can enrich and how much they cost, immobilized metal ion affinity chromatography (IMAC) and titanium dioxide affinity chromatography (TiO2) are the most common ones.

IMAC principle: IMAC is mainly composed of three parts: metal ions, chelating agents, and matrix carriers. Under acidic conditions, the chelating agent in the stationary phase forms a coordination compound with the metal ion, and the metal ion is immobilized on it. The positively charged metal ion combines with the negatively charged phosphate group to adsorb the phosphorylated peptide segment.

Principle of TiO2: TiO2 belongs to a metal oxide with strong enrichment ability in metal oxide affinity chromatography (MOAC), although both MOAC and IMAC are based on the electrostatic interaction between metal ions and phosphorylated peptides to purify and enrich phosphorylation proteins. However, due to the strong interaction of metal atoms and oxygen atoms and the fact that metal atoms are difficult to lose, TiO2 is widely used.

Phosphorylated Protein Quantitative Methods

The phosphorylated peptides that have been enriched are separated by chromatography, and mass spectrometry can be used to find them. Commonly used mass spectrometry quantification methods include TMT, LFQ and DIA.

TMT quantification

TMT is a quantitative technique that adds isotope labeling. It can label and analyze 16 samples very accurately while doing both at the same time. In addition to the identification of the whole proteome, TMT can also be used for the study of PP. However, each labeling reagent can label a maximum of 100ug of peptides. In order to achieve an ideal enrichment efficiency, it is generally recommended that the initial amount of protein phosphorylation modification enrichment should not be less than 1 mg, and if the TMT labeling quantitative method is used, the number of samples should be at least 10. Therefore, the TMT-labeled quantification method requires a high sample volume and is more expensive, while non-labeled quantification is more suitable for large-scale assays.

LFQ

Compared with TMT-labeled quantification, non-labeled quantification is simpler to operate, has less sample loss, and can quantify more proteins in one experiment, but it is heavily dependent on the stability of the mass spectrometer. The traditional label-free quantification method is LFQ (label-free), which relies on the data acquisition mode of DDA, while phosphopeptide isomers are difficult to sample and assign with DDA. Systematic and reproducible analysis of phosphorylation sites in a large number of samples is challenging.

DIA

Can DIA acquisition mode be used to analyze phosphopeptides?

Since the DIA data acquisition window is bigger and the spectra are more complicated, it takes a higher level of spectral processing to correctly find phosphorylation sites in mixed spectra and to correctly analyze phosphopeptide isomers.

In February 2020, a research team from Denmark overcame this technical problem and published an article in Nature Communications with a full workflow for DIA and direct DIA (DIA without DDA library building) to find phosphorylated proteins. The research team enriched phosphorylated peptides in wild-type yeast cells by TiO2 and then quantified them by DDA, DIA, and dDIA, respectively. The optimized DIA method was found to quantify up to twice as many phosphorylated peptides as DDA, and dDIA identified more than DDA with better reproducibility. Yeast phosphopeptide was mixed with HeLa cell phosphopeptide samples in different proportions and then quantified separately. The results showed that the peptides quantified by DIA and dDIA also increased proportionally with the increase in concentration, and their accuracy was better. DIA and DDA have comparable error rates, but DIA is superior to DDA in locus coverage, sensitivity, and dynamic range.