Exploring Protein Function on the Scale of Cells or Whole Organisms

Rucete ✏ Lehninger Principles of Biochemistry In a Nutshell

9.2 Exploring Protein Function on the Scale of Cells or Whole Organisms

This chapter explains how scientists explore protein function at molecular, cellular, and organismal levels. It presents the use of comparative genomics, transcriptomics, proteomics, imaging, interaction assays, mutagenesis, and genome-editing techniques such as CRISPR/Cas9 to reveal when, where, and how proteins function. Modern large-scale screening approaches and bioinformatics tools are also discussed.


Levels of Protein Function

Phenotypic function: Describes the observable effects of a protein on an organism (e.g., growth rate, morphology, or lethality when missing).

Cellular function: Defines the network of interactions between proteins within a cell and their participation in metabolic or signaling pathways.

Molecular function: Describes the precise biochemical activity, including catalysis, substrate binding, or receptor–ligand interactions.

• Understanding these levels requires genomic-scale tools to determine when proteins are expressed, where they localize, and which cellular processes they affect.

Comparative Genomics and Sequence Relationships

Comparative genomics uses genome sequences to infer gene and protein functions based on evolutionary conservation and sequence similarity.

Genome annotation identifies genes and predicts their functions using homology to known genes. Tools such as BLAST (NCBI) and Ensembl help assign functions to unknown genes.

Orthologs are genes in different species that share common ancestry and usually similar functions; paralogs are duplicated genes within the same genome that may acquire new functions.

Synteny — conservation of gene order between related species — provides evidence for orthologous relationships.

• Detection of conserved sequence motifs (e.g., ATP-binding sites, zinc fingers) or enzyme active-site residues helps predict biochemical function.

When and Where a Protein Is Present

• The location and timing of protein expression give clues to its function. Techniques like RNA sequencing (RNA-Seq), mass spectrometry, and fluorescence microscopy provide this information.

RNA-Seq and Transcriptomics

• RNA-Seq determines which genes are transcribed under specific conditions. RNA is isolated, fragmented, reverse-transcribed into cDNA, and sequenced.

• The method quantifies transcript abundance and detects all RNA types, including mRNAs and noncoding RNAs.

Single-cell RNA-Seq (scRNA-Seq) allows mapping of transcriptional profiles at cellular resolution, revealing heterogeneity within tissues or tumors.

• Gene expression patterns derived from RNA-Seq can diagnose diseases such as diabetes or cancer and reveal regulatory networks.

Proteomics and Mass Spectrometry

Mass spectrometry defines the proteome — the full set of proteins expressed by a cell — and quantifies protein abundance under various conditions.

• It provides information about posttranslational modifications, helping assess protein activity and regulatory states.

Fusion Proteins and Immunofluorescence

Green fluorescent protein (GFP) tagging allows visualization of protein location in living cells. GFP, derived from Aequorea victoria, fluoresces when exposed to blue light.

GFP fusion proteins enable real-time tracking of protein movement and localization. Variants emit different colors for multi-protein studies.

Immunofluorescence detects native or tagged proteins using antibodies linked to fluorescent dyes. Indirect staining (primary + fluorescent secondary antibody) amplifies signal intensity.

Epitope tags (short peptide sequences) allow standardized antibody recognition of fusion proteins.

Protein–Protein Interaction Analysis

Immunoprecipitation and Tandem Affinity Purification

Immunoprecipitation isolates protein complexes using antibody binding to an epitope-tagged target. Co-precipitated partners reveal interaction networks.

Tandem affinity purification (TAP) employs two tags (e.g., protein A and calmodulin-binding peptide) and two purification steps to minimize nonspecific contaminants.

Yeast Two-Hybrid System

• Based on the Gal4 transcription factor of yeast, this system detects protein–protein interactions in vivo.

• Two fusion constructs are made: one linking the DNA-binding domain (DBD) to a “bait” protein, and another linking the activation domain (AD) to a “prey” protein.

• Interaction between bait and prey reconstitutes Gal4 activity, activating transcription of a reporter gene (e.g., enzyme for color change or growth selection).

• Large-scale yeast two-hybrid screens identify potential interacting partners across the proteome, though false positives can occur.

CRISPR/Cas9 and Gene Disruption Studies

• Gene deletion or modification reveals how loss or change of a protein affects cellular function. CRISPR/Cas9 has revolutionized this process.

Mechanism of CRISPR/Cas9

• Derived from bacterial antiviral defense systems, CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) works with Cas proteins to target and cleave DNA.

• The Cas9 protein binds to a single guide RNA (sgRNA) designed to complement the target DNA sequence.

• Cas9 introduces a double-strand break, repaired by nonhomologous end joining (NHEJ) or, if a donor template is supplied, by homologous recombination for precise editing.

• Modified Cas9 variants can create single-strand nicks or fuse with transcription activators/repressors to regulate gene expression without cutting DNA.

Applications of CRISPR/Cas9

• CRISPR enables gene knockouts, point mutations, or activation/repression of specific genes.

• It is used in genetic screens, gene therapy trials (e.g., muscular dystrophy, retinal disorders), agriculture, and pest control research.

• Off-target cleavage remains a major technical concern.

Box 9-1: Gene Drives for Pest Control

Gene drives exploit CRISPR/Cas9 to bias inheritance of engineered traits through populations, rapidly spreading mutations.

• The X-shredder system destroys X chromosomes in male germ cells, producing only male offspring and potentially collapsing invasive species populations.

• Though promising, ecological risks and unintended spread require careful containment and ethical consideration.

High-Throughput Genetic Screening

• Genetic screens identify genes affecting specific biological processes by systematically perturbing genes across a population.

• In CRISPR-based screens, cells are infected with a library of sgRNAs targeting different genes, typically via lentiviral vectors.

• Each sgRNA acts as a barcode, allowing tracking by sequencing after selection for survival or death under stress conditions.

• Cas9 variants determine the effect: standard Cas9 knocks out genes, while catalytically inactive forms fused to activators/repressors modulate expression.

In a Nutshell

Protein function can be explored at three levels: phenotypic, cellular, and molecular. Comparative genomics and conserved motifs help infer function, while transcriptomics (RNA-Seq) and proteomics (mass spectrometry) reveal expression patterns. Fusion proteins and immunofluorescence visualize localization, and protein–protein interaction assays (immunoprecipitation, TAP, yeast two-hybrid) map interaction networks. CRISPR/Cas9 enables targeted genome editing and functional gene screens, offering unparalleled control for studying gene and protein roles across cells and organisms.

Post a Comment

Previous Post Next Post