Rucete ✏ Lehninger Principles of Biochemistry In a Nutshell
9.3 Genomics and the Human Story
This chapter explores how genome sequencing reveals the molecular basis of human biology, evolution, and disease. It discusses the structure and composition of the human genome, comparative genomics with other species, the discovery of disease-related genes, and how genomic data illuminate human ancestry and migration.
The Expanding World of Genomics
• Since 2001, sequencing the human genome has become routine, with genomes of tens of thousands of species now available across Bacteria, Archaea, and Eukarya.
• Genome databases allow researchers to compare gene conservation and variation, map disease-related genes, and trace evolutionary history.
• Extinct species such as Homo neanderthalensis and ancient humans have also been sequenced, providing new insight into human evolution.
The Human Genome Contains Many Types of Sequences
• Humans have approximately 20,000 protein-coding genes—fewer than rice (38,000) and only slightly more than nematodes or flies.
• Eukaryotic genes contain both coding segments (exons) and noncoding segments (introns), with introns spliced out of precursor RNA.
• Exons often correspond to protein domains, allowing domain recombination to create diverse multidomain proteins.
• Alternative splicing increases protein diversity by generating multiple mRNAs from a single gene, particularly common in vertebrates.
• Less than 1.5% of human DNA is protein-coding, but when introns are included, about 30% of the genome consists of genes encoding proteins.
Transposons and Repeated DNA
• About half of the human genome is composed of transposons—mobile DNA elements that can move and duplicate within the genome.
• Transposons include DNA transposons and retrotransposons (which transpose via RNA intermediates), many of which are now inactive relics.
• Transposons have contributed to genome evolution by redistributing DNA segments and altering regulatory structures.
The ENCODE Project and Functional Genomics
• The ENCODE project (2003–present) found that over 80% of the human genome is transcribed or participates in chromatin regulation.
• Even noncoding regions play crucial roles in regulating gene expression and maintaining chromatin structure.
• Many single-nucleotide polymorphisms (SNPs) associated with genetic diseases occur in these regulatory noncoding regions.
Simple-Sequence Repeats and Human Variation
• About 3% of the genome consists of simple-sequence repeats (SSRs), found mainly in centromeres and telomeres (e.g., GGTTAG repeats).
• Short tandem repeats (STRs) — isolated repeat sequences — serve as key markers in forensic DNA analysis.
• Millions of single nucleotide polymorphisms (SNPs) produce individual genetic diversity (1 variation per 1,000 bp on average).
• Groups of SNPs inherited together form haplotypes, which serve as genetic markers for populations and help trace human ancestry.
Genome Sequencing and Comparative Genomics
• Human and chimpanzee genomes are 98.8% identical; base-pair differences of ~1.2% amount to over 30 million nucleotide changes.
• Larger genomic rearrangements (insertions, deletions, inversions, duplications) account for additional differences—about 4% overall.
• Human chromosome 2 resulted from the fusion of two ancestral primate chromosomes, reducing the count to 23 pairs.
• Outgroup comparisons (e.g., orangutans) help identify mutations unique to the human lineage.
Genes Defining Humanity
• Many human-specific traits arise not from new protein-coding genes but from regulatory changes affecting gene expression.
• Genes associated with brain development, such as glutamate dehydrogenase and noncoding RNAs like HAR1F, show signs of accelerated evolution.
• Gene duplication and changes in regulatory RNA expression likely contributed to the expansion and complexity of the human brain.
Genomics and Disease Gene Discovery
• Linkage analysis maps disease genes based on inheritance patterns of nearby polymorphisms across family pedigrees.
• Example: Early-onset Alzheimer disease was linked to mutations in the PS1 (presenilin-1) gene on chromosome 14 using linkage analysis.
• Once a region is identified, sequencing of affected and unaffected individuals narrows down the causal mutation.
• Genome databases accelerate discovery by integrating SNP maps, protein functions, and interaction data to identify candidate genes.
• Many genetic diseases involve single-gene mutations (monogenic), while others involve multiple genes or regulatory regions (digenic and polygenic).
Using Genome Data Beyond Humans
• These same approaches are applied to locate genes for traits or diseases in plants and animals.
• Comparative genomics enables breeders and researchers to enhance desired characteristics across species.
Genomics and Human Evolution
• Modern humans originated in Africa 250,000–350,000 years ago and began migrating out of Africa 100,000–120,000 years ago.
• As Homo sapiens expanded, they replaced other hominid species like Neanderthals and Denisovans, though some interbreeding occurred.
• Up to 5% of non-African human genomes derive from Neanderthals, and up to 6% of Melanesian/Australian DNA derives from Denisovans.
• Neanderthal DNA contributed to immune system diversity but also increased susceptibility to autoimmune diseases.
Neanderthal Genome Research
• Ancient DNA sequencing techniques, adapted from forensic science, allow recovery of Neanderthal DNA from bones and remains.
• Comparison with human and chimpanzee genomes identifies authentic Neanderthal sequences, distinct from contaminants.
• Mitochondrial DNA analyses show Neanderthals had unique haplotypes and diverged from humans about 700,000 years ago.
• Genome data confirm limited gene flow between Neanderthals and early humans until ~45,000 years ago.
Future of Genomics
• Falling sequencing costs make personal genomics practical for identifying inherited disease risks and tailoring medical treatments.
• Gene-editing technologies and synthetic genomics allow modification of genomes in bacteria, plants, and mammals.
• Human gene therapy is advancing through improved delivery systems, offering hope for curing inherited diseases.
• Genomics will profoundly shape future biology, medicine, and our understanding of human origins.
In a Nutshell
The human genome contains protein-coding genes, introns, transposons, and numerous regulatory and repetitive sequences. Comparative genomics clarifies both our evolutionary origins and the genetic causes of disease. Studies of Neanderthal and Denisovan genomes reveal ancient interbreeding and genetic exchange. Modern genomics not only reconstructs our past but also opens possibilities for personalized medicine, gene therapy, and deeper insight into what it means to be human.
