Gene expression
What Is Gene Expression?
Gene expression is the process by which the information encoded in a gene is used to produce a functional product, most commonly a protein. The process proceeds in two principal stages: transcription, in which a segment of DNA is copied into a messenger RNA (mRNA) molecule, and translation, in which the ribosome reads the mRNA sequence and assembles the corresponding sequence of amino acids into a protein. Together, these stages constitute the flow of genetic information from nucleic acid sequence to molecular function, a principle central to molecular biology. The NCBI Molecular Biology of the Cell chapter on DNA to RNA provides a detailed account of the molecular machinery involved.
Gene expression is not a fixed output of any given gene. Cells in a multicellular organism carry identical DNA, yet they differentiate into hundreds of distinct cell types with different morphologies and functions. This differentiation arises because different genes are expressed at different levels in different cell types, developmental stages, and environmental conditions. Understanding which genes are expressed when, where, and how strongly is a central problem of molecular biology, genomics, and biomedical engineering.
Transcription and Translation
Transcription is catalyzed by RNA polymerase, an enzyme that binds to a gene's promoter region, unwinds the double helix, and synthesizes a complementary RNA strand. In eukaryotic cells, this occurs in the nucleus; the resulting pre-mRNA is processed by splicing to remove non-coding intron sequences before export to the cytoplasm. Translation occurs in the cytoplasm, where ribosomes read the mRNA sequence in triplet codons and recruit transfer RNA (tRNA) molecules carrying the corresponding amino acids. Each codon specifies one amino acid according to the genetic code, and the ribosome polymerizes amino acids in sequence until a stop codon is reached, releasing the completed polypeptide.
In prokaryotes, the nuclear envelope is absent and transcription and translation are coupled: ribosomes begin translating an mRNA while RNA polymerase is still synthesizing it. This coupling, described in PMC research on transcription-translation coupling, enables faster regulatory responses and allows the translational machinery to influence transcriptional termination.
Regulation of Gene Expression
The rate of gene expression is controlled at multiple levels: transcriptional initiation is the most consequential, because it determines whether an mRNA is produced at all. Transcription factors are proteins that bind to specific DNA sequences near a gene's promoter and either activate or repress RNA polymerase binding. Signal transduction pathways transmit extracellular signals into the nucleus by modifying transcription factor activity, allowing cells to adjust gene expression in response to hormones, nutrients, stress, and developmental cues.
Post-transcriptional regulation includes control of mRNA splicing, mRNA stability, and translational efficiency. MicroRNAs (miRNAs) are short non-coding RNA molecules that bind to complementary sequences in mRNA and reduce either its stability or its translation, adding another layer of regulatory precision. Epigenetic mechanisms, including DNA methylation and histone modification, alter chromatin structure and thereby the accessibility of gene promoters without changing the DNA sequence itself. The NCBI bookshelf entry on gene expression and regulation in molecular biology summarizes these regulatory layers in an accessible reference format.
Measurement Technologies
High-throughput technologies have made it possible to quantify gene expression across an entire genome simultaneously. DNA microarrays measure the abundance of thousands of mRNA species by hybridization to spotted probes. RNA sequencing (RNA-seq) uses high-throughput short-read sequencing to count transcripts at single-nucleotide resolution and can detect novel transcripts, alternative splicing events, and allele-specific expression. Single-cell RNA-seq extends this capability to individual cells, enabling characterization of cell-type composition and developmental trajectories in complex tissues.
Applications
Gene expression has applications in a wide range of disciplines, including:
- Disease biomarker discovery and molecular diagnostics in cancer and infectious disease
- Drug target identification and pharmaceutical research
- Synthetic biology and metabolic engineering for biomanufacturing
- Agricultural biotechnology for developing disease-resistant and high-yield crops
- Neuroscience, where expression profiling maps gene activity across brain regions and developmental stages