Having DNA is like having a library full of recipe books. But a library is useless if nobody reads the books. Gene expression is the process by which cells read specific genes and build the proteins they encode. It's the bridge between the static code in your DNA and the dynamic machinery of a living cell.
Understanding gene expression is critical for gene editing because when we edit a gene, we're ultimately trying to change what protein gets made — or whether it gets made at all.
The Two-Step Process
Gene expression follows two major steps:
Step 1: Transcription (DNA → mRNA)
Transcription happens in the cell's nucleus. An enzyme called RNA polymerase binds to a region just before a gene called the promoter, unwinds the DNA double helix, and reads one strand of the gene. As it moves along, it builds a complementary molecule called messenger RNA (mRNA).
The key differences between mRNA and DNA:
- mRNA is single-stranded (not a double helix)
- mRNA uses the base uracil (U) instead of thymine (T)
- mRNA is temporary — it's eventually degraded after use
Once the mRNA is built, it's processed (introns are removed, a cap and tail are added) and exported from the nucleus into the cytoplasm.
Step 2: Translation (mRNA → Protein)
In the cytoplasm, structures called ribosomes read the mRNA. They move along the mRNA three bases at a time, and each three-base unit (a codon) specifies one amino acid:
| Codon | Amino Acid |
|---|---|
| AUG | Methionine (start) |
| UUU | Phenylalanine |
| GCA | Alanine |
| UAA | Stop signal |
Small adapter molecules called transfer RNA (tRNA) carry the correct amino acid to the ribosome. The amino acids are linked together one by one to form a polypeptide chain, which folds into a functional protein.
A single mRNA molecule can be read by multiple ribosomes simultaneously, producing many copies of the same protein — like a popular book being photocopied for distribution.
Not All Genes Are On At Once
Here's the crucial insight: every cell in your body contains the same DNA, but different cells express different genes. A liver cell and a neuron have identical genomes, but they look and function differently because they've turned on different sets of genes.
This selective activation is called gene regulation, and it happens at multiple levels:
Transcription Factors
Transcription factors are proteins that bind to specific DNA sequences near a gene's promoter. Some are activators (they increase transcription), and some are repressors (they block it). The combination of transcription factors present in a cell determines which genes are expressed.
Enhancers and Silencers
Enhancers are DNA regions (sometimes far from the gene itself) that boost transcription when bound by specific proteins. Silencers do the opposite. These elements can be thousands of base pairs away from the gene they regulate, looping through 3D space to make contact.
Epigenetic Modifications
Cells can chemically modify DNA or the histone proteins that DNA wraps around, without changing the base sequence:
- DNA methylation: Adding methyl groups to cytosine bases typically silences a gene
- Histone acetylation: Adding acetyl groups to histones loosens the DNA packaging, making genes more accessible
- Histone methylation: Can either activate or silence genes depending on which amino acid is modified
These epigenetic marks are heritable during cell division — a liver cell divides into liver cells, not neurons — but they can also be changed, which is the basis of epigenetic editing.
When Gene Expression Goes Wrong
Errors in gene expression underlie many diseases:
- Cancer: Often involves genes being turned on when they shouldn't be (oncogenes) or turned off when they should be active (tumor suppressors)
- Genetic disorders: A mutation in a gene can produce a defective protein (like the misshapen hemoglobin in sickle cell disease) or no protein at all
- Autoimmune diseases: Immune system genes may be expressed inappropriately, causing the body to attack its own tissues
Why This Matters for Gene Editing
Gene editing technologies intervene at different points in this process:
- CRISPR-Cas9 cuts the DNA itself, disrupting a gene's ability to be transcribed. This is permanent.
- Base editing changes individual DNA bases without cutting, correcting point mutations that produce defective proteins.
- Prime editing can rewrite short stretches of DNA, fixing insertions, deletions, or substitutions.
- Epigenome editing modifies the chemical marks on DNA or histones, turning genes on or off without altering the sequence. This is potentially reversible.
- RNA editing modifies the mRNA after transcription, changing the protein produced without touching the DNA at all. This is temporary and wears off.
Each approach targets a different level of gene expression, and the right choice depends on the disease mechanism.
Key Takeaways
- Gene expression is the two-step process of transcription (DNA → mRNA) and translation (mRNA → protein)
- Not all genes are active in every cell — regulation determines which genes are expressed
- Transcription factors, enhancers, silencers, and epigenetic marks control gene activity
- Different gene editing technologies target different levels of gene expression
- Understanding gene expression helps explain why different editing approaches suit different diseases
Now that you understand both DNA structure and gene expression, you're ready to learn about the breakthrough technology that made precise gene editing practical: CRISPR.