Lesson 5 of 34 · Basic Molecular Theory

DNA Replication

DNA Replication

Overview

Before a cell divides, it must copy its entire genome so that each daughter cell receives a complete, accurate set of instructions. DNA replication is the process that performs this copy. It is also the conceptual foundation of the central dogma — the flow of information from DNA to RNA to protein — and the first of those processes we examine in this module.

This lesson assumes the structure of DNA from the nucleic-acid-chemistry module: the antiparallel double helix, the sugar-phosphate backbone, the 5’ and 3’ ends of each strand, and complementary base pairing (A with T, G with C). Those features are not re-taught here; they are the rules that make faithful copying possible. The same enzymatic requirements introduced below also explain why the polymerase chain reaction (PCR) works the way it does — a connection developed at the end of this lesson and revisited when PCR is covered as a technique.

Semiconservative Replication

DNA is copied semiconservatively: each strand of the parent double helix serves as a template for a new complementary strand, so every daughter molecule contains one old (conserved) strand and one newly synthesized strand 1. Because the two strands carry complementary sequences, either one fully specifies the other. This is the deep reason replication can be accurate at all — the information is stored twice, redundantly, in the pairing rules themselves.

Origins and the Replication Fork

Replication does not begin randomly. It starts at specific sequences called origins of replication, where initiator proteins open the helix and recruit the replication machinery 1. Large eukaryotic chromosomes contain many origins so the whole genome can be copied within the time available; bacterial chromosomes typically use a single origin.

From each origin the two strands separate to form a replication fork — a Y-shaped junction where double-stranded DNA is unwound into two single-stranded templates. Forks usually proceed in both directions away from an origin, forming a replication bubble that grows as synthesis advances.

                  replication fork
                        |
   5' ==================+--------------------- 3'   parent strand
                         \  3'              (single-stranded templates)
                          \____ leading strand synthesis 5'->3' ->
                          ____  lagging strand (Okazaki fragments)
                         /  <- 5'
   3' ==================+--------------------- 5'   parent strand
                        |
                  (helicase unwinds here)

The Players at the Fork

Replication is carried out by a coordinated set of enzymes and proteins. Each solves a specific physical problem created by copying a long, double-stranded, antiparallel molecule 2.

  • Helicase unwinds the parent duplex at the fork, breaking the hydrogen bonds between paired bases to expose single-stranded templates.
  • Single-strand binding proteins (SSBs) coat the exposed single strands, keeping them from re-annealing or folding back on themselves until they are copied.
  • Topoisomerase relieves the torsional strain (supercoiling) that builds up ahead of the fork as the helix is unwound, nicking and resealing the DNA to let it rotate.
  • Primase synthesizes a short RNA primer, because DNA polymerase cannot start a strand from scratch — it can only extend an existing base-paired 3’ end.
  • DNA polymerase adds deoxyribonucleotides to the growing strand, reading the template and obeying base-pairing rules.
  • DNA ligase seals the breaks left between newly made segments, joining them into one continuous strand.

What DNA Polymerase Requires

DNA polymerase is the central enzyme, and its constraints shape the entire process 2. It requires, without exception:

  • a template strand to read,
  • a primer with a free 3’-hydroxyl to extend (it cannot initiate synthesis),
  • a supply of the four dNTPs (dATP, dGTP, dCTP, dTTP) as building blocks, and
  • synthesis strictly in the 5’ to 3’ direction, adding each new nucleotide to the 3’ end of the growing strand.

These four requirements — template, primer, dNTPs, and 5’->3’ extension — are not incidental. They are exactly the ingredients an experimentalist supplies to copy DNA in a test tube, a point returned to below.

Leading and Lagging Strands

Because DNA polymerase synthesizes only 5’->3’ and the two template strands are antiparallel, the two new strands cannot be made the same way 1.

On the template oriented so that synthesis follows the fork as it opens, the new leading strand is made continuously in one long stretch, needing only a single priming event.

On the opposite template, synthesis must run away from the fork. The new lagging strand is therefore built discontinuously, in short pieces called Okazaki fragments, each begun by its own RNA primer. As the fork advances and exposes more template, primase lays down another primer and polymerase fills in the next fragment. The RNA primers are later removed and replaced with DNA, and DNA ligase joins the fragments into a continuous strand by sealing the phosphodiester bonds between them.

Fidelity and Proofreading

A genome copied carelessly would accumulate errors every generation, so accuracy is essential. DNA polymerase achieves high fidelity in two ways 2. First, correct base pairing is strongly favored at the moment of nucleotide addition. Second, many DNA polymerases possess a 3’ to 5’ proofreading exonuclease activity: when a mismatched nucleotide is inserted, the enzyme can reverse direction, excise the incorrect base from the 3’ end, and then resume synthesis. This proofreading lowers the error rate by orders of magnitude beyond base selection alone, and additional repair pathways correct most of what remains 1.

The distinction between the 5’->3’ polymerase activity and the 3’->5’ exonuclease (proofreading) activity is worth holding onto: they run in opposite directions, and whether a given polymerase has proofreading activity matters in the laboratory, where high-fidelity enzymes are chosen when copying accuracy is critical.

The End-Replication Problem and Telomerase

Linear chromosomes face a structural problem at their ends. Because lagging-strand synthesis always needs a primer, once the terminal RNA primer is removed there is no upstream 3’ end available for polymerase to extend, leaving a short stretch unreplicated. Each round of replication would therefore shorten the chromosome — the end-replication problem 1.

Cells that must divide indefinitely address this with telomerase, an enzyme that carries its own RNA template and uses it to extend the repetitive telomere sequences at chromosome ends, offsetting the loss. At a high level, telomerase preserves chromosome length across many divisions; most ordinary somatic cells have little telomerase activity, which contributes to the gradual shortening of telomeres over a cell’s lifespan.

The Laboratory Payoff

The requirements of DNA polymerase are not just biochemical trivia — they are the basis of the most important technique in the molecular laboratory. To copy DNA in vitro, an experimentalist need only supply what the enzyme demands: a template to read, short primers that define where copying begins, the four dNTPs, and a DNA polymerase to extend 5’->3’ 3.

This is precisely what the polymerase chain reaction (PCR) does. PCR uses heat to separate the strands (taking over the role of helicase), then lets synthetic primers base-pair to chosen sites, and lets a heat-stable polymerase extend them. Repeating the cycle doubles the target each time, amplifying a region of interest from vanishingly small amounts of starting material. Every element of PCR maps directly onto a requirement introduced in this lesson — which is why replication is the right place to begin. PCR itself is developed in detail when amplification techniques are covered.

Summary

DNA replication is semiconservative, begins at origins, and proceeds at replication forks driven by helicase, SSBs, and topoisomerase. Primase lays down RNA primers; DNA polymerase extends them 5’->3’ using a template and dNTPs, with 3’->5’ proofreading for fidelity. The leading strand is continuous; the lagging strand is assembled from Okazaki fragments joined by DNA ligase. Linear chromosome ends are maintained by telomerase. The enzyme’s fixed requirements — template, primer, dNTPs, and 5’->3’ extension — are exactly what PCR exploits to copy DNA in the laboratory.

References

  1. Bruce Alberts, Rebecca Heald, Alexander Johnson, David Morgan, Martin Raff, Keith Roberts, Peter Walter. Molecular Biology of the Cell. 7th ed. W. W. Norton & Company. 2022. verified
  2. David L. Nelson, Michael M. Cox, Aaron A. Hoskins. Lehninger Principles of Biochemistry. 8th ed. W. H. Freeman (Macmillan Learning). 2021. verified
  3. Lela Buckingham. Molecular Diagnostics: Fundamentals, Methods, and Clinical Applications. 3rd ed. F.A. Davis Company. 2019. verified