Lesson 3 of 34 · Nucleic Acid Chemistry
DNA-Associated Proteins and Chromatin
The double helix described in the previous lessons does not float freely inside a cell. In a human nucleus, roughly two meters of DNA must fit into a compartment only about six micrometers across — a packing problem of extraordinary scale. The cell solves it by wrapping and folding DNA around proteins into a compact, organized complex called chromatin. That packaging is not merely storage: how tightly a region is folded controls whether the genes it contains can be read, and the proteins that bind DNA are the same molecules that replicate, transcribe, and repair it. This lesson introduces those DNA-associated proteins and the levels of chromatin organization, building on the structure and base-pairing concepts already covered.
The Packaging Problem
The numbers make the challenge concrete. A single human diploid cell contains about 6.4 billion base pairs of DNA divided among 46 chromosomes. Laid end to end, that DNA would stretch roughly two meters, yet it is confined to a nucleus thousands of times smaller in diameter 1. Compaction on this order — many thousand-fold — cannot be achieved by simply coiling a bare strand. The negatively charged phosphodiester backbone repels itself, so the cell uses positively charged proteins to neutralize that charge and organize the fiber. The result must be compact enough to fit and stable enough to protect the genome, yet still allow access to the specific sequences a cell needs at any given moment.
Histones and the Nucleosome
The principal packaging proteins are the histones, a family of small, highly basic proteins rich in the positively charged amino acids lysine and arginine. That positive charge lets them bind tightly to the negatively charged DNA backbone 2. Histones are among the most evolutionarily conserved proteins known, reflecting how essential their packaging role is.
Four histone types form the structural core: H2A, H2B, H3, and H4. Two copies of each assemble into a histone octamer, and about 147 base pairs of DNA wrap nearly twice around this octamer to form a nucleosome, the fundamental repeating unit of chromatin 1. A fifth histone, the linker histone H1, sits outside the core. It binds where DNA enters and exits the nucleosome and along the short stretch of “linker” DNA between adjacent nucleosomes, helping to clamp and organize them.
Under the electron microscope, a string of nucleosomes on extended DNA has a characteristic appearance often described as “beads on a string,” where each bead is a nucleosome and the string is the linker DNA connecting them:
linker DNA linker DNA
----------( O )----------( O )----------( O )----------
^histone octamer wrapped by ~147 bp of DNA
(each "bead" = one nucleosome)
This beads-on-a-string fiber, roughly 10 nanometers wide, is the first and least condensed level of packaging.
Levels of Chromatin Compaction
Packaging proceeds through successive levels, each more condensed than the last. The 10-nanometer beads-on-a-string fiber is folded and coiled — aided by H1 and other proteins — into a thicker fiber about 30 nanometers across. This fiber is in turn arranged into large looped domains anchored to a protein scaffold, and those loops are folded and coiled still further 1. The most extreme condensation occurs during cell division, when chromatin is compacted into the familiar X-shaped metaphase chromosome visible by light microscopy. A simplified hierarchy:
bare DNA double helix (~2 nm)
|
nucleosomes: "beads on a string" (~10 nm fiber)
|
condensed chromatin fiber (~30 nm fiber)
|
looped domains on a scaffold
|
metaphase chromosome (maximally condensed, mitosis)
Most of the time the cell is not dividing, and its chromatin is far less condensed — but even then the degree of folding varies from region to region, and that variation carries biological meaning.
Euchromatin, Heterochromatin, and Gene Accessibility
Within an interphase nucleus, chromatin exists in two broad states. Euchromatin is loosely packed and relatively open; heterochromatin is densely packed and condensed 1. The distinction matters because the machinery that reads genes must physically reach the DNA. In open euchromatin, regulatory proteins and the transcription apparatus can access promoters and genes, so this state is associated with genes that are available for expression. In condensed heterochromatin, the DNA is largely shielded, and the genes it contains are generally silent.
Accessibility, then, is not fixed by the DNA sequence alone but by how that sequence is packaged. The same gene can be transcriptionally active in one cell type and silenced in another depending on its local chromatin state — a central theme of gene regulation that later coursework develops in detail.
Epigenetic Marks: Histone Modifications and DNA Methylation
The transitions between open and condensed chromatin are guided in part by chemical marks placed on histones and on DNA itself. Histone proteins carry flexible “tail” regions that can receive reversible chemical groups — acetyl, methyl, phosphate, and others. These histone modifications alter how tightly the histones grip DNA and serve as signals recruiting other proteins, thereby nudging a region toward a more open or more closed state 1.
A second mark acts directly on the DNA: DNA methylation, the addition of a methyl group to cytosine bases, most often where a cytosine is followed by a guanine. Methylation in regulatory regions is broadly associated with reduced gene expression 3. Crucially, these marks change gene activity without changing the underlying base sequence — the defining feature of epigenetics. The mechanisms, inheritance, and laboratory measurement of these marks (including bisulfite-based methods for detecting cytosine methylation) are topics for later lessons; here it is enough to recognize that chromatin is a regulated, chemically annotated structure rather than inert packaging.
Non-Histone DNA-Binding Proteins
Histones dominate by mass, but many other proteins bind DNA to carry out its functions. Transcription factors recognize specific short sequences, typically in promoter and regulatory regions, and control whether nearby genes are transcribed. The polymerases — DNA polymerase for replication and RNA polymerase for transcription — bind and move along the template to synthesize new nucleic-acid strands. DNA repair proteins survey the genome and correct damage and mismatches, preserving sequence fidelity across cell divisions 2. Unlike the largely sequence-independent binding of histones, many of these proteins read specific sequences or recognize particular structural features, and their access to DNA depends on the chromatin state described above.
Why This Matters in the Laboratory
For the molecular laboratory, chromatin is something that must be taken apart. Because genomic DNA is tightly bound to histones and other proteins, recovering clean, usable nucleic acid requires disrupting these protein–DNA complexes. Extraction procedures therefore break open cells and use agents — such as detergents and protein-digesting enzymes — to strip away histones and other associated proteins, freeing the DNA for downstream analysis 3. The specific chemistries and workflows for nucleic-acid isolation are covered in an upcoming topic; the point to carry forward is that the very packaging that organizes the genome inside the cell is an obstacle the analyst must overcome to study it.
References
- Bruce Alberts, Rebecca Heald, Alexander Johnson, David Morgan, Martin Raff, Keith Roberts, Peter Walter. Molecular Biology of the Cell. 7th ed. W. W. Norton & Company. 2022. verified
- David L. Nelson, Michael M. Cox, Aaron A. Hoskins. Lehninger Principles of Biochemistry. 8th ed. W. H. Freeman (Macmillan Learning). 2021. verified
- Lela Buckingham. Molecular Diagnostics: Fundamentals, Methods, and Clinical Applications. 3rd ed. F.A. Davis Company. 2019. verified