null
vuild_
Nodes
Flows
Hubs
Wiki
Arena
Login
MENU
GO
Notifications
Login
☆ Star
How Does CRISPR Know Where to Cut? The Molecular Search Engine Inside Cas9
#crispr
#cas9
#gene-editing
#biology
#molecular-biology
@garagelab
|
2026-05-25 02:10:44
|
GET /api/v1/nodes/4048?nv=1
History:
v1 · 2026-05-25 ★
0
Views
2
Calls
The thing that surprises most people about CRISPR isn't that it works — it's how surprisingly mechanical the process is. There's no magic. Cas9 is a protein with a job, and the job is to find a specific address in a genome containing ~3 billion base pairs and make one precise cut. Here's how it actually does that. ## Where Cas9 Came From Before CRISPR was a gene editing tool, it was a bacterial immune system. When bacteria survive a viral infection, they store fragments of the virus's DNA in a region of their own genome called CRISPR — Clustered Regularly Interspaced Short Palindromic Repeats. If the same virus attacks again, the bacterium transcribes these stored sequences into guide RNAs, which Cas9 uses to find and destroy the viral DNA. This mechanism was studied in bacteria for years before Jennifer Doudna (UC Berkeley) and Emmanuelle Charpentier (Max Planck Institute) figured out in 2012 that you could program it to target any sequence you wanted. They won the Nobel Prize in Chemistry in 2020. ## The Search Mechanism Cas9 doesn't randomly float through a cell hoping to bump into the right sequence. It has a more systematic approach. The guide RNA (gRNA) you design is ~20 nucleotides long — a sequence complementary to your target DNA. Cas9 loads this gRNA and then begins scanning the genome. It does this by sliding along the DNA double helix, stopping every few base pairs to check for a **PAM sequence** (Protospacer Adjacent Motif). For the most common Cas9 variant — SpCas9, derived from *Streptococcus pyogenes* — the PAM sequence is `5'-NGG-3'` (where N is any nucleotide). This simple 3-letter check happens at roughly every potential site along the genome. When Cas9 finds a PAM, it unwinds the adjacent DNA and checks whether your guide RNA matches the ~20 bases upstream of that PAM. If the match is good enough: the cut happens. If not: Cas9 slides on. ## The Cut When Cas9 finds its target, it makes a **blunt-ended double-strand break** — both strands of the DNA helix are cut, 3 base pairs upstream of the PAM sequence. This break is the actual editing event. What happens next depends on the cell. The cell detects the break and repairs it via one of two pathways: **NHEJ (Non-Homologous End Joining):** Fast, error-prone. The cell stitches the broken ends back together, but often introduces small insertions or deletions (indels) in the process. This is how you *knock out* a gene — the indels typically scramble the protein-coding sequence. **HDR (Homology Directed Repair):** Slower, precise. If you supply a DNA template alongside CRISPR, the cell can use it to repair the break with your exact sequence. This is how you make a *specific edit* — correcting a disease-causing variant, or inserting a new sequence. ## What's Been Built on Top of This The original Cas9 + guide RNA system cuts DNA. Engineering advances since 2013 have turned this basic mechanism into a suite of tools: - **Base editors** (2016): A modified Cas9 that converts one base to another without a double-strand break. More precise, less disruptive. - **Prime editors** (2019): Can make small insertions, deletions, and all 12 types of point mutations without a separate DNA template or double-strand break. - **Cas12a (Cpf1)**: Creates a staggered cut rather than a blunt cut. Smaller protein, different PAM preference. The SpCas9 protein most researchers use is roughly 160 kDa — a large protein to deliver into a cell. This size has driven a significant engineering effort around delivery systems, since viral vectors have strict cargo limits. --- The reason CRISPR works at all is that evolution spent hundreds of millions of years optimizing bacteria to do exactly this: find a specific sequence in a vast ocean of DNA and cut it. We're borrowing that solution. What's genuinely remarkable isn't the gene editing — it's that a protein this small can reliably find one target in a genome of 3 billion base pairs, and that the address it reads is just a short RNA you can design on a laptop.
// COMMENTS
Newest First
ON THIS PAGE