Scientists have made a major breakthrough in mapping the human genome, filling in millions of missing DNA letters that were previously unresolved. An international team led by Christine Beck of The Jackson Laboratory and the University of Connecticut Health Center sequenced 65 ancestry-diverse genomes and closed 92 percent of the gaps left by earlier projects. The missing pieces often carry variants that influence digestion, immunity, and muscle control.
Without them, risk models for many conditions were blind to entire classes of DNA changes. The new assemblies bring those stretches into view, allowing variant calling software to flag complex rearrangements that older methods missed. The team used a new type of sequencing that reads much longer pieces of DNA than older methods.
They combined two types of reads: one that’s very accurate and another that’s extra long, enabling them to capture big and tricky sections. The researchers stitched those reads together to build complete sets of DNA from each person, including both their mother’s and father’s versions. The researchers uncovered nearly 2,000 complicated changes that were previously too challenging to find.
They also identified over 12,000 pieces of “jumping” DNA, which are bits that can move around and change how genes work. Additionally, the team fully mapped more than 1,200 centromeres, the central parts of chromosomes, which help them divide properly. Earlier references were built mostly from European genomes, a limitation that skewed risk scores and drug studies for decades.
Breakthroughs in human genome mapping
Clinicians in Africa, South America, and Asia have repeatedly reported mismatches between test results and patient outcomes. Nearly 60 percent of the newly found insertions and 14 percent of deletions occur in fewer than one in 100 people, making them ideal markers for rare disease diagnosis.
“It’s only been in the last three years that technology finally got to the point where we can sequence complete genomes,” noted Charles Lee of the Jackson Laboratory for Genomic Medicine. He considers 65 complete genomes to be a starting point, not a finish line. “There’s more and more realization that these sequences are not junk,” added Jan Korbel, referring to the repetitive DNA now decoded.
Korbel highlighted that the resource is open for anyone to explore. Both scientists see the data as a launchpad for large health care projects, from newborn screening to predictive polygenic tools, that work equally well for every community. The consortium is already integrating its assemblies into graph-based tools, allowing routine short-read data to benefit from the richer reference.
Early tests push per genome variant detection past 26,000 structural changes, roughly double earlier counts. Sequencing costs are falling so quickly that fully phased, telomere-to-telomere genomes may soon be standard in diagnostic labs, ending the era in which physicians worked with partial maps and educated guesses. Understanding health requires the full genetic blueprint, and this study significantly advances that goal by providing most of the missing pages.
Remaining gaps may close as long-read sequencing becomes more common in everyday medicine.
