Sequencing the Falcon genome: Long-Read Technology Maps Avian Evolution
Source PublicationG3: Genes, Genomes, Genetics
Primary AuthorsZuccolo, Mohammed, Alrefaei et al.

Researchers have finally mapped the Falcon genome at a chromosome level, a feat previously hindered by the highly repetitive sequences within avian DNA. Older sequencing methods fragmented these complex regions, leaving massive gaps in our understanding of how these raptors diversified so quickly over the past 7.5 million years.
These results were observed under controlled laboratory conditions, so real-world performance may differ.
Context: Rebuilding the Falcon genome
Previously, older short-read sequencing produced millions of tiny genetic fragments that computers struggled to reassemble, especially around transposable elements (TEs). These highly repetitive sequences consistently disrupt sequence mapping and mask structural variations.
Now, using PacBio HiFi sequencing and long-read RNA Iso-Seq, scientists can read much longer, continuous stretches of DNA with high fidelity. This updated method drastically reduces assembly errors, providing a highly contiguous map with over 97% completeness. It allows researchers to see the exact physical arrangement of genes across entire chromosomes.
The Discovery: Measuring Raptor DNA
The team assembled complete genetic profiles for the peregrine (Falco peregrinus) and lanner (Falco biarmicus) falcons. By comparing these maps to the distantly related chicken (Gallus gallus), they measured extensive chromosomal rearrangements. Conversely, they found strong sequence conservation among the falcons themselves, a synteny further confirmed through comparison with another sequenced species, Falco rusticolus.
The study quantified several specific genetic features previously obscured by older technologies:
- Between 18,638 and 19,858 genes per bird, verified by long-read RNA Iso-Seq to ensure accurate coding region predictions.
- Primary immune receptors (MHC class I/II) and exactly 24 to 25 olfactory receptor genes.
- Transposable elements comprising roughly 7.4% to 8.4% of the total genetic material.
- Over 8,700 distinct structural variants, with more than 40% directly involving these repetitive TEs.
Interestingly, the researchers identified contigs belonging to the W sex chromosome based on a significantly higher concentration of TEs (30% or more) compared to autosomes and the Z chromosome.
What This Study Leaves Unanswered
Despite the technical precision of this work, the study does not solve the exact functional mechanisms of falcon speciation. The researchers catalogued structural variants and identified immune genes, but this static data alone cannot explain how specific physical traits evolved in the wild. We still lack experimental evidence linking these newly mapped genes to specific physiological adaptations or hunting behaviours.
Furthermore, while comparative data exists for Falco rusticolus, sampling primarily two species in this study leaves the broader genetic architecture of the remaining 35 falcon species largely unexplored. Identifying a gene computationally is only the first step; proving its ecological function requires extensive field and laboratory validation beyond these specific reference assemblies.
The Impact: A Rigorous Baseline
These highly contiguous assemblies establish a strict baseline for future comparative biology. Researchers can now track how specific transposable elements might drive structural polymorphism across different environments. As ecological pressures mount, having a precise genetic map could help conservationists identify populations with the highest adaptive potential.