Close Menu
My Blog

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Nautilus debuts Voyager platform in push toward next-gen proteomics

    March 1, 2026

    First-in-Human Success for Prenatal Stem Cell Therapy in Spina Bifida

    February 28, 2026

    Pressure-Driven Pathway Links Liver Congestion to Fibrosis and Cancer

    February 28, 2026
    Facebook X (Twitter) Instagram
    X (Twitter) YouTube
    My BlogMy Blog
    Sunday, March 1
    • Home
    • About Us
    • Healthy Living
    • DNA & Genetics
    • Podcast
    • Shop
    My Blog
    Home»DNA & Genetics»Compression Technique Shrinks the Size of Massive Pangenome Data Stores
    DNA & Genetics

    Compression Technique Shrinks the Size of Massive Pangenome Data Stores

    adminBy adminJanuary 15, 2026No Comments3 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp VKontakte Email
    The Scientist Logo
    Share
    Facebook Twitter LinkedIn Pinterest Email

    The new technique compresses vast amounts of genomic data.

    Image credit:© iStock.com, jxfzsy 

    The recent revolution in genomic sequencing has opened new fields of study, but it’s also produced a DNA data deluge that has led to almost too much genomic information for researchers to handle.

    In a new study, published in Nature Genetics, researchers have described a method that could help the largest genomics projects handle these vast data volumes by achieving unrivaled levels of compression.1 The new approach could make these data resources accessible and usable to a wider group of scientists.

    Pangenomics: DNA Analysis at the Largest Scale

    While early genomics projects focused on representative reference genomes derived from a single individual, the emerging field of pangenomics has set bigger goals. In pangenomics studies, researchers assemble many genomes from a single species to capture all of the genetic variation present in that species’ DNA. This approach can demonstrate how mutations affect pathogen spread or drug resistance.

    While this approach may offer a broader lens for research, it also puts significant strain on labs’ data storage. Consortia storing pangenomes are amassing terabytes of uncompressed FASTA files (text-based files of nucleotide sequences), and the data handling required to make these files accessible can take an impractically long time.2

    These pangenomic data resources are also challenging to visualize. Graph-based data formats have become popular in the field, but these approaches still have high storage requirements and don’t capture all of the relevant information from the genomes’ genetic history. This includes the collected genomes’ shared mutational and evolutionary histories.

    “The data structures used for pangenomics research are critical because they determine not only how efficiently genetic data is represented, but also what the data can represent,” said Sumit Walia, a study coauthor and electrical engineer at the University of California, San Diego (UCSD), in a statement.

    Walia and a team led by UCSD engineer Yatish Turakhia have developed a new file format and data structure, called Pangenome Mutation-Annotated Network (PanMAN), that could maximize the potential of pangenomic data.

    Impressing by Compressing

    In their new study, the team tested PanMAN’s ability to compress genomic data on the SARS-CoV-2 virus genome. They first created a massive viral pangenome, made up of eight million separate viral genomes. They were able to compress it 3,000-fold, reducing this trove of genetic data to 366 megabytes—about half the file size of a mid-definition TV episode.

    The format also allows researchers to directly analyze this compressed data, opening up unusably large data volumes for study. “Our compressive technique with PanMANs allows doing more with less, greatly improving the scale and scope of current pangenomic analysis,” said Turakhia in a statement.

    The PanMAN format visualizes individual genomes as the roots of graphical trees. Different branches of the tree represent genomic features, such as mutations. Complex mutations involving multiple parent sequences are shown as connecting edges between these trees. This means that single mutations are stored only once on shared branches, rather than in multiple locations. The technique also directly and indirectly stores useful data that other graphical representations miss, such as ancestral sequences and phylogeny.

    The team’s next step is to apply PanMAN to human genomes to broaden the technique’s impact.

    “Extending compressive pangenomics to human genomes can fundamentally transform how we store, analyze, and share large-scale human genetic data,” said Turakhia. “Besides enabling studies of human genetic diversity, disease, and evolution at unprecedented scale and speed, it can depict detailed evolutionary and mutational histories which shape diverse human populations, something that current representations do not capture.”

    Compression Data massive Pangenome shrinks Size Stores Technique
    Share. Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Email
    Previous ArticleBiosplice files FDA application for knee osteoarthritis drug
    Next Article Expanding Clinically Relevant Findings with Enhanced Exomes
    admin
    • Website

    Related Posts

    A Video Report from AGBT

    February 27, 2026

    Novo Nordisk, Vivtex Ink Up to $2.1B Deal to Develop Oral Biologics for Metabolic Conditions

    February 27, 2026

    Increasing Rice Yields with Gene-Informed Selective Breeding

    February 27, 2026

    Mutant p53 Selective Reactivation Demonstrated in Advanced Solid Tumors

    February 27, 2026
    Leave A Reply Cancel Reply

    Our Picks

    9 Time-Saving Kitchen Gadgets for Fall at Amazon

    September 5, 2025

    Why Exercise Is So Important For Heart Health, From An MD

    September 5, 2025

    An Engineered Protein Helps Phagocytes Gobble Up Diseased Cells

    September 5, 2025

    How To Get Rid Of Hangnails + Causes From Experts

    September 5, 2025
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo
    Don't Miss
    Longevity

    Nautilus debuts Voyager platform in push toward next-gen proteomics

    By adminMarch 1, 20260

    Company’s new benchtop system promises a clearer view of proteins following validation at a leading…

    First-in-Human Success for Prenatal Stem Cell Therapy in Spina Bifida

    February 28, 2026

    Pressure-Driven Pathway Links Liver Congestion to Fibrosis and Cancer

    February 28, 2026

    A cellular atlas of aging comes into focus

    February 28, 2026

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    About Us

    At FineGut, our mission is simple: to enhance your self-awareness when it comes to your gut health. We believe that a healthy gut is the foundation of overall well-being, and understanding the brain–gut connection can truly transform the way you live.

    Our Picks

    9 Time-Saving Kitchen Gadgets for Fall at Amazon

    September 5, 2025

    Why Exercise Is So Important For Heart Health, From An MD

    September 5, 2025

    An Engineered Protein Helps Phagocytes Gobble Up Diseased Cells

    September 5, 2025
    Gut Health

    Nautilus debuts Voyager platform in push toward next-gen proteomics

    March 1, 2026

    First-in-Human Success for Prenatal Stem Cell Therapy in Spina Bifida

    February 28, 2026

    Pressure-Driven Pathway Links Liver Congestion to Fibrosis and Cancer

    February 28, 2026
    X (Twitter) YouTube
    • Contact us
    • Privacy Policy
    • Disclaimer
    • Terms and Conditions
    © 2026 finegut.com. Designed by Pro.

    Type above and press Enter to search. Press Esc to cancel.