Researchers conduct sequencing and de novo assembly of 150 genomes in Denmark – Phys.Org

Posted: July 27, 2017 at 9:47 am

July 27, 2017 by Bob Yirka report A depiction of the double helical structure of DNA. Its four coding units (A, T, C, G) are color-coded in pink, orange, purple and yellow. Credit: NHGRI

(Phys.org)A large international team of researchers has developed a Danish reference genome catalog based on the de novo assembly of 150 genomes sequenced from 50 family trios. In their paper published in the journal Nature, the group describes the multi-year effort, its purpose, and where they believe such efforts are leading.

One of the ways that scientists are learning about diseases, particularly those that are hereditary in nature, is by sequencing the genomes of large groups of peopledoing so enables searching for variants that cause or contribute to a given diseasethe ultimate goal would be to sequence every single person on Earth. In this new effort, the researchers have sequenced the genomes of 150 people in Denmark (from 50 family trios) of Danish descentnative Inuits and immigrants were screened out.

The effort, the researchers report, was carried out partly to create a Danish reference catalog and partly to learn how to conduct large-scale genome sequencing. They note that such a project required the combined efforts of multiple people and organizations working in coordinated fashion. Ultimately, the project cost approximately $10 million.

One major aspect of the project was selecting technology and determining how many people to sequence. The team wound up using samples from 50 families and did the sequencing using both combinations of paired-end and mate-pair libraries with the Illumina HiSeq2000. They note also that de novo assemblies were used because the researchers believed other approaches left out pertinent information. De novo refers to deriving a peptide sequence from a mass spectrum without the use of a sequence database. It is the preferred approach to sequencing when the aim is to identify novel peptides in organisms that have not been previously sequenced. The researchers report very few gaps in the data. They also note that three assemblies were used: Allpaths-LG14, SOAPdenovo2 and SGA. Accuracy was measured by comparing their results with the human reference genome.

The researchers hope that their effort will lead to improved medical interpretation of genetics in Denmark.

Explore further: Genome sequencing of individual Korean offers opportunity to identify parts of sequence unique to Korean population

More information: Lasse Maretty et al. Sequencing and de novo assembly of 150 genomes from Denmark as a population reference, Nature (2017). DOI: 10.1038/nature23264

Abstract Hundreds of thousands of human genomes are now being sequenced to characterize genetic variation and use this information to augment association mapping studies of complex disorders and other phenotypic traits. Genetic variation is identified mainly by mapping short reads to the reference genome or by performing local assembly. However, these approaches are biased against discovery of structural variants and variation in the more complex parts of the genome. Hence, large-scale de novo assembly is needed. Here we show that it is possible to construct excellent de novo assemblies from high-coverage sequencing with mate-pair libraries extending up to 20 kilobases. We report de novo assemblies of 150 individuals (50 trios) from the GenomeDenmark project. The quality of these assemblies is similar to those obtained using the more expensive long-read technology. We use the assemblies to identify a rich set of structural variants including many novel insertions and demonstrate how this variant catalogue enables further deciphering of known association mapping signals. We leverage the assemblies to provide 100 completely resolved major histocompatibility complex haplotypes and to resolve major parts of the Y chromosome. Our study provides a regional reference genome that we expect will improve the power of future association mapping studies and hence pave the way for precision medicine initiatives, which now are being launched in many countries including Denmark.

Journal reference: Nature

2017 Phys.org

(Phys.org)A team of researchers with the Genomic Medicine Institute at Seoul National University in South Korea and Korean genetic sequencing firm Macrogen has conducted the most complete genome sequencing of a person ...

(Medical Xpress)Via genetic analysis, a large international team of researchers has found rare, damaging gene variants that they believe contribute to the risk of a person developing schizophrenia. In their paper published ...

(Phys.org) Genome sequencing-technology company, Illumina, based in San Diego has announced (at the annual JP Morgan Healthcare Conference) that its new machine, called the HiSeq X Ten is able to sequence whole human genomes ...

A team spanning Baylor College of Medicine, Rice University, Texas Children's Hospital and the Broad Institute of MIT and Harvard has developed a new way to sequence genomes, which can assemble the genome of an organism, ...

Researchers from BGI reported the most complete haploid-resolved diploid genome (HDG) sequence based on de novo assembly with NGS technology and the pipeline developed lays the foundation for de novo assembly of genomes with ...

To reconstruct phylogenetic trees from next-generation sequencing data using traditional methods requires a time-consuming combination of bioinformatic procedures including genome assembly, gene prediction, orthology identification ...

(Phys.org)A large international team of researchers has developed a Danish reference genome catalog based on the de novo assembly of 150 genomes sequenced from 50 family trios. In their paper published in the journal Nature, ...

When salmon spawn, the sperm of competing males are in an all-or-nothing race to be the first to reach and fertilise the eggs.

Malaria was already widespread on Sardinia by the Roman period, long before the Middle Ages, as indicated by research at the Institute of Evolutionary Medicine of the University of Zurich with the help of a Roman who died ...

A new mathematical model of ecology created by University of Chicago scientists provides the most accurate reproduction to date of natural biodiversity, according to a new paper in the journal Nature.

It might seem like a tomato plant and a subway system don't have much in common, but both, it turns out, are networks that strive to make similar tradeoffs between cost and performance.

Bacteria come in all shapes and sizessome are straight as a rod, others twist like a corkscrew. Shape plays an important role in how bacteria infiltrate and attack cells in the body. The helical shape of Helicobacter pylori, ...

Please sign in to add a comment. Registration is free, and takes less than a minute. Read more

Continued here:
Researchers conduct sequencing and de novo assembly of 150 genomes in Denmark - Phys.Org

Related Posts