New bioinformatics tool for complex genome analysis

Posted: January 12, 2014 at 3:49 am

Scientists from The Genome Analysis Centre (TGAC) in Norwich UK have developed a new bioinformatics tool to boost complex genome analysis.

The tool supports Illuminas recently released Nextera LMP kit, which enables the production of jumping libraries of up to 12kb. These Long Mate Pair libraries are an invaluable resource for analysing large areas of the genome, carrying out complex assemblies and other downstream bioinformatics analytics.

However, LMP libraries are intrinsically noisy and to maximise their value, post-sequencing data analysis is required.

Richard Leggett, at TGAC, said: Regulating laboratory protocols and selection of sequenced data for downstream analysis are vital in making effective use of mate pair libraries.

However, quality control of the libraries can require significant bioinformatics analysis. Further processing is also required to extract true mate pair reads, remove fragment junction adaptors and clip reads.

For this reason we developed NextClip, a tool for comprehensive quality analysis of Nextera LMP libraries and preparation of reads for scaffolding.

Mate pair libraries are formed by making large fragments of DNA (5-12 kb in length for Nextera) and are sequenced from either end of the fragment to produce two sequences of DNA that are separated by a known distance.

Sequence reads from Long Mate Pair libraries are an important tool in the construction of complex genome assemblies because they connect large repeat regions.

Grouping the data generated from mate pair library sequencing with shorter insert paired-end reads provide a powerful combination, allowing the joining together of longer DNA sequences, with higher certainty.

The Genome Analysis Centre is a research institute focused on the development of genomics and computational biology. Based at Norwich Research Park, it receives strategic funding from the Biotechnology and Biological Science Research Council - 9.2 million in 2012-2013 - as well as support from other research funders.

Read more:
New bioinformatics tool for complex genome analysis

Related Posts