May 25, 2017 Credit: Susanna M. Hamilton, Broad Communications
The Broad Institute of MIT and Harvard will release version 4 of the industry-leading Genome Analysis Toolkit under an open source software license. The software package, designated GATK4, contains new tools and rebuilt architecture. It is available currently as an alpha preview on the Broad Institute's GATK website, with a beta release expected in mid-June. Broad engineers announced the upgrade, as well as the decision to release the tool as an open source product, at Bio-IT World today.
The new version is built on a new architecture, allowing significant streamlining of individual tools and support for performance-enhancing technologies such as Apache SparkTM. This new framework brings improvements to parallelization, capitalizing on cloud deployment and making the process of analyzing vast amounts of genomic data easier, faster, and more efficient.
"We wanted to remove traditional barriers of scale while offering the same high level of data quality our users expect," said Eric Banks, Senior Director of Data Sciences and Data Engineering at Broad and a creator of the original GATK software package. "Thanks to the rapid adoption of cloud computing, researchers can finally do away with many of the infrastructure-related complications that have hampered progress, especially at smaller institutions and startups."
Today, more than 45,000 academic and commercial users worldwide rely on the GATK, running millions of analyses. The GATK is the industry standard for identifying SNPs and indels in germline DNA and RNAseq data. In addition to improving the performance of these established tools, GATK4 extends this scope of analysis to include copy number and structural variation, for both germline and somatic research applications.
Fully open source software
GATK4 will be released as a fully open source product, thanks in part to a collaboration between Broad Institute and Intel Corporation to advance high-performance analytics so researchers can study massive amounts of genomic data from diverse sources worldwide.
At the Intel-Broad Center for Genomic Data Engineering, software engineers and researchers have spent the last several months building, optimizing, and widely sharing new tools and infrastructure to help scientists integrate and process genomic data. GATK4 has benefited from this collaboration, which has helped engineers optimize best practices in hardware and software for genome analytics to make it possible to combine and use research data sets that reside on private, public, and hybrid clouds.
"Releasing GATK4 as open source was the obvious next step for our team," said Geraldine Van der Auwera, Associate Director of Outreach and Communications within the Data Science and Data Engineering group at the Broad Institute. "We believe it's the most effective way to support the community, and we hope it continues to grow, innovate, and help researchers make insights that are essential for future human health breakthroughs." "It is critical for progress in biomedicine that the software we use for analysing the genomes of millions of people is robust and well understood," said Ewan Birney, Director of EMBL-EBI and Chair of the Global Alliance for Genomics and Health (GA4GH). "Releasing GATK software with an open source license directly supports open innovation, data re-use and data re-analysis in the global biomedical community."
"The GATK tools are crucial for both germline and cancer analyses," said Robert L. Grossman of the University of Chicago Department of Medicine and an expert in biomedical informatics. "Releasing GATK4 as an open source software package will increase adoption, and benefit the community."
"Open sourcing the GATK is a big deal for open genomics, and for open science in general," said Jeremy Freeman, manager of computational biology at the Chan Zuckerberg Initiative (CZI). "Not only does it make this critical tool available to as broad as possible an audience for use, reuse, inspection, and contributionit provides a powerful example to the community for how an existing project can embrace open source."
"Open source code is a foundation of efficient biomedical research," said Brad Chapman, a research scientist at the Harvard T.H. Chan School of Public Health. "It enables reproducibility, reuse and remixing by removing barriers for sharing and distributing analyses. The Broad Institute's GATK team leads in the development of scalable, sensitive and specific variant calling algorithms, and open sourcing GATK4 will allow frameworks like Blue Collar Bioinformatics to make these methods broadly available to the scientific research community."
"Cloudera has always been a supporter and believer in the power of open source code," said Tom White, data scientist at Cloudera and a member of the Apache Hadoop PMC. "We've been excited to contribute to the GATK codebase, to make it run smoothly on Apache Spark and Cloudera. This next phase of the GATK, powered by Spark and open source software, will expand access and improve collaboration among genomic data scientists."
"The open sourcing of GATK4 is a great step for genomics, allowing for scalability and performance gains to be openly available to the research, biotech and pharmaceutical communities," said Jason Waxman, corporate vice president and general manager of Data Center Solutions at Intel. "GATK4, when run on Intel's new reference architecture, can achieve a 5X speed-up compared to earlier versions of the software."
"We at Google are excited to see this new release," said Ilia Tulchinsky, Google Cloud Healthcare Engineering Lead. "We've been collaborating with the Broad Institute for the past three years to enhance genomic processing on Google Cloud Platform. As a strong supporter for open source technology, we believe that making GATK available this way will facilitate its use by genomic scientists everywhere. As fellow collaborators with Intel, we particularly look forward to enabling researchers to run GATK4 on Google Cloud using the upcoming Intel Xeon processor Scalable family."
"The GATK is one of the most widely utilized software packages in the life sciences, and our team has worked very productively with Broad to accelerate it for use on Azure," said Geralyn Miller, Director, AI & Research, Microsoft. "This new model will greatly facilitate this effort going forward, and we are excited to continue and expand our efforts around GATK on Azure."
"With the open source launch of GATK4, there is an opportunity to create a global community that can collaborate together and advance the state of art in bioinformatics," said Hong Tang, chief architect at Alibaba Cloud, the cloud computing arm of Alibaba Group. "We look forward to closely working with Broad Institute in bringing the cloud-based GATK service to genomics customers in China, as well as in ongoing GATK research and development."
In addition to offering GATK4 as an open source toolkit, Broad Institute will continue to offer user support, training, and outreach on its popular user support forum. GATK4, like many of the Broad Institute's genome analysis tools, will be available through the Broad Institute's cloud based analysis platform, FireCloud.
Explore further: Google joins effort to boost genomics research
Google announced Wednesday it was teaming up with university scientists to use its computing platform to accelerate efforts in genomics research.
In a sign of the growing importance of the Internet "cloud," software group Cloudera said Monday it raised a whopping $900 million to expand its big data corporate services.
Microsoft has joined the Linux Foundation, the latest sign that the software giant is embracing open-source technologies it formerly treated with hostility.
Cloud computing is a more efficient and cheaper alternative for researchers wanting to access and analyse large amounts of human genomic data, a local study has found.
Apple today announced that its Swift programming language is now open source. As an open source language, the broad community of talented developersfrom app developers to educational institutions to enterprisescan contribute ...
Judging from technology-watching sites, Intel has something to worry about and it involves a rather well known place on the technology map called Redmond, Washington. Look for the sign that says Microsoft. There.
There are significant gaps in our knowledge on the evolution of sex, according to a research review on sex chromosomes from Lund University in Sweden. Even after more than a century of study, researchers do not know enough ...
(Phys.org)Eusocial insects are predominantly dependent on chemosensory communication to coordinate social organization and define group membership. As the social complexity of a species increases, individual members require ...
Scientists using a high-resolution global climate model and historical observations of species distributions on the Northeast U.S. Shelf have found that commercially important species will continue to shift their distribution ...
If you open Google and start typing "Chinese cave gecko", the text will auto-populate to "Chinese cave gecko for sale" just US$150, with delivery. This extremely rare species is just one of an increasingly large number ...
Plant scientists at the University of Cambridge have found a plant protein indispensable for communication early in the formation of symbiosis - the mutually beneficial relationship between plants and fungi. Symbiosis significantly ...
Almost 150 years after Charles Darwin first proposed a little-known prediction from his theory of sexual selection, researchers have found that male moths with larger antennae are better at detecting female signals.
Please sign in to add a comment. Registration is free, and takes less than a minute. Read more
Read the original post:
Genome Analysis Toolkit 4 (GATK4) released as open source ... - Phys.Org
- ENCODE: Encyclopedia Of DNA Elements - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- 07.05.2010 - The Human Genome [ Coast To Coast AM ] - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- NOVA scienceNOW : 51 - Public Genomes, Algae Fuel, Mystery of the Gakkel Ridge, Yoky Matsuoka - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Vincent T. - Genome (Club Remix) - [Preview] - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Comparing The Human And Chimpanzee Genomes - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Whole Genome Sequencing and Its Impact on Clinical Care - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Carlos Bustamante -- "Reconstructing the Great Human Diasporas from Genome Variation Data" - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- 3 Sad Surprises: The Human Genome Project - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- The RFW interviews Genome - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Science Bulletins: Scientists Peer Inside "Superbug" Genome - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Genome : Live @ Smu's : June 3 2012 - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Inoki Genome Federation - Genome 19 - 04 02 2012 - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- THE HUMAN GENOME MUSIC PROJECT - CHROMOSOME 1 - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Genomic Medicine - Bruce Korf (2012) - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Human Genome's 'Blockbuster' Potential Undervalued in Bid GSK vs HGSI - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Announcing the Completion of the First Survey of the Entire Human Genome at the White House - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- DNA analysis Part I. Genomic Sequencing - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- The Genome Question: Moore vs. Jevons with Bud Mishra - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- Genome-Wide Association Studies - Karen Mohlke (2012) - Video [Last Updated On: September 7th, 2012] [Originally Added On: September 7th, 2012]
- New human genome research aids understanding of disease [Last Updated On: September 8th, 2012] [Originally Added On: September 8th, 2012]
- UNC Lineberger scientists lead definition of key lung cancer genome [Last Updated On: September 10th, 2012] [Originally Added On: September 10th, 2012]
- Illumina Announces Expedited Individual Genome Sequencing Service (IGS) [Last Updated On: September 11th, 2012] [Originally Added On: September 11th, 2012]
- Genome research given a boost with opening of bioscience facility [Last Updated On: September 11th, 2012] [Originally Added On: September 11th, 2012]
- Re-Imagining Our Genes: ENCODE Project Reveals Genome as an Information Processing System [Last Updated On: September 11th, 2012] [Originally Added On: September 11th, 2012]
- Illumina unveils upgraded genome sequence service [Last Updated On: September 12th, 2012] [Originally Added On: September 12th, 2012]
- US Personalized Cancer Genome Sequencing Market [Last Updated On: September 18th, 2012] [Originally Added On: September 18th, 2012]
- Yale maps “uncharted” genome regions [Last Updated On: September 18th, 2012] [Originally Added On: September 18th, 2012]
- Research and Markets: US Personalized Cancer Genome Sequencing Market [Last Updated On: September 19th, 2012] [Originally Added On: September 19th, 2012]
- 3Qs: New clues to unlocking the genome [Last Updated On: September 19th, 2012] [Originally Added On: September 19th, 2012]
- Oyster Genome Pries Open Mollusk Evolutionary Shell [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- Bangladeshi scientist decodes genome of deadly fungus [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- Oyster genome uncover the stress adaptation and complexity of shell formation [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- The oyster genome reveals stress adaptation and complexity of shell formation [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- Diseases of aging map to a few 'hotspots' on the human genome [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- GnuBIO Awarded $4.5 Million in Funding from the National Human Genome Research Institute to Develop Lower Cost Genome ... [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- Oyster genome mystery unravelled [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- Devangshu Datta: What's in a genome [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- Pacific Oyster Genome Shows Stress Adaptation And Complexity Of Shell Formation [Last Updated On: September 20th, 2012] [Originally Added On: September 20th, 2012]
- UNC Lineberger scientists lead cancer genome analysis of breast cancer [Last Updated On: September 24th, 2012] [Originally Added On: September 24th, 2012]
- Encoding the human genome [Last Updated On: September 24th, 2012] [Originally Added On: September 24th, 2012]
- Cancer genome analysis of breast cancer: Team identifies genetic causes and similarity to ovarian cancer [Last Updated On: September 24th, 2012] [Originally Added On: September 24th, 2012]
- Fungus genome map paves way for 'Snow White' jute variety [Last Updated On: September 24th, 2012] [Originally Added On: September 24th, 2012]
- New online, open access journal focuses on microbial genome announcements [Last Updated On: September 25th, 2012] [Originally Added On: September 25th, 2012]
- By Simply Sharing, Doctors Could Unlock the Genome's Potential [Last Updated On: September 25th, 2012] [Originally Added On: September 25th, 2012]
- Forget the Cloud—Knome Offers Genome Analysis in a Box [Last Updated On: September 28th, 2012] [Originally Added On: September 28th, 2012]
- BGI@CHOP Joint Genome Center to Offer Clinical Next-Generation Sequencing Services [Last Updated On: September 28th, 2012] [Originally Added On: September 28th, 2012]
- Holy Bat Virus! Genome Hints At Origin Of SARS-Like Virus [Last Updated On: September 29th, 2012] [Originally Added On: September 29th, 2012]
- Community Fundraising Effort Helps Researchers Sequence Parrot Genome [Last Updated On: September 29th, 2012] [Originally Added On: September 29th, 2012]
- UMass Med professors are sleuths of the genome [Last Updated On: September 30th, 2012] [Originally Added On: September 30th, 2012]
- Knome Introduces the knoSYS™100; First Plug-and-Play Human Genome Interpretation System [Last Updated On: September 30th, 2012] [Originally Added On: September 30th, 2012]
- First large scale trial of whole-genome cancer testing for clinical decision-making reported [Last Updated On: October 1st, 2012] [Originally Added On: October 1st, 2012]
- Should You Get Your Genome Mapped? [Last Updated On: October 1st, 2012] [Originally Added On: October 1st, 2012]
- Surprising differences between apples and pears [Last Updated On: October 2nd, 2012] [Originally Added On: October 2nd, 2012]
- 50-Hour Whole Genome Sequencing Provides Rapid Diagnosis for Children With Genetic Disorders [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- A map of rice genome variation reveals the origin of cultivated rice [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- Genome analysis promises hope for breast cancer patients [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- Genome Alberta Welcomes Alberta Minister of Enterprise and Advanced Education, Stephen Khan and Federal Minister of ... [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- Fifty-hour whole genome sequencing provides rapid diagnosis for children with genetic disorders [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- Will Low-Cost Genome Sequencing Open 'Pandora's Box'? [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- Genome testing could help individualize treatments [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- Would you get your genome tested? [Last Updated On: October 3rd, 2012] [Originally Added On: October 3rd, 2012]
- The Genome — a Pandora's Box? [Last Updated On: October 4th, 2012] [Originally Added On: October 4th, 2012]
- Fast genome test could help sick newborns [Last Updated On: October 4th, 2012] [Originally Added On: October 4th, 2012]
- In-Depth Genome Analysis Moves Toward The Hospital Bed [Last Updated On: October 5th, 2012] [Originally Added On: October 5th, 2012]
- Your Verdict On Getting A Genome Test? Bring It On [Last Updated On: October 6th, 2012] [Originally Added On: October 6th, 2012]
- Genome-wide study identifies 8 new susceptibility loci for atopic dermatitis [Last Updated On: October 7th, 2012] [Originally Added On: October 7th, 2012]
- Genome-wide study identifies eight new susceptibility loci for atopic dermatitis [Last Updated On: October 7th, 2012] [Originally Added On: October 7th, 2012]
- Genome interpreter vies for place in clinical market [Last Updated On: October 10th, 2012] [Originally Added On: October 10th, 2012]
- The $1,000 Genome: A Bait and Switch? [Last Updated On: October 10th, 2012] [Originally Added On: October 10th, 2012]
- Mount Sinai School of Medicine Offers First-Ever Course with Whole Genome Sequencing [Last Updated On: October 10th, 2012] [Originally Added On: October 10th, 2012]
- First whole genome sequencing of multiple pancreatic cancer patients has been outlined [Last Updated On: October 11th, 2012] [Originally Added On: October 11th, 2012]
- Cheap genome sequences demand new rules on privacy [Last Updated On: October 11th, 2012] [Originally Added On: October 11th, 2012]
- UConn Gets Grant For Genome Research [Last Updated On: October 11th, 2012] [Originally Added On: October 11th, 2012]
- Inconsistent Genome Privacy Laws Need Toughening, Panel Says [Last Updated On: October 12th, 2012] [Originally Added On: October 12th, 2012]
- US panel calls for stronger privacy for genome data [Last Updated On: October 12th, 2012] [Originally Added On: October 12th, 2012]
- Genome Canada Board Appoints New Chair [Last Updated On: October 12th, 2012] [Originally Added On: October 12th, 2012]
- The $1,000 Genome Is Almost Here- Are We Ready? [Last Updated On: October 15th, 2012] [Originally Added On: October 15th, 2012]
- Global genome effort seeks genetic roots of disease [Last Updated On: October 31st, 2012] [Originally Added On: October 31st, 2012]
- Massive encyclopedia helps explain how the human genome works [Last Updated On: October 31st, 2012] [Originally Added On: October 31st, 2012]
- Genome evolution and carbon dioxide dynamics [Last Updated On: October 31st, 2012] [Originally Added On: October 31st, 2012]