hrvatski jezikClear Cookie - decide language by browser settings

Evaluation of hybrid and non-hybrid methods for de novo assembly of nanopore reads

Sović, Ivan; Križanović, Krešimir; Skala, Karolj; Šikić, Mile (2016) Evaluation of hybrid and non-hybrid methods for de novo assembly of nanopore reads. Bioinformatics, 32 (17). pp. 2582-2589. ISSN 1367-4803

[img]
Preview
PDF - Accepted Version - article
Available under License Creative Commons Attribution.

Download (106kB) | Preview

Abstract

Motivation: Recent emergence of nanopore sequencing technology set a challenge for established assembly methods. In this work we assessed how existing hybrid and non-hybrid de novo assembly methods perform on long and error prone nanopore reads. Results: We benchmarked five non-hybrid (in terms of both error correction and scaffolding) assembly pipelines as well as two hybrid assemblers which use third generation sequencing data to scaffold Illumina assemblies. Tests were performed on several publicly available MinION and Illumina datasets of E. coli K-12, using several sequencing coverages of nanopore data (20x, 30x, 40x and 50x). We attempted to assess the assembly quality at each of these coverages, in order to estimate the requirements for closed bacterial genome assembly. For the purpose of the benchmark, an extensible genome assembly benchmarking framework was developed. Results show that hybrid methods are highly dependent on the quality of NGS data, but much less on the quality and coverage of nanopore data and perform relatively well on lower nanopore coverages. All non-hybrid methods correctly assemble the E. coli genome when coverage is above 40x, even the non-hybrid method tailored for Pacific Biosciences reads. While it requires higher coverage compared to a method designed particularly for nanopore reads, its running time is significantly lower. Availability: https://github.com/kkrizanovic/NanoMark

Item Type: Article
Uncontrolled Keywords: nanopore ; sequencing ; de novo
Subjects: NATURAL SCIENCES > Biology
TECHNICAL SCIENCES > Computing
Divisions: Center for Informatics and Computing
Projects:
Project titleProject leaderProject codeProject type
Algoritmi za analizu slijeda genoma-AGESAMile ŠikićUIP-11-2013-7353HRZZ
Depositing User: Ivan Sović
Date Deposited: 23 Feb 2017 15:22
Last Modified: 28 Feb 2017 14:09
URI: http://fulir.irb.hr/id/eprint/3392
DOI: 10.1093/bioinformatics/btw237

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year