A strand specific high resolution normalization method for chip-sequencing data employing multiple experimental control measurements.

Stefan Enroth, Claes Andersson, Robin Andersson, Claes Wadelius, Mats Gustafsson, Jan Komorowski

Research output: Contribution to journalJournal articleResearchpeer-review

4 Citations (Scopus)
1125 Downloads (Pure)

Abstract

High-throughput sequencing is becoming the standard tool for investigating protein-DNA interactions or epigenetic modifications. However, the data generated will always contain noise due to e.g. repetitive regions or non-specific antibody interactions. The noise will appear in the form of a background distribution of reads that must be taken into account in the downstream analysis, for example when detecting enriched regions (peak-calling). Several reported peak-callers can take experimental measurements of background tag distribution into account when analysing a data set. Unfortunately, the background is only used to adjust peak calling and not as a pre-processing step that aims at discerning the signal from the background noise. A normalization procedure that extracts the signal of interest would be of universal use when investigating genomic patterns.
Original languageEnglish
JournalAlgorithms for Molecular Biology
Volume7
Issue number1
Pages (from-to)2
Number of pages10
ISSN1748-7188
DOIs
Publication statusPublished - 2012

Cite this