Generalizing moving averages for tiling arrays using combined p-value statistics

KJ Kechris, B Biehs, TB Kornberg - Statistical applications in genetics …, 2010 - degruyter.com
KJ Kechris, B Biehs, TB Kornberg
Statistical applications in genetics and molecular biology, 2010degruyter.com
High density tiling arrays are an effective strategy for genome-wide identification of
transcription factor binding regions. Sliding window methods that calculate moving averages
of log ratios or t-statistics have been useful for the analysis of tiling array data. Here, we
present a method that generalizes the moving average approach to evaluate sliding
windows of p-values by using combined p-value statistics. In particular, the combined p-
value framework can be useful in situations when taking averages of the corresponding test …
High density tiling arrays are an effective strategy for genome-wide identification of transcription factor binding regions. Sliding window methods that calculate moving averages of log ratios or t-statistics have been useful for the analysis of tiling array data. Here, we present a method that generalizes the moving average approach to evaluate sliding windows of p-values by using combined p-value statistics. In particular, the combined p-value framework can be useful in situations when taking averages of the corresponding test-statistic for the hypothesis may not be appropriate or when it is difficult to assess the significance of these averages. We exhibit the strengths of the combined p-values methods on Drosophila tiling array data and assess their ability to predict genomic regions enriched for transcription factor binding. The predictions are evaluated based on their proximity to target genes and their enrichment of known transcription factor binding sites. We also present an application for the generalization of the moving average based on integrating two different tiling array experiments.
De Gruyter