Repository logo
 

Data analysis issues for allele-specific expression using Illumina's GoldenGate assay.


Change log

Authors

Ritchie, Matthew E 
Forrest, Matthew S 
Dimas, Antigone S 
Daelemans, Caroline 
Dermitzakis, Emmanouil T 

Abstract

BACKGROUND: High-throughput measurement of allele-specific expression (ASE) is a relatively new and exciting application area for array-based technologies. In this paper, we explore several data sets which make use of Illumina's GoldenGate BeadArray technology to measure ASE. This platform exploits coding SNPs to obtain relative expression measurements for alleles at approximately 1500 positions in the genome. RESULTS: We analyze data from a mixture experiment where genomic DNA samples from pairs of individuals of known genotypes are pooled to create allelic imbalances at varying levels for the majority of SNPs on the array. We observe that GoldenGate has less sensitivity at detecting subtle allelic imbalances (around 1.3 fold) compared to extreme imbalances, and note the benefit of applying local background correction to the data. Analysis of data from a dye-swap control experiment allowed us to quantify dye-bias, which can be reduced considerably by careful normalization. The need to filter the data before carrying out further downstream analysis to remove non-responding probes, which show either weak, or non-specific signal for each allele, was also demonstrated. Throughout this paper, we find that a linear model analysis of the data from each SNP is a flexible modelling strategy that allows for testing of allelic imbalances in each sample when replicate hybridizations are available. CONCLUSIONS: Our analysis shows that local background correction carried out by Illumina's software, together with quantile normalization of the red and green channels within each array, provides optimal performance in terms of false positive rates. In addition, we strongly encourage intensity-based filtering to remove SNPs which only measure non-specific signal. We anticipate that a similar analysis strategy will prove useful when quantifying ASE on Illumina's higher density Infinium BeadChips.

Description

Keywords

Alleles, Databases, Genetic, Gene Expression, Genome, Genomics, Oligonucleotide Array Sequence Analysis, Polymorphism, Single Nucleotide, Statistics as Topic

Journal Title

BMC Bioinformatics

Conference Name

Journal ISSN

1471-2105
1471-2105

Volume Title

Publisher

Springer Science and Business Media LLC