YASMA - Yet Another Statistical Microarray Analysis

YASMA - Yet Another Statistical Microarray Analysis


One problem arising in the analysis of gene expression data from microarrays is the identification of differentially expressed genes. There exist some popular rules, for example, to accept two-fold over or under-expression as significant. Though such rules might be quite useful in providing a quick overview, statistics has to offer a more thorough approach. By considering the amount of variability between replicates of the same experiments, a statistical significance in form of p-values can be assigned to differential gene expression levels.

YASMA is an add-on library for the R statistical package and can be used to analyse simple replicated experiments. For example, we are interested in bacterial genes over- or under-expressed in mutants as compared to the wild type. For this purpose, multiple mRNA preparations are hybridized on several arrays. As long as the same number of arrays is used for each preparation a straightforward ANOVA analysis and analysis of variance components can be applied to the series of experiments (a balanced factorial design).

At the moment the package contains

As the name suggests YASMA is intendended as a complement to the SMA microarray analysis package. High on my list of future extensions are the incorporation of regression analysis for time series data and extensions to more general designs (eg, latin square design).

Downloads

This is joint work with Sharon Kendall, Neil Stoker, Shamit Soneji, and the Bugs group


Lorenz Wernisch
May 2001