Asterias: A Parallelized Web-based Suite for the Analysis of Expression and aCGH Data
Andreu Alibés, Edward R. Morrissey, Andrés Cañada, Oscar M. Rueda, David Casado, Patricio Yankilevich and Ramón Díaz-Uriarte
Statistical Computing Team, Structural and Computational Biology Programme, Spanish National Cancer Center (CNIO), Melchor Fernández Almagro 3, Madrid, 28029, Spain.
Type: Application Note
Abstract: The analysis of expression and CGH arrays plays a central role in the study of complex diseases, especially cancer, including finding markers for early diagnosis and prognosis, choosing an optimal therapy, or increasing our understanding of cancer development and metastasis. Asterias (http://www.asterias.info) is an integrated collection of freely-accessible web tools for the analysis of gene expression and aCGH data. Most of the tools use parallel computing (via MPI) and run on a server with 60 CPUs for computation; compared to a desktop or server-based but not parallelized application, parallelization provides speed ups of factors up to 50. Most of our applications allow the user to obtain additional information for user-selected genes (chromosomal location, PubMed ids, Gene Ontology terms, etc.) by using clickable links in tables and/or fi gures. Our tools include: normalization of expression and aCGH data (DNMAD); converting between different types of gene/clone and protein identifi ers (IDconverter/IDClight); fi ltering and imputation (preP); finding differentially expressed genes related to patient class and survival data (Pomelo II); searching for models of class prediction (Tnasas); using random forests to search for minimal models for class prediction or for large subsets of genes with predictive capacity (GeneSrF); searching for molecular signatures and predictive genes with survival data (SignS); detecting regions of genomic DNA gain or loss (ADaCGH). The capability to send results between different applications, access to additional functional information, and parallelized computation make our suite unique and exploit features only available to web-based applications.
Readers of this also read:
- Gamma-Normal-Gamma Mixture Model for Detecting Differentially Methylated Loci in Three Breast Cancer Cell Lines
- Statistical Redundancy Testing for Improved Gene Selection in Cancer Classification Using Microarray Data
- Analysis of Gene Expression Data Using BRB-Array Tools
- Data Mining for Gene Networks Relevant to Poor Prognosis in Lung Cancer via Backward-Chaining Rule Induction
- Detecting Outlier Microarray Arrays by Correlation and Percentage of Outliers Spots






