A Journal Devoted To All Areas Of Applied Statistics
 
 
  Annals of Applied Statistics
  Submissions
  Subscriptions
  Editorial Board
  Next Issues
  Published Issues
Supplements
  Instructions for Referees
  Letters to Editor
 
Replication data for: Diverse Correlation Structures in Microarray Gene Expression Data
hdl:1902.1/10645
Version: 1 – Released: Fri Jan 04 13:30:53 EST 2008
Cataloging Information
Documentation, Data and Analysis
User Comments
Versions
 
If you use these data, please add the following citation to your scholarly references. Why cite?
Original Publication
Results found in this publication can be replicated using these data.
Lev Klebanov, and Andrei Yakovlev. Forthcoming. "Diverse Correlation Structures in Microarray Gene Expression Data." Ann. Appl. Statist.
Data Citation Details
Study Global IDhdl:1902.1/10645
AuthorsLev Klebanov (Department of Probability and Statistics, Charles University, Sokolovska); Andrei Yakovlev (Department of Biostatistics and Computational Biology, University of Rochester)
DistributorInstitute for Mathematical Statistics Logo
Deposit DateOctober 01, 2007
Provenance
Abstract and Scope
Abstract

It is well-known that correlations in microarray data represent a serious nuisance deteriorating the performance of gene selection procedures. This paper is intended to demonstrate that the correlation structure of microarray data provides a rich source of useful information. We discuss distinct correlation substructures revealed in microarray gene expression data by an appropriate ordering of genes. These substructures include stochastic proportionality of expression signals in a large percentage of all gene pairs, negative correlations hidden in ordered gene triples, and a long sequence of weakly dependent random variables associated with ordered pairs of genes. The reported striking regularities are of general biological interest and they also have far-reaching implications for theory and practice of statistical methods of microarray data analysis. We illustrate the latter point with a method for testing differential expression of non-overlapping gene pairs. While designed for testing a different null hypothesis, this method provides an order of magnitude more accurate control of type 1 error rate compared to conventional methods of individual gene expression profiling. In addition, this method is robust to the technical noise. Quantitative inference of the correlation structure has the potential to extend the analysis of microarray data far beyond currently practiced methods.

Keywordscorrelation structure; gene expression; microarrays
Data Availability
Number of Files 6
Terms of Use
Dataverse Network Terms of Use
View Terms of Use [+]
IQSS Dataverse Network Terms and Conditions

By downloading these Materials, I agree to the following:

  1. I will not use the Materials to
    1. obtain information that could directly or indirectly identify subjects.
    2. produce links among the Distributor's datasets or among the Distributor's data and other datasets that could identify individuals or organizations.
    3. obtain information about, or further contact with, subjects known to me except where the use and/or release of such identifying information has no potential for constituting an unwarranted invasion of privacy and/or breach of confidentiality.
  2. I agree not to download any Materials where prohibited by applicable law.
  3. I agree not to use the Materials in any way prohibited by applicable law.
  4. I agree that any books, articles, conference papers, theses, dissertations, reports, or other publications that I create which employ data reference the bibliographic citation accompanying this data. These citations include the data authors, data identifier, and other information accord with the Recommended Standard (http://thedata.org/citation/standard) for social science data.
  5. THE DISTRIBUTOR MAKES NO WARRANTIES, EXPRESS OR IMPLIED, BY OPERATION OF LAW OR OTHERWISE, REGARDING OR RELATING TO THE DATASET
Other Information
NotesDATAPASS:TERMS:STANDARD:1.0 (STANDARD DEPOSIT TERMS 1.0) This study was deposited under the of the Data-PASS standard deposit terms. A copy of the usage agreement is included in the file section of this study.

"Replication data for: Diverse Correlation Structures in Microarray Gene Expression Data", hdl:1902.1/10645