Data Formats

SEA accepts three types of data files: expression, covariates and annotation. All data files must be tab-delimited. Avoid unusual symbols (such as %, &, /, #, etc.) and quotation marks (“ ”) in labels and gene names. Spaces are also not recommended.
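
The naming advice above can be checked before uploading a file. The following is a minimal sketch, not SEA's own validation: the function name `label_problems` is hypothetical, and the forbidden set contains only the characters named in the text plus straight and curly double quotes, since the original list ends in "etc".

```python
# Characters named explicitly in the text, plus straight and curly double quotes.
# ASSUMPTION: the source list is open-ended ("etc"), so this set is illustrative only.
FORBIDDEN = set('%&/#"“”')


def label_problems(label):
    """Return a list of human-readable problems found in one label or gene name."""
    problems = []
    bad = sorted(set(label) & FORBIDDEN)
    if bad:
        problems.append("forbidden characters: " + " ".join(bad))
    if any(ch.isspace() for ch in label):
        problems.append("contains whitespace")
    return problems
```

For example, `label_problems("gene_1")` returns an empty list, while a label such as `"p53 #2"` is reported for both the `#` character and the space.
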
We next explain each data file in detail:

  • Expression data: a txt file with expression data, with genes in rows and arrays in columns. The file must contain an additional row with sample names and an additional column with gene names. Column names should not start with a number and should not contain unusual characters.


  • Covariates: a txt file with experimental design information, containing as many columns as arrays and as many rows as experimental factors. Each cell contains the level of the corresponding experimental factor for that array.


Experimental factors must always have more than one level, i.e. two or more time points, two or more treatments, etc. A factor with a single level cannot be treated as an experimental factor and should not be included in the covariates file.
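
The rule about factor levels is easy to verify before building the covariates file. The sketch below assumes the design has already been read into a dictionary that maps each factor name to its list of per-array values. The function name `single_level_factors` and the sample factors are hypothetical and not part of SEA.

```python
def single_level_factors(design):
    """Return the names of factors that take fewer than two distinct levels.

    `design` maps a factor name to the list of values it takes across arrays.
    """
    return [name for name, values in design.items() if len(set(values)) < 2]


# Illustrative design with four arrays (made-up values).
design = {
    "time": ["0h", "0h", "6h", "6h"],
    "treatment": ["control", "drug", "control", "drug"],
    "batch": ["A", "A", "A", "A"],  # a single level: should be left out of the file
}

print(single_level_factors(design))  # ['batch']
```

Any factor reported by this check would need to be dropped from the covariates file.
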

  • Annotation: a txt file with the functional annotation of genes. It has two columns: gene and annotation. If a gene has more than one annotation label, each label goes in a separate row.
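
To illustrate the one-label-per-row layout of the annotation file, here is a minimal sketch that groups the rows by gene. It assumes a tab-delimited file with exactly two columns and no header row, which the text does not state explicitly. The function name `read_annotation`, the gene names and the labels are made up for the example.

```python
import csv
import io
from collections import defaultdict


def read_annotation(handle):
    """Group a two-column, tab-delimited annotation file by gene.

    ASSUMPTION: no header row; column 1 is the gene, column 2 is one label.
    """
    annotations = defaultdict(list)
    for line_number, row in enumerate(csv.reader(handle, delimiter="\t"), start=1):
        if not row:
            continue  # skip blank lines
        if len(row) != 2:
            raise ValueError(f"line {line_number}: expected 2 columns, got {len(row)}")
        gene, label = row
        annotations[gene].append(label)
    return dict(annotations)


# Illustrative content: geneA has two labels, so it appears in two rows.
sample = "geneA\tapoptosis\ngeneA\tcell_cycle\ngeneB\ttransport\n"
result = read_annotation(io.StringIO(sample))
print(result)
```

A gene with several labels ends up with a list of all of them, in the order the rows appear in the file.
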


For data file examples, go to the Worked Examples menu.

formats.txt · Last modified: 2011/08/31 09:51 by aconesa