AnnData

AnnData object

Common extensions : .h5ad



FASTQ

A FASTQ file is a text-based format which stores both the nucleotides sequence of each read and its corresponding quality scores.

Common extensions : .fastq



Gene expression raw count matrix

A matrix of gene expression raw counts. Depending on resolution, it can be a gene x sample, gene x cell or gene x spot matrix. For single cell RNA-seq data outputs from 10X or Parse pipelines, row and column name could be in separated .tsv files, but in the same folder than the .mtx file.

Common extensions : .mtx, .tsv, .csv, .txt, .h5, .h5ad



Spatial positions

Text file indicating spatial positions of each cell/spot

Common extensions : .tsv



SpatialData

SpatialData object

Common extensions : zarr



annotated data matrices

Annotated data matrices are complex data classes designed to efficiently store and handle high-dimensional count matrices associated with their row/column annotations. They are particularly useful for the analysis of single-cell data, where they also centralize information on data transformations (normalization, dimension reduction, projections, etc.) that have been performed during the analysis process. Most downstream analysis functions and tools use them as input, whether in R (seurat or sce objects) or in Python (anndata object class).

Common extensions : .rds, .h5, .h5ad



beta matrix

beta matrix is a topics x genes matrix, indicating for each topic the expression of each overly dispersed genes

Common extensions : .rds, .txt, .xlsx



differential analysis table

Differential analysis table are common outputs of statistical comparison of biological molecules quantified at a particular omics level.

Common extensions : .csv, .tsv, .txt, .xlsx



theta matrix

theta matrix is a spots barcodes x topics proportion matrix, indicating for each spot the proportion of each topic

Common extensions : .rds, .txt, .xlsx