Data description


The datasets used in the course are described here. Each one can be loaded in R with:

mydata <- read.table("datafile.csv",header=TRUE,sep=";")
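As a quick check after loading, it can help to look at the column types and the first few rows. The sketch below writes a small made-up file first so that it runs on its own; the column names id, age and group are hypothetical and do not come from any of the course datasets.

```r
# Hypothetical toy file standing in for one of the course datasets.
tmp <- tempfile(fileext = ".csv")
writeLines(c("id;age;group", "1;34;A", "2;51;B", "3;47;A"), tmp)

# Same call as above: the first line holds the column names,
# and fields are separated by semicolons.
mydata <- read.table(tmp, header = TRUE, sep = ";")

str(mydata)      # type of each column
head(mydata)     # first rows
summary(mydata)  # per-column summaries
```

If a numeric column comes back as character, the file may use a decimal comma; in that case adding dec="," to the read.table call is worth trying.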

bosson

The spreadsheet (courtesy of Professor Jean-Luc Bosson) contains aneurysm data from 209 patients coming from France or Vietnam. The variables are:

dimer

The spreadsheet (courtesy of Professor Jean-Luc Bosson) contains log-dimer data for 530 patients, grouped into 3 age classes. The variables are:

feretti

The spreadsheet (courtesy of Professor Gilbert Ferretti) contains data from 43 patients with a lung tumor. The variables are:

fires

Data on 270 forest fires in Portugal, from P. Cortez and A. Morais. The variables are:

LenzT, LenzI

Lenz et al.'s transcriptome and clinical data on 414 patients with diffuse large B cell lymphoma (GSE10846). LenzT is a transcriptome data matrix over 17290 protein coding genes. LenzI contains 10 variables of clinical information for the same patients.

LenzT is a numeric matrix of size 414 x 17290. The columns are named by gene symbols, the rows by GSM numbers.

LenzI is a matrix of size 414 x 10; its rows are named by the same GSM numbers as LenzT. The columns are:

The transcriptome matrix LenzT has been transformed using the function rename, so that its gene symbols match those of dicoH.
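Since LenzT and LenzI share their row names, it is prudent to confirm that the rows are in the same order before combining expression values with clinical variables. The sketch below uses small made-up matrices in place of the real ones; the GSM identifiers, gene names and the columns sex and age are hypothetical.

```r
# Toy stand-ins for LenzT and LenzI (the real objects are 414 x 17290 and 414 x 10).
ids <- c("GSM1", "GSM2", "GSM3")  # made-up sample identifiers
LenzT_toy <- matrix(rnorm(6), nrow = 3,
                    dimnames = list(ids, c("GENE1", "GENE2")))
# Clinical rows are deliberately stored in reverse order.
LenzI_toy <- matrix(c("F", "M", "F", "60", "45", "71"), nrow = 3,
                    dimnames = list(rev(ids), c("sex", "age")))

# Do both objects list the same samples in the same order?
same_order <- identical(rownames(LenzT_toy), rownames(LenzI_toy))

# If not, reorder the clinical rows to follow the transcriptome rows.
idx <- match(rownames(LenzT_toy), rownames(LenzI_toy))
stopifnot(!anyNA(idx))  # every sample must be present in both
LenzI_aligned <- LenzI_toy[idx, , drop = FALSE]
```

With the real data, the same identical() test applied to rownames(LenzT) and rownames(LenzI) shows whether any reordering is needed at all.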

http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE10846

References

Lenz G, Wright G, Dave SS, Xiao W et al. Stromal gene signatures in large-B-cell lymphomas. N Engl J Med 2008 Nov 27;359(22):2313-23.

Cardesa-Salzmann TM, Colomo L, Gutierrez G, Chan WC et al. High microvessel density determines a poor outcome in patients with diffuse large B-cell lymphoma treated with rituximab plus chemotherapy. Haematologica 2011 Jul;96(7):996-1001.

tauber

The spreadsheet (courtesy of Professor Maïté Tauber) contains heights and weights of 2891 children from 4 to 7 years old. The variables are:

12/9/2018