Data set name: GSE412


Original data set (GSE412)
Data set for Orange
Brief description:
The childhood ALL data set (GSE412) contains gene expression measurements for 110 childhood acute lymphoblastic leukemia samples. We induced models for two different classification problems on this data set. The first model tries to distinguish samples taken before treatment from samples taken after treatment, based on changes in gene expression and regardless of the type of treatment used.

Platform: Affymetrix GeneChip Human Genome U95 Version [1 or 2] Set HG-U95A

Diagnostic classes:
- before therapy (before Th): 50 examples (45.5%)
- after therapy (after Th): 60 examples (54.5%)
Number of genes: 8280
Number of samples: 110
Note: Of the 12625 probe sets originally measured, we removed those that were not called present (P) in at least one sample.
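The filtering step in the note can be sketched as follows. This is a minimal illustration, assuming that each probe set carries one Affymetrix detection call per sample ('P' present, 'M' marginal, 'A' absent) and that a probe set is kept when at least one of its calls is 'P'. The function name `filter_present` and the toy calls are hypothetical and are not taken from the data set.

```python
def filter_present(calls_by_probe):
    """Keep probe sets that are called present ('P') in at least one sample.

    calls_by_probe maps a probe set ID to its list of detection calls,
    one per sample ('P' = present, 'M' = marginal, 'A' = absent).
    """
    return {
        probe: calls
        for probe, calls in calls_by_probe.items()
        if any(call == "P" for call in calls)
    }


# Toy example with made-up calls for three samples (not real GSE412 values).
toy_calls = {
    "probe_1": ["A", "P", "A"],   # present once -> kept
    "probe_2": ["A", "A", "A"],   # never present -> removed
    "probe_3": ["M", "A", "M"],   # only marginal -> removed
}
kept = filter_present(toy_calls)
print(sorted(kept))
```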
Predictive accuracy with 10-fold cross validation (classifying using the best projection with eight attributes):
Classification accuracy: 92.73%
Area under curve (AUC): 0.960
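For readers who want to compute measures of this kind, the sketch below shows one way to obtain classification accuracy and AUC from held-out predictions. It uses made-up labels and scores and is not the evaluation code behind the figures above; the function names are hypothetical. AUC is computed here as the fraction of (positive, negative) pairs in which the positive example receives the higher score, with ties counted as one half.

```python
def classification_accuracy(true_labels, predicted_labels):
    """Fraction of examples whose predicted class equals the true class."""
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


def auc(true_labels, scores, positive):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    scores are the predicted probabilities of the positive class;
    tied pairs count as half a correctly ranked pair.
    """
    pos = [s for t, s in zip(true_labels, scores) if t == positive]
    neg = [s for t, s in zip(true_labels, scores) if t != positive]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Made-up held-out predictions for six examples (not GSE412 results).
truth = ["before", "before", "before", "after", "after", "after"]
score_after = [0.1, 0.4, 0.7, 0.6, 0.8, 0.9]   # P(class = "after")
predicted = ["after" if s >= 0.5 else "before" for s in score_after]

print(classification_accuracy(truth, predicted))
print(auc(truth, score_after, positive="after"))
```

As a side note on the arithmetic, a classification accuracy of 92.73% on 110 samples is consistent with 102 correctly classified examples, since 102/110 rounds to 92.73%.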
The following are the three best-ranked visualizations with eight, six and four attributes with respect to the visualization score, that is, the visualizations in which examples from different diagnostic classes are best separated:

Score: 98.95%
Genes:
38414_at: CDC20 cell division cycle 20 homolog (S. cerevisiae), CDC20
35590_s_at: gastric inhibitory polypeptide receptor, GIPR
34457_at: solute carrier family 30 (zinc transporter), member 3, SLC30A3
37226_at: BCL2/adenovirus E1B 19kDa interacting protein 1, BNIP1
33143_s_at: solute carrier family 16 (monocarboxylic acid transporters), member 3, SLC16A3
38464_at: glucosidase I, GCS1
838_s_at: ubiquitin-conjugating enzyme E2I (UBC9 homolog, yeast), UBE2I
32264_at: granzyme M (lymphocyte met-ase 1), GZMM
Score: 98.46%
Genes:
35590_s_at: gastric inhibitory polypeptide receptor, GIPR
37905_r_at: Guanine nucleotide binding protein-like 1, GNL1
38447_at: adrenergic, beta, receptor kinase 1, ADRBK1
838_s_at: ubiquitin-conjugating enzyme E2I (UBC9 homolog, yeast), UBE2I
36822_at: TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 68kDa, TAF15
36651_at: acid phosphatase 2, lysosomal, ACP2
Score: 97.90%
Genes:
37226_at: BCL2/adenovirus E1B 19kDa interacting protein 1, BNIP1
35590_s_at: gastric inhibitory polypeptide receptor, GIPR
838_s_at: ubiquitin-conjugating enzyme E2I (UBC9 homolog, yeast), UBE2I
38464_at: glucosidase I, GCS1
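
The attribute ranking section of this page identifies these visualizations as radviz projections. As a rough illustration of how such a projection places a sample, the sketch below implements the textbook radviz mapping: attribute anchors are spaced evenly on the unit circle, and each sample is drawn at the average of the anchor positions weighted by its attribute values, assumed here to be already scaled to [0, 1]. The function name and the toy values are hypothetical, and this is not the code used to produce the scores above.

```python
import math


def radviz_point(values):
    """Project one sample onto the plane with the standard radviz mapping.

    values are the sample's attribute values, already scaled to [0, 1];
    attribute i gets an anchor at angle 2*pi*i/n on the unit circle.
    """
    n = len(values)
    total = sum(values)
    if total == 0:
        return (0.0, 0.0)  # no pull from any anchor: place at the centre
    x = sum(v * math.cos(2 * math.pi * i / n) for i, v in enumerate(values)) / total
    y = sum(v * math.sin(2 * math.pi * i / n) for i, v in enumerate(values)) / total
    return (x, y)


# Toy samples with four attributes (made-up values).
print(radviz_point([1.0, 0.0, 0.0, 0.0]))   # lies on the first anchor
print(radviz_point([0.5, 0.5, 0.5, 0.5]))   # equal pull: near the centre
```

A sample dominated by one gene is pulled toward that gene's anchor, while a sample with similar values on all genes stays near the centre, which is why a good choice of genes can separate the two diagnostic classes visually.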

Attribute ranking

The following histogram shows how often each gene appears in the top 100 radviz visualizations with 8 attributes.

Genes:
838_s_at: ubiquitin-conjugating enzyme E2I (UBC9 homolog, yeast), UBE2I
35590_s_at: gastric inhibitory polypeptide receptor, GIPR
36822_at: TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 68kDa, TAF15
36161_at: adaptor-related protein complex 2, beta 1 subunit, AP2B1
725_i_at
37226_at: BCL2/adenovirus E1B 19kDa interacting protein 1, BNIP1
34457_at: solute carrier family 30 (zinc transporter), member 3, SLC30A3
38464_at: glucosidase I, GCS1
36651_at: acid phosphatase 2, lysosomal, ACP2
37905_r_at: Guanine nucleotide binding protein-like 1, GNL1
39420_at: DNA-damage-inducible transcript 3, DDIT3
33143_s_at: solute carrier family 16 (monocarboxylic acid transporters), member 3, SLC16A3
38447_at: adrenergic, beta, receptor kinase 1, ADRBK1
33069_f_at: UDP glucuronosyltransferase 2 family, polypeptide B15, UGT2B15
33870_at: jumonji domain containing 1B, JMJD1B
36223_at: Splicing factor proline/glutamine rich (polypyrimidine tract binding protein associated), SFPQ
39332_at: tubulin, beta polypeptide paralog, RP11-506K6.1
41117_s_at: Solute carrier family 9 (sodium/hydrogen exchanger), isoform 3 regulator 2, SLC9A3R2
32264_at: granzyme M (lymphocyte met-ase 1), GZMM
39994_at: chemokine (C-C motif) receptor 1, CCR1
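
A ranking of this kind can be obtained by counting how often each probe set occurs across the top-scoring projections. The sketch below does this with Python's `collections.Counter`, using only the three projections listed on this page (with eight, six and four attributes) rather than the top 100 eight-attribute projections, so the counts are illustrative and do not reproduce the histogram.

```python
from collections import Counter

# Probe sets of the three best-ranked projections listed on this page.
top_projections = [
    ["38414_at", "35590_s_at", "34457_at", "37226_at",
     "33143_s_at", "38464_at", "838_s_at", "32264_at"],
    ["35590_s_at", "37905_r_at", "38447_at", "838_s_at",
     "36822_at", "36651_at"],
    ["37226_at", "35590_s_at", "838_s_at", "38464_at"],
]

# Count in how many projections each probe set appears.
frequency = Counter(probe for projection in top_projections for probe in projection)

# Print a simple text histogram, most frequent probe sets first.
for probe, count in frequency.most_common():
    print(f"{probe:12s} {'#' * count}")
```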