Revision as of 03:37, 14 April 2011 edit Jonkerz (talk \| contribs) Extended confirmed users 53,201 edits →External links: {{Recategorize\|date=April 2011}} ← Previous edit		Revision as of 01:55, 22 April 2011 edit undo Snotbot (talk \| contribs) 98,645 edits m Fixing improperly nested section headings (task 5) Next edit →
Line 33: The data processing procedures are based on reference methods from the literature or on innovative internal developments. Implemented with the open source statistical software R, they follow a set of specifications which facilitate collaborative work and tracking. ===== Pre-processing ===== The data sent by the hybridization platforms are pre-processed according to a normalization and quality control stage adapted to each technology: background correction, quality control, filtering, aggregation and normalization. For genomic data (CGH, SNPs), an essential segmentation step is added to identify the altered regions along the genome. ===== Data analysis ===== The data analysis unfolds into three main stages: * Class discovery, using unsupervised clustering, enables the identification of the underlying molecular groups. The quality and variety of the supplied annotations are crucial to interpret the resulting classification. Line 42: * Class prediction, using classification approaches, establishes the smallest combinations of molecular markers to characterize tumor groups and to guide decisions about medical treatments. ===== Interpretation and Validation ===== Results are interpreted through additional bioinformatics analysis (pathway analysis, combined genome and transcriptome study), and then validated against independent datasets from the literature or from the CIT program. Finally, a validation of the results is carried out with RT-PCR on a microfluidic platform.

CIT Program Tumor Identity Cards: Difference between revisions