Microarray technology provides large-scale gene expression profiles according to experimental conditions such as quantitative values. The transcription of genes are triggered or inhibited by complex biological interactions and associations. To underst...
Microarray technology provides large-scale gene expression profiles according to experimental conditions such as quantitative values. The transcription of genes are triggered or inhibited by complex biological interactions and associations. To understand not only the function and the role of a gene in a cell but also the mechanism of cellular phenomena, we need to capture snapshots of cellular process using microarray technology. Nowadays, microarray technology is expected to contribute to filling the ultimate purpose of the total study in bioinformatics. Microarray technology allows the monitoring of expression levels for thousands of genes simultaneously. This novel technology helps us to understand gene regulation as well as a gene by gene interactions more systematically.
In the microarray experiment, however, many undesirable systematic variations are observed. Normalization is the process of removing some sources of variation which affect the measured gene expression levels. Normalization plays an important role in the earlier stage of microarray data analysis. The subsequent analysis results are highly dependent on normalization.
Microarray, based on its gene expression data information, has been applicable to the field of cancer diagnosis with the computer-aided classification and prediction technology. Current clinical practice involves an experienced hematopathologist's interpretation of the tumor's morphology. In this way, however, classification remains imperfect and errors do occur. So, it has been suggested that such microarrays could provide a tool for correct cancer classification. But from a datamining-based point of view, there is one difficulty in microarray data analysis that the number of samples is very small while the number of attributes(i.e., genes) is very large. Therefore, to classify the subtypes of cancer correctly using current microarray technology, we should select the informative genes whose expression pattern was strongly correlated with the class distinction to be predicted.
Independently separated informative genes can contribute to inspiring the study of medical cure after the correct classification by them. These informative genes list data suggest that genes useful for cancer class prediction may also provide insight into cancer pathogenesis and pharmacology. So for this application, we have to separate this informative genes list independently. And that list should be consistent, trusted, and strongly correlated with the class distinction to be predicted.
In this paper, the system that can create the separated informative genes list after Lowess normalization is proposed. The effectiveness of these system and method was evaluated through some experiments. In the experimental results using MLP(Multi-Layer Perceptron)-based classifier system, it was found that the proposed system and the suggested combination method create an independently separated informative genes list with consistency, trust, and strong correlation with the class distinction for microarray data.
Therefore, the proposed system and the suggested combination method in this paper are expected to contribute to providing insight into cancer pathogenesis and pharmacology as well as correct classification of cancer types.