This thesis covers a variety of deep learning approaches with regularization, style-transfer generation, and residuals applied to biomedical and material sciences. The first study introduces a novel deep neural network architecture, deep learning-base...
This thesis covers a variety of deep learning approaches with regularization, style-transfer generation, and residuals applied to biomedical and material sciences. The first study introduces a novel deep neural network architecture, deep learning-based cancer metastasis estimator (DeepCME), designed to predict breast cancer metastasis using gene expression. However, the problem of overfitting occurs frequently while training deep learning models using gene expression data because they contain a large number of genes, and the sample size is rather small. To address this problem, several regularization methods are implemented, such as L1 penalty, batch normalization, and dropout. DeepCME demonstrates superior performance, achieving the highest average area under curve scores compared with five baseline models. Additionally, significant genes associated with breast cancer metastasis are identified, highlighting the potential for cost-effective and time-efficient clinical applications. The second study introduces the combined transformation GAN (ctGAN), a style-transfer generative model designed to address existing challenges in the implementation of deep learning models for analyzing gene expression. Previous models have limited applicability for clinical purposes, but ctGAN enables the combined transformation of gene expression and survival data and improves survival analysis by augmenting data through style transformations. ctGAN demonstrates high plausibility in data generation based on distribution and clustering evaluations. The proposed method may enable predictions regarding the likelihood of a patient with breast cancer developing other types of cancer and responding differently to various treatment methods. The third study focuses on estimating Single-Parabolic Band (SPB) parameters in thermoelectric materials using residual-based ensemble model. Estimating SPB parameters is significant for understanding the band structure and predicting the theoretical performance of thermoelectric materials. However, the derivation of these band parameters involves complex calculations, including Fermi integrals, posing substantial challenges due to their computational complexity. To address this problem, this study proposes DeepSPB which is an ensemble model with deep neural network and machine learning model. Deep neural networks are trained to predict band parameters, including density of states effective mass, nondegenerate mobility, and reduced Fermi level. After that, an additional machine learning model leveraging residuals was implemented to adjust predictions by estimating error rates, effectively reducing the prediction error. The performance of the models was validated using pseudo data and experimental data for ZnSb decorated with Ag and Cu.
The three methods described in this paper demonstrate considerable potential for application in various industries. The first two studies could improve metastatic cancer prediction and support cancer treatment and prevention in the medical field, benefiting public healthcare. The final study could facilitate material optimization in the semiconductor industry by analyzing band parameter variations with doping concentrations, reducing time and costs.