Machine Learning Advanced Dynamic Omics Data Analysis for Precision Medicine

Bibliographic Details
Main Author: Zeng, Tao
Other Authors: Huang, Tao; Lu, Chuan
Format: eBook
Language: English
Published: Frontiers Media SA 2020
Collection: Directory of Open Access Books - Collection details see MPG.ReNa
Description
Summary: Precision medicine is being developed as a preventative, diagnostic and treatment tool to combat complex human diseases in a personalized manner. High-throughput technologies now generate dynamic 'omics data, including genetics, epigenetics and even metagenomics, yielding large temporal-spatial biological datasets that can be associated with the individual genotypes underlying progressive disease phenotypes. It is therefore necessary to investigate how to integrate these multi-scale 'omics datasets to distinguish novel individual-specific disease causes from conventional cohort-common disease causes. Currently, machine learning plays an important role in biological and biomedical research, especially in the analysis of big 'omics data.
However, in contrast to traditional big social data, 'omics datasets are currently always "small-sample-high-dimension", which causes severe problems in application and introduces new challenges: (1) Big 'omics datasets can be extremely unbalanced, because it is difficult to obtain enough positive samples of rare mutations or rare diseases; (2) A large number of machine learning models are "black boxes", which is sufficient for social applications. In biological or biomedical fields, however, knowledge of the molecular mechanisms underlying a disease or biological process is necessary to deepen our understanding; (3) The genotype-phenotype association is a "white clue" captured in conventional big data studies. Identifying "causality" rather than association would be more helpful for physicians and biologists, as it can be used to select an experimental target for future research.
Therefore, to simultaneously improve phenotype discrimination and genotype interpretability for complex diseases, it is necessary: to design and implement new machine learning technologies that integrate prior knowledge with new 'omics datasets, providing transferable learning methods by combining multiple sources of data; to develop new network-based theories and methods that balance the trade-off between accuracy and interpretability of machine learning in biomedical and biological domains; and to enhance causal inference on "small-sample-high-dimension" data to capture personalized causal relationships.
Item Description: Creative Commons (cc), https://creativecommons.org/licenses/by/4.0/
Physical Description: 1 electronic resource (393 p.)
ISBN: 978-2-88963-554-2
9782889635542