Fundamentals of Data Mining in Genomics and Proteomics

By doing so the book is following demands for a more statistical data mining approach to analyzing high-throughput data. Finally, by highlighting limitations and open issues Fundamentals of Data Mining in Genomics and Proteomics is intended to instigate critical thinking and avenues for new research...

Full description

Bibliographic Details
Other Authors: Dubitzky, Werner (Editor), Granzow, Martin (Editor), Berrar, Daniel P. (Editor)
Format: eBook
Language:English
Published: New York, NY Springer US 2007, 2007
Edition:1st ed. 2007
Subjects:
Online Access:
Collection: Springer eBooks 2005- - Collection details see MPG.ReNa
LEADER 04913nmm a2200445 u 4500
001 EB000355381
003 EBX01000000000000000208433
005 00000000000000.0
007 cr|||||||||||||||||||||
008 130626 ||| eng
020 |a 9780387475097 
100 1 |a Dubitzky, Werner  |e [editor] 
245 0 0 |a Fundamentals of Data Mining in Genomics and Proteomics  |h Elektronische Ressource  |c edited by Werner Dubitzky, Martin Granzow, Daniel P. Berrar 
250 |a 1st ed. 2007 
260 |a New York, NY  |b Springer US  |c 2007, 2007 
300 |a XXII, 281 p. 68 illus  |b online resource 
505 0 |a to Genomic and Proteomic Data Analysis -- Design Principles for Microarray Investigations -- Pre-Processing DNA Microarray Data -- Pre-Processing Mass Spectrometry Data -- Visualization in Genomics and Proteomics -- Clustering — Class Discovery in the Post-Genomic Era -- Feature Selection and Dimensionality Reduction in Genomics and Proteomics -- Resampling Strategies for Model Assessment and Selection -- Classification of Genomic and Proteomic Data Using Support Vector Machines -- Networks in Cell Biology -- Identifying Important Explanatory Variables for Time-Varying Outcomes -- Text Mining in Genomics and Proteomics 
653 |a Computer Appl. in Life Sciences 
653 |a Statistics, general 
653 |a Bioinformatics 
653 |a Computational biology  
653 |a Statistics  
653 |a Biochemistry, general 
653 |a Bioinformatics  
653 |a Biotechnology 
653 |a Proteomics 
653 |a Proteomics 
653 |a Bioinformatics 
653 |a Biotechnology 
653 |a Biochemistry 
700 1 |a Granzow, Martin  |e [editor] 
700 1 |a Berrar, Daniel P.  |e [editor] 
041 0 7 |a eng  |2 ISO 639-2 
989 |b Springer  |a Springer eBooks 2005- 
856 4 0 |u https://doi.org/10.1007/978-0-387-47509-7?nosfx=y  |x Verlag  |3 Volltext 
082 0 |a 570.285 
520 |a By doing so the book is following demands for a more statistical data mining approach to analyzing high-throughput data. Finally, by highlighting limitations and open issues Fundamentals of Data Mining in Genomics and Proteomics is intended to instigate critical thinking and avenues for new research in the field 
520 |a The caveats and pitfalls of the discussed methods are highlighted by addressing questions such as: What can go wrong? Under which circumstances can a particular method be applied and when should it not be used? What alternative methods exist? Extensive references to related material and resources are provided to assist readers in identifying and exploring additional information. The structure of this text mirrors the typical stages involved in deploying a data mining solution, spanning from data pre-processing to knowledge discovery to result post-processing. It is hoped that this will equip researchers and practitioners with a useful and practical framework to tackle their own data mining problems in genomics and proteomics. In contrast to some texts on machine learning and biological data analysis, a deliberate effort has been made to incorporate important statistical notions.  
520 |a More than ever before, research and development in genomics and proteomics depends on the analysis and interpretation of large amounts of data generated by high-throughput techniques. With the advance of computational systems biology, this situation will become even more manifest as scientists will generate truly large-scale data sets by simulating of biological systems and conducting synthetic experiments. To optimally exploit such data, life scientists need to understand the fundamental concepts and properties of the fast-growing arsenal of analytical techniques and methods from statistics and data mining. Typically, the relevant literature and products present these techniques in a form which is either very simplistic or highly mathematical, favoring formal rigor over conceptual clarity and practical relevance.  
520 |a Fundamentals of Data Mining in Genomics and Proteomics addresses these shortcomings by adopting an approach which focuses on fundamental concepts and practical applications. The book presents key analytical techniques used to analyze genomic and proteomic data by detailing their underlying principles, merits and limitations. An important goal of this text is to provide a highly intuitive and conceptual (as opposed to intricate mathematical) account of the discussed methodologies. This treatment will enable readers with interest in analysis of genomic and proteomic data to quickly learn and appreciate the essential properties of relevant data mining methodologies without recourse to advanced mathematics. To complement the conceptual discussions, the book draws upon the lessons learned from applying the presented techniques to concrete analysis problems in genomics and proteomics.