Statistical Learning for Biomedical Data (Practical Guides to Biostatistics and Epidemiology)

Statistical Learning for Biomedical Data (Practical Guides to Biostatistics and Epidemiology)

by JamesD.Malley (Author), KarenG.Malley (Author), SinisaPajevic (Author)

Synopsis

This book is for anyone who has biomedical data and needs to identify variables that predict an outcome, for two-group outcomes such as tumor/not-tumor, survival/death, or response from treatment. Statistical learning machines are ideally suited to these types of prediction problems, especially if the variables being studied may not meet the assumptions of traditional techniques. Learning machines come from the world of probability and computer science but are not yet widely used in biomedical research. This introduction brings learning machine techniques to the biomedical world in an accessible way, explaining the underlying principles in nontechnical language and using extensive examples and figures. The authors connect these new methods to familiar techniques by showing how to use the learning machine models to generate smaller, more easily interpretable traditional models. Coverage includes single decision trees, multiple-tree techniques such as Random Forests (TM), neural nets, support vector machines, nearest neighbors and boosting.

$54.86

Quantity

20+ in stock

More Information

Format: Paperback
Pages: 298
Edition: 1
Publisher: Cambridge University Press
Published: 24 Feb 2011

ISBN 10: 0521699096
ISBN 13: 9780521699099
Book Overview: This highly motivating introduction to statistical learning machines explains underlying principles in nontechnical language, using many examples and figures.

Media Reviews
'The book is well written and provides nice graphics and numerous applications.' Michael R. Chernick, Technometrics
Some important concepts are explained well including the difference between supervised and unsupervised learning, and describing when specific methods work well and when they don't. They also list twenty canonical questions and point the reader to the sections in the book where these questions are answered. They provide many important examples from biomedical research and illustrate the methods to solve these problems along with the pitfalls of some of them. ... Overall, I think this is a good reference source for biomedical researchers involved in data mining or classification, but the reader should beware of the arguments that are loosely explained. Michael R. Chernick, Significance
The book is well written and provides nice graphics and numerous applications... the book is good for its intended audience, the users of biomedical data. Michael R. Chernick, Technometrics
While biomedical applications of the statistical learning machines described in this book are becoming more apparent, they are not widely practiced. This book provides an excellent overview for the neophyte as to the nuts and bolts of certain statistical learning machines and the major issues involved in development and evaluation of specific machines. The authors display a nice dance between exuberance and caution; they do not attempt to advance any particular machine learning approach, instead emphasizing the processes and the need to use several approaches depending on data context. Wendy J. Mack, University of Southern California, Los Angeles for American Journal of Epidemiology
Author Bio
James D. Malley is a Research Mathematical Statistician in the Mathematical and Statistical Computing Laboratory, Division of Computational Bioscience, Center for Information Technology, at the National Institutes of Health. Karen G. Malley is president of Malley Research Programming, Inc. in Rockville, Maryland, providing statistical programming services to the pharmaceutical industry and the National Institutes of Health. She also serves on the global council of the Clinical Data Interchange Standards Consortium (CDISC) user network, and the steering committee of the Washington, DC area CDISC user network. Sinisa Pajevic is a Staff Scientist in the Mathematical and Statistical Computing Laboratory, Division of Computational Bioscience, Center for Information Technology, at the National Institutes of Health.