Building statistical models in Python develop useful models for regression, classification, time series, and survival analysis

The ability to proficiently perform statistical modeling is a fundamental skill for data scientists and essential for businesses reliant on data insights. Building Statistical Models with Python is a comprehensive guide that will empower you to leverage mathematical and statistical principles in dat...

Full description

Bibliographic Details
Main Authors: Nguyen, Huy Hoang, Adams, Paul N. (Author), Miller, Stuart J. (Author)
Format: eBook
Language:English
Published: Birmingham, UK Packt Publishing Ltd. 2023
Edition:1st edition
Subjects:
Online Access:
Collection: O'Reilly - Collection details see MPG.ReNa
LEADER 05732nmm a2200469 u 4500
001 EB002180811
003 EBX01000000000000001318298
005 00000000000000.0
007 cr|||||||||||||||||||||
008 231010 ||| eng
050 4 |a QA276 
100 1 |a Nguyen, Huy Hoang 
245 0 0 |a Building statistical models in Python  |b develop useful models for regression, classification, time series, and survival analysis  |c Huy Hoang Nguyen, Paul N. Adams, Stuart J. Miller 
250 |a 1st edition 
260 |a Birmingham, UK  |b Packt Publishing Ltd.  |c 2023 
300 |a 420 pages  |b illustrations 
505 0 |a Includes bibliographical references and index 
505 0 |a Visualizing data types -- Measuring and describing distributions -- Measuring central tendency -- Measuring variability -- Measuring shape -- The normal distribution and central limit theorem -- The Central Limit Theorem -- Bootstrapping -- Confidence intervals -- Standard error -- Correlation coefficients (Pearson's correlation) -- Permutations -- Permutations and combinations -- Permutation testing -- Transformations -- Summary -- References -- Chapter 3: Hypothesis Testing -- The goal of hypothesis testing -- Overview of a hypothesis test for the mean -- Scope of inference 
505 0 |a Spearman's rank correlation coefficient -- Summary -- Part 2: Regression Models -- Chapter 6: Simple Linear Regression -- Simple linear regression using OLS -- Coefficients of correlation and determination -- Coefficients of correlation -- Coefficients of determination -- Required model assumptions -- A linear relationship between the variables -- Normality of the residuals -- Homoscedasticity of the residuals -- Sample independence -- Testing for significance and validating models -- Model validation -- Summary -- Chapter 7: Multiple Linear Regression -- Multiple linear regression 
505 0 |a Cover -- Copyright -- Contributors -- Table of Contents -- Preface -- Part 1: Introduction to Statistics -- Chapter 1: Sampling and Generalization -- Software and environment setup -- Population versus sample -- Population inference from samples -- Randomized experiments -- Observational study -- Sampling strategies -- random, systematic, stratified, and clustering -- Probability sampling -- Non-probability sampling -- Summary -- Chapter 2: Distributions of Data -- Technical requirements -- Understanding data types -- Nominal data -- Ordinal data -- Interval data -- Ratio data 
505 0 |a Tests with more than two groups and ANOVA -- Multiple tests for significance -- ANOVA -- Pearson's correlation coefficient -- Power analysis examples -- Summary -- References -- Chapter 5: Non-Parametric Tests -- When parametric test assumptions are violated -- Permutation tests -- The Rank-Sum test -- The test statistic procedure -- Normal approximation -- Rank-Sum example -- The Signed-Rank test -- The Kruskal-Wallis test -- Chi-square distribution -- Chi-square goodness-of-fit -- Chi-square test of independence -- Chi-square goodness-of-fit test power analysis 
505 0 |a Hypothesis test steps -- Type I and Type II errors -- Type I errors -- Type II errors -- Basics of the z-test -- the z-score, z-statistic, critical values, and p-values -- The z-score and z-statistic -- A z-test for means -- z-test for proportions -- Power analysis for a two-population pooled z-test -- Summary -- Chapter 4: Parametric Tests -- Assumptions of parametric tests -- Normally distributed population data -- Equal population variance -- T-test -- a parametric hypothesis test -- T-test for means -- Two-sample t-test -- pooled t-test -- Two-sample t-test -- Welch's t-test -- Paired t-test 
653 |a statistics / aat 
653 |a Mathematical models / http://id.loc.gov/authorities/subjects/sh85082124 
653 |a Statistics / http://id.loc.gov/authorities/subjects/sh85127580 
653 |a Python (Computer program language) / http://id.loc.gov/authorities/subjects/sh96008834 
653 |a Python (Langage de programmation) 
653 |a Mathematical statistics / fast / (OCoLC)fst01012127 
653 |a Statistique 
653 |a Python (Computer program language) / fast / (OCoLC)fst01084736 
653 |a Statistics / fast / (OCoLC)fst01132103 
653 |a Mathematical models / fast / (OCoLC)fst01012085 
653 |a Mathematical statistics / http://id.loc.gov/authorities/subjects/sh85082133 
700 1 |a Adams, Paul N.  |e author 
700 1 |a Miller, Stuart J.  |e author 
041 0 7 |a eng  |2 ISO 639-2 
989 |b OREILLY  |a O'Reilly 
776 |z 9781804614280 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781804614280/?ar  |x Verlag  |3 Volltext 
082 0 |a 519.5 
082 0 |a 519.5 
520 |a The ability to proficiently perform statistical modeling is a fundamental skill for data scientists and essential for businesses reliant on data insights. Building Statistical Models with Python is a comprehensive guide that will empower you to leverage mathematical and statistical principles in data assessment, understanding, and inference generation. This book not only equips you with skills to navigate the complexities of statistical modeling, but also provides practical guidance for immediate implementation through illustrative examples. Through emphasis on application and code examples, you’ll understand the concepts while gaining hands-on experience. With the help of Python and its essential libraries, you’ll explore key statistical models, including hypothesis testing, regression, time series analysis, classification, and more. By the end of this book, you’ll gain fluency in statistical modeling while harnessing the full potential of Python's rich ecosystem for data analysis