Statistics for Data Science and Analytics
Home > Computing and Information Technology > Computer programming / software engineering > Programming and scripting languages: general > Statistics for Data Science and Analytics
Statistics for Data Science and Analytics

Statistics for Data Science and Analytics

|
     0     
5
4
3
2
1




International Edition


About the Book

Introductory statistics textbook with a focus on data science topics such as prediction, correlation, and data exploration Statistics for Data Science and Analytics is a comprehensive guide to statistical analysis using Python, presenting important topics useful for data science such as prediction, correlation, and data exploration. The authors provide an introduction to statistical science and big data, as well as an overview of Python data structures and operations. A range of statistical techniques are presented with their implementation in Python, including hypothesis testing, probability, exploratory data analysis, categorical variables, surveys and sampling, A/B testing, and correlation. The text introduces binary classification, a foundational element of machine learning, validation of statistical models by applying them to holdout data, and probability and inference via the easy-to-understand method of resampling and the bootstrap instead of using a myriad of “kitchen sink” formulas. Regression is taught both as a tool for explanation and for prediction. This book is informed by the authors’ experience designing and teaching both introductory statistics and machine learning at Statistics.com. Each chapter includes practical examples, explanations of the underlying concepts, and Python code snippets to help readers apply the techniques themselves. Statistics for Data Science and Analytics includes information on sample topics such as: Int, float, and string data types, numerical operations, manipulating strings, converting data types, and advanced data structures like lists, dictionaries, and sets Experiment design via randomizing, blinding, and before-after pairing, as well as proportions and percents when handling binary data Specialized Python packages like numpy, scipy, pandas, scikit-learn and statsmodels—the workhorses of data science—and how to get the most value from them Statistical versus practical significance, random number generators, functions for code reuse, and binomial and normal probability distributions Written by and for data science instructors, Statistics for Data Science and Analytics is an excellent learning resource for data science instructors prescribing a required intro stats course for their programs, as well as other students and professionals seeking to transition to the data science field.

Table of Contents:
About the Authors xvii Acknowledgments xix About the Companion Website xxi Introduction xxiii 1 Statistics and Data Science 1 1.1 Big Data: Predicting Pregnancy 2 1.2 Phantom Protection from Vitamin E 2 1.3 Statistician, Heal Thyself 3 1.4 Identifying Terrorists in Airports 4 1.5 Looking Ahead 5 1.6 Big Data and Statisticians 5 2 Designing and Carrying Out a Statistical Study 9 2.1 Statistical Science 9 2.2 Big Data 10 2.3 Data Science 10 2.4 Example: Hospital Errors 11 2.5 Experiment 12 2.6 Designing an Experiment 13 2.7 The Data 19 2.8 Variables and Their Flavors 21 2.9 Python: Data Structures and Operations 25 2.10 Are We Sure We Made a Difference? 34 2.11 Is Chance Responsible? The Foundation of Hypothesis Testing 34 2.12 Probability 36 2.13 Significance or Alpha Level 38 2.14 Other Kinds of Studies 40 2.15 When to Use Hypothesis Tests 42 2.16 Experiments Falling Short of the Gold Standard 42 2.17 Summary 43 2.18 Python: Iterations and Conditional Execution 44 2.19 Python: Numpy, scipy, and pandas—The Workhorses of Data Science 50 Exercises 56 3 Exploring and Displaying the Data 61 3.1 Exploratory Data Analysis 61 3.2 What to Measure—Central Location 62 3.3 What to Measure—Variability 65 3.4 What to Measure—Distance (Nearness) 69 3.5 Test Statistic 71 3.6 Examining and Displaying the Data 72 3.7 Python: Exploratory Data Analysis/Data Visualization 80 Exercises 88 4 Accounting for Chance—Statistical Inference 91 4.1 Avoid Being Fooled by Chance 91 4.2 The Null Hypothesis 92 4.3 Repeating the Experiment 93 4.4 Statistical Significance 99 4.5 Power 103 4.6 The Normal Distribution 103 4.7 Summary 105 4.8 Python: Random Numbers 105 Exercises 115 5 Probability 121 5.1 What Is Probability 121 5.2 Simple Probability 122 5.3 Probability Distributions 126 5.4 From Binomial to Normal Distribution 129 5.5 Appendix: Binomial Formula and Normal Approximation 133 5.6 Python: Probability 134 Exercises 141 6 Categorical Variables 143 6.1 Two-way Tables 143 6.2 Conditional Probability 144 6.3 Bayesian Estimates 147 6.4 Independence 150 6.5 Multiplication Rule 154 6.6 Simpson’s Paradox 156 6.7 Python: Counting and Contingency Tables 157 Exercises 163 7 Surveys and Sampling 167 7.1 Literary Digest—Sampling Trumps “All Data” 167 7.2 Simple Random Samples 170 7.3 Margin of Error: Sampling Distribution for a Proportion 172 7.4 Sampling Distribution for a Mean 174 7.5 The Bootstrap 176 7.6 Rationale for the Bootstrap 177 7.7 Standard Error 188 7.8 Other Sampling Methods 188 7.9 Absolute vs. Relative Sample Size 192 7.10 Python: Random Sampling Strategies 192 Exercises 202 8 More than Two Samples or Categories 207 8.1 Count Data—R × C Tables 207 8.2 The Role of Experiments (Many Are Costly) 208 8.3 Chi-Square Test 210 8.4 Single Sample—Goodness-of-Fit 215 8.5 Numeric Data: ANOVA 217 8.6 Components of Variance 222 8.7 Factorial Design 224 8.8 The Problem of Multiple Inference 226 8.9 Continuous Testing 228 8.10 Bandit Algorithms 229 8.11 Appendix: ANOVA, the Factor Diagram, and the F-Statistic 230 8.12 More than One Factor or Variable—From ANOVA to Statistical Models 237 8.13 Python: Contingency Tables and Chi-square Test 237 8.14 Python: ANOVA 241 Exercises 246 9 Correlation 249 9.1 Example: Delta Wire 249 9.2 Example: Cotton Dust and Lung Disease 251 9.3 The Vector Product Sum Test 252 9.4 Correlation Coefficient 256 9.5 Correlation is not Causation 260 9.6 Other Forms of Association 261 9.7 Python: Correlation 262 Exercises 269 10 Regression 271 10.1 Finding the Regression Line by Eye 272 10.2 Finding the Regression Line by Minimizing Residuals 274 10.3 Linear Relationships 276 10.4 Prediction vs. Explanation 280 10.5 Python: Linear Regression 284 Exercises 293 11 Multiple Linear Regression 295 11.1 Terminology 295 11.2 Example—Housing Prices 296 11.3 Interaction 301 11.4 Regression Assumptions 304 11.5 Assessing Explanatory Regression Models 306 11.6 Assessing Regression for Prediction 314 11.7 Python: Multiple Linear Regression 324 Exercises 332 12 Predicting Binary Outcomes 337 12.1 K-Nearest-Neighbors 337 12.2 Python: Classification 343 Exercises 346 Index 349


Best Sellers


Product Details
  • ISBN-13: 9781394253807
  • Publisher: John Wiley & Sons Inc
  • Publisher Imprint: John Wiley & Sons Inc
  • Language: English
  • Returnable: N
  • Returnable: N
  • ISBN-10: 139425380X
  • Publisher Date: 07 Aug 2024
  • Binding: Hardback
  • No of Pages: 384
  • Returnable: N
  • Weight: 725 gr


Similar Products

Add Photo
Add Photo

Customer Reviews

REVIEWS      0     
Click Here To Be The First to Review this Product
Statistics for Data Science and Analytics
John Wiley & Sons Inc -
Statistics for Data Science and Analytics
Writing guidlines
We want to publish your review, so please:
  • keep your review on the product. Review's that defame author's character will be rejected.
  • Keep your review focused on the product.
  • Avoid writing about customer service. contact us instead if you have issue requiring immediate attention.
  • Refrain from mentioning competitors or the specific price you paid for the product.
  • Do not include any personally identifiable information, such as full names.

Statistics for Data Science and Analytics

Required fields are marked with *

Review Title*
Review
    Add Photo Add up to 6 photos
    Would you recommend this product to a friend?
    Tag this Book Read more
    Does your review contain spoilers?
    What type of reader best describes you?
    I agree to the terms & conditions
    You may receive emails regarding this submission. Any emails will include the ability to opt-out of future communications.

    CUSTOMER RATINGS AND REVIEWS AND QUESTIONS AND ANSWERS TERMS OF USE

    These Terms of Use govern your conduct associated with the Customer Ratings and Reviews and/or Questions and Answers service offered by Bookswagon (the "CRR Service").


    By submitting any content to Bookswagon, you guarantee that:
    • You are the sole author and owner of the intellectual property rights in the content;
    • All "moral rights" that you may have in such content have been voluntarily waived by you;
    • All content that you post is accurate;
    • You are at least 13 years old;
    • Use of the content you supply does not violate these Terms of Use and will not cause injury to any person or entity.
    You further agree that you may not submit any content:
    • That is known by you to be false, inaccurate or misleading;
    • That infringes any third party's copyright, patent, trademark, trade secret or other proprietary rights or rights of publicity or privacy;
    • That violates any law, statute, ordinance or regulation (including, but not limited to, those governing, consumer protection, unfair competition, anti-discrimination or false advertising);
    • That is, or may reasonably be considered to be, defamatory, libelous, hateful, racially or religiously biased or offensive, unlawfully threatening or unlawfully harassing to any individual, partnership or corporation;
    • For which you were compensated or granted any consideration by any unapproved third party;
    • That includes any information that references other websites, addresses, email addresses, contact information or phone numbers;
    • That contains any computer viruses, worms or other potentially damaging computer programs or files.
    You agree to indemnify and hold Bookswagon (and its officers, directors, agents, subsidiaries, joint ventures, employees and third-party service providers, including but not limited to Bazaarvoice, Inc.), harmless from all claims, demands, and damages (actual and consequential) of every kind and nature, known and unknown including reasonable attorneys' fees, arising out of a breach of your representations and warranties set forth above, or your violation of any law or the rights of a third party.


    For any content that you submit, you grant Bookswagon a perpetual, irrevocable, royalty-free, transferable right and license to use, copy, modify, delete in its entirety, adapt, publish, translate, create derivative works from and/or sell, transfer, and/or distribute such content and/or incorporate such content into any form, medium or technology throughout the world without compensation to you. Additionally,  Bookswagon may transfer or share any personal information that you submit with its third-party service providers, including but not limited to Bazaarvoice, Inc. in accordance with  Privacy Policy


    All content that you submit may be used at Bookswagon's sole discretion. Bookswagon reserves the right to change, condense, withhold publication, remove or delete any content on Bookswagon's website that Bookswagon deems, in its sole discretion, to violate the content guidelines or any other provision of these Terms of Use.  Bookswagon does not guarantee that you will have any recourse through Bookswagon to edit or delete any content you have submitted. Ratings and written comments are generally posted within two to four business days. However, Bookswagon reserves the right to remove or to refuse to post any submission to the extent authorized by law. You acknowledge that you, not Bookswagon, are responsible for the contents of your submission. None of the content that you submit shall be subject to any obligation of confidence on the part of Bookswagon, its agents, subsidiaries, affiliates, partners or third party service providers (including but not limited to Bazaarvoice, Inc.)and their respective directors, officers and employees.

    Accept

    New Arrivals

    Inspired by your browsing history


    Your review has been submitted!

    You've already reviewed this product!