Data Mining Algorithms by Pawel Cichosz at Bookstore UAE
Book 1
Book 2
Book 3
Book 1
Book 2
Book 3
Book 1
Book 2
Book 3
Book 1
Book 2
Book 3
Home > Mathematics and Science Textbooks > Mathematics > Calculus and mathematical analysis > Numerical analysis > Data Mining Algorithms: Explained Using R
Data Mining Algorithms: Explained Using R

Data Mining Algorithms: Explained Using R


     0     
5
4
3
2
1



International Edition


X
About the Book

Data Mining Algorithms is a practical, technically-oriented guide to data mining algorithms that covers the most important algorithms for building classification, regression, and clustering models, as well as techniques used for attribute selection and transformation, model quality evaluation, and creating model ensembles. The author presents many of the important topics and methodologies widely used in data mining, whilst demonstrating the internal operation and usage of data mining algorithms using examples in R.



Table of Contents:

Acknowledgements xix

Preface xxi

References xxxi

Part I Preliminaries 1

1 Tasks 3

1.1 Introduction 3

1.2 Inductive learning tasks 5

1.3 Classification 9

1.4 Regression 14

1.5 Clustering 16

1.6 Practical issues 19

1.7 Conclusion 20

1.8 Further readings 21

References 22

2 Basic statistics 23

2.1 Introduction 23

2.2 Notational conventions 24

2.3 Basic statistics as modeling 24

2.4 Distribution description 25

2.5 Relationship detection 47

2.6 Visualization 62

2.7 Conclusion 65

2.8 Further readings 66

References 67

Part II Classification 69

3 Decision trees 71

3.1 Introduction 71

3.2 Decision tree model 72

3.3 Growing 76

3.4 Pruning 90

3.5 Prediction 103

3.6 Weighted instances 105

3.7 Missing value handling 106

3.8 Conclusion 114

3.9 Further readings 114

References 116

4 Naïve Bayes classifier 118

4.1 Introduction 118

4.2 Bayes rule 118

4.3 Classification by Bayesian inference 120

4.4 Practical issues 125

4.5 Conclusion 131

4.6 Further readings 131

References 132

5 Linear classification 134

5.1 Introduction 134

5.2 Linear representation 136

5.3 Parameter estimation 145

5.4 Discrete attributes 154

5.5 Conclusion 155

5.6 Further readings 156

References 157

6 Misclassification costs 159

6.1 Introduction 159

6.2 Cost representation 161

6.3 Incorporating misclassification costs 164

6.4 Effects of cost incorporation 176

6.5 Experimental procedure 180

6.6 Conclusion 184

6.7 Further readings 185

References 187

7 Classification model evaluation 189

7.1 Introduction 189

7.2 Performance measures 190

7.3 Evaluation procedures 213

7.4 Conclusion 231

7.5 Further readings 232

References 233

Part III Regression 235

8 Linear regression 237

8.1 Introduction 237

8.2 Linear representation 238

8.3 Parameter estimation 242

8.4 Discrete attributes 250

8.5 Advantages of linear models 251

8.6 Beyond linearity 252

8.7 Conclusion 258

8.8 Further readings 258

References 259

9 Regression trees 261

9.1 Introduction 261

9.2 Regression tree model 262

9.3 Growing 263

9.4 Pruning 274

9.5 Prediction 277

9.6 Weighted instances 278

9.7 Missing value handling 279

9.8 Piecewise linear regression 284

9.9 Conclusion 292

9.10 Further readings 292

References 293

10 Regression model evaluation 295

10.1 Introduction 295

10.2 Performance measures 296

10.3 Evaluation procedures 303

10.4 Conclusion 309

10.5 Further readings 309

References 310

Part IV Clustering 311

11 (Dis)similarity measures 313

11.1 Introduction 313

11.2 Measuring dissimilarity and similarity 313

11.3 Difference-based dissimilarity 314

11.4 Correlation-based similarity 321

11.5 Missing attribute values 324

11.6 Conclusion 325

11.7 Further readings 325

References 326

12 k-Centers clustering 328

12.1 Introduction 328

12.2 Algorithm scheme 330

12.3 k-Means 334

12.4 Beyond means 338

12.5 Beyond (fixed) k 342

12.6 Explicit cluster modeling 343

12.7 Conclusion 345

12.8 Further readings 345

References 347

13 Hierarchical clustering 349

13.1 Introduction 349

13.2 Cluster hierarchies 351

13.3 Agglomerative clustering 353

13.4 Divisive clustering 361

13.5 Hierarchical clustering visualization 364

13.6 Hierarchical clustering prediction 366

13.7 Conclusion 369

13.8 Further readings 370

References 371

14 Clustering model evaluation 373

14.1 Introduction 373

14.2 Per-cluster quality measures 376

14.3 Overall quality measures 385

14.4 External quality measures 393

14.5 Using quality measures 397

14.6 Conclusion 398

14.7 Further readings 398

References 399

Part V Getting Better Models 401

15 Model ensembles 403

15.1 Introduction 403

15.2 Model committees 404

15.3 Base models 406

15.4 Model aggregation 420

15.5 Specific ensemble modeling algorithms 431

15.6 Quality of ensemble predictions 448

15.7 Conclusion 449

15.8 Further readings 450

References 451

16 Kernel methods 454

16.1 Introduction 454

16.2 Support vector machines 457

16.3 Support vector regression 473

16.4 Kernel trick 482

16.5 Kernel functions 484

16.6 Kernel prediction 487

16.7 Kernel-based algorithms 489

16.8 Conclusion 494

16.9 Further readings 495

References 496

17 Attribute transformation 498

17.1 Introduction 498

17.2 Attribute transformation task 499

17.3 Simple transformations 504

17.4 Multiclass encoding 510

17.5 Conclusion 521

17.6 Further readings 521

References 522

18 Discretization 524

18.1 Introduction 524

18.2 Discretization task 525

18.3 Unsupervised discretization 530

18.4 Supervised discretization 533

18.5 Effects of discretization 551

18.6 Conclusion 553

18.7 Further readings 553

References 556

19 Attribute selection 558

19.1 Introduction 558

19.2 Attribute selection task 559

19.3 Attribute subset search 562

19.4 Attribute selection filters 568

19.5 Attribute selection wrappers 588

19.6 Effects of attribute selection 593

19.7 Conclusion 598

19.8 Further readings 599

References 600

20 Case studies 602

20.1 Introduction 602

20.2 Census income 605

20.3 Communities and crime 631

20.4 Cover type 640

20.5 Conclusion 654

20.6 Further readings 655

References 655

Closing 657

A Notation 659

A.1 Attribute values 659

A.2 Data subsets 659

A.3 Probabilities 660

B R packages 661

B.1 CRAN packages 661

B.2 DMR packages 662

B.3 Installing packages 663

References 664

C Datasets 666

Index 667



About the Author :

Pawel Cichosz, Department of Electronics and Information Technology, Warsaw University of Technology, Poland.


Best Sellers


Product Details
  • ISBN-13: 9781118332580
  • Publisher: John Wiley & Sons Inc
  • Publisher Imprint: John Wiley & Sons Inc
  • Height: 253 mm
  • No of Pages: 720
  • Returnable: N
  • Sub Title: Explained Using R
  • Width: 183 mm
  • ISBN-10: 111833258X
  • Publisher Date: 30 Jan 2015
  • Binding: Hardback
  • Language: English
  • Returnable: N
  • Spine Width: 38 mm
  • Weight: 1287 gr


Similar Products

Add Photo
Add Photo

Customer Reviews

REVIEWS      0     
Click Here To Be The First to Review this Product
Data Mining Algorithms: Explained Using R
John Wiley & Sons Inc -
Data Mining Algorithms: Explained Using R
Writing guidlines
We want to publish your review, so please:
  • keep your review on the product. Review's that defame author's character will be rejected.
  • Keep your review focused on the product.
  • Avoid writing about customer service. contact us instead if you have issue requiring immediate attention.
  • Refrain from mentioning competitors or the specific price you paid for the product.
  • Do not include any personally identifiable information, such as full names.

Data Mining Algorithms: Explained Using R

Required fields are marked with *

Review Title*
Review
    Add Photo Add up to 6 photos
    Would you recommend this product to a friend?
    Tag this Book Read more
    Does your review contain spoilers?
    What type of reader best describes you?
    I agree to the terms & conditions
    You may receive emails regarding this submission. Any emails will include the ability to opt-out of future communications.

    CUSTOMER RATINGS AND REVIEWS AND QUESTIONS AND ANSWERS TERMS OF USE

    These Terms of Use govern your conduct associated with the Customer Ratings and Reviews and/or Questions and Answers service offered by Bookswagon (the "CRR Service").


    By submitting any content to Bookswagon, you guarantee that:
    • You are the sole author and owner of the intellectual property rights in the content;
    • All "moral rights" that you may have in such content have been voluntarily waived by you;
    • All content that you post is accurate;
    • You are at least 13 years old;
    • Use of the content you supply does not violate these Terms of Use and will not cause injury to any person or entity.
    You further agree that you may not submit any content:
    • That is known by you to be false, inaccurate or misleading;
    • That infringes any third party's copyright, patent, trademark, trade secret or other proprietary rights or rights of publicity or privacy;
    • That violates any law, statute, ordinance or regulation (including, but not limited to, those governing, consumer protection, unfair competition, anti-discrimination or false advertising);
    • That is, or may reasonably be considered to be, defamatory, libelous, hateful, racially or religiously biased or offensive, unlawfully threatening or unlawfully harassing to any individual, partnership or corporation;
    • For which you were compensated or granted any consideration by any unapproved third party;
    • That includes any information that references other websites, addresses, email addresses, contact information or phone numbers;
    • That contains any computer viruses, worms or other potentially damaging computer programs or files.
    You agree to indemnify and hold Bookswagon (and its officers, directors, agents, subsidiaries, joint ventures, employees and third-party service providers, including but not limited to Bazaarvoice, Inc.), harmless from all claims, demands, and damages (actual and consequential) of every kind and nature, known and unknown including reasonable attorneys' fees, arising out of a breach of your representations and warranties set forth above, or your violation of any law or the rights of a third party.


    For any content that you submit, you grant Bookswagon a perpetual, irrevocable, royalty-free, transferable right and license to use, copy, modify, delete in its entirety, adapt, publish, translate, create derivative works from and/or sell, transfer, and/or distribute such content and/or incorporate such content into any form, medium or technology throughout the world without compensation to you. Additionally,  Bookswagon may transfer or share any personal information that you submit with its third-party service providers, including but not limited to Bazaarvoice, Inc. in accordance with  Privacy Policy


    All content that you submit may be used at Bookswagon's sole discretion. Bookswagon reserves the right to change, condense, withhold publication, remove or delete any content on Bookswagon's website that Bookswagon deems, in its sole discretion, to violate the content guidelines or any other provision of these Terms of Use.  Bookswagon does not guarantee that you will have any recourse through Bookswagon to edit or delete any content you have submitted. Ratings and written comments are generally posted within two to four business days. However, Bookswagon reserves the right to remove or to refuse to post any submission to the extent authorized by law. You acknowledge that you, not Bookswagon, are responsible for the contents of your submission. None of the content that you submit shall be subject to any obligation of confidence on the part of Bookswagon, its agents, subsidiaries, affiliates, partners or third party service providers (including but not limited to Bazaarvoice, Inc.)and their respective directors, officers and employees.

    Accept


    Inspired by your browsing history


    Your review has been submitted!

    You've already reviewed this product!