Co-Clustering
Co-Clustering: Models, Algorithms and Applications

About the Book

Cluster and co-cluster analyses are important tools in a variety of scientific areas. The introduction of this book presents the state of the art of both well-established and more recent co-clustering methods. The authors mainly deal with two-mode partitioning under different approaches, but pay particular attention to a probabilistic approach.

Chapter 1 concerns clustering in general and model-based clustering in particular. The authors briefly review the classical clustering methods and focus on the mixture model. They present and discuss the use of different mixtures adapted to different types of data. The algorithms used are described, and links with various classical methods are presented and commented upon. This chapter is useful in tackling the problem of co-clustering under the mixture approach.

Chapter 2 is devoted to the latent block model proposed in the mixture approach context. The authors discuss this model in detail and explain its relevance to co-clustering. Various algorithms are presented in a general context.

Chapter 3 focuses on binary and categorical data. It presents, in detail, the appropriate latent block mixture models. Variants of these models and algorithms are presented and illustrated using examples.

Chapter 4 focuses on contingency data. Mutual information, phi-squared and model-based co-clustering are studied. Models, algorithms and connections among different approaches are described and illustrated.

Chapter 5 presents the case of continuous data. The approaches used in the previous chapters are extended to this situation in the same way.

Contents
1. Cluster Analysis.
2. Model-Based Co-Clustering.
3. Co-Clustering of Binary and Categorical Data.
4. Co-Clustering of Contingency Tables.
5. Co-Clustering of Continuous Data.

About the Authors
Gérard Govaert is Professor at the University of Technology of Compiègne, France. He is also a member of the CNRS Laboratory Heudiasyc (Heuristic and diagnostic of complex systems). His research interests include latent structure modeling, model selection, model-based cluster analysis, block clustering and statistical pattern recognition. He is one of the authors of the MIXMOD (MIXture MODelling) software.

Mohamed Nadif is Professor at the University of Paris-Descartes, France, where he is a member of LIPADE (Paris Descartes computer science laboratory) in the Mathematics and Computer Science department. His research interests include machine learning, data mining, model-based cluster analysis, co-clustering, factorization and data analysis.

Cluster analysis is an important tool in a variety of scientific areas. Chapter 1 briefly presents the state of the art of well-established as well as more recent methods. The hierarchical, partitioning and fuzzy approaches are discussed, among others. The authors review the difficulties these classical methods face with high dimensionality, sparsity and scalability.

Chapter 2 discusses the value of co-clustering, presenting different approaches and defining a co-cluster. The authors focus on co-clustering as a simultaneous clustering and discuss the cases of binary, continuous and co-occurrence data. The criteria and algorithms are described and illustrated on simulated and real data.

Chapter 3 treats co-clustering from a model-based perspective. A latent block model is defined for different kinds of data. The estimation of parameters and co-clustering is tackled under two approaches: maximum likelihood and classification maximum likelihood. Hard and soft algorithms are described and applied to simulated and real data.

Chapter 4 considers co-clustering as a matrix approximation. The trifactorization approach is considered and algorithms based on update rules are described. Links with numerical and probabilistic approaches are established. A combination of algorithms is proposed and evaluated on simulated and real data.

Chapter 5 considers co-clustering, or bi-clustering, as the search for co-clusters that are coherent in biological terms, or as the extraction of co-clusters under conditions. Classical algorithms are described and evaluated on simulated and real data. Different indices for evaluating the quality of co-clusters are noted and used in numerical experiments.

Table of Contents:
Acknowledgment
Introduction
  I.1. Types and representation of data
    I.1.1. Binary data
    I.1.2. Categorical data
    I.1.3. Continuous data
    I.1.4. Contingency table
    I.1.5. Data representations
  I.2. Simultaneous analysis
    I.2.1. Data analysis
    I.2.2. Co-clustering
    I.2.3. Applications
  I.3. Notation
  I.4. Different approaches
    I.4.1. Two-mode partitioning
    I.4.2. Two-mode hierarchical clustering
    I.4.3. Direct or block clustering
    I.4.4. Biclustering
    I.4.5. Other structures and other aims
  I.5. Model-based co-clustering
  I.6. Outline
Chapter 1. Cluster Analysis
  1.1. Introduction
  1.2. Miscellaneous clustering methods
    1.2.1. Hierarchical approach
    1.2.2. The k-means algorithm
    1.2.3. Other approaches
  1.3. Model-based clustering and the mixture model
  1.4. EM algorithm
    1.4.1. Complete data and complete-data likelihood
    1.4.2. Principle
    1.4.3. Application to mixture models
    1.4.4. Properties
    1.4.5. EM: an alternating optimization algorithm
  1.5. Clustering and the mixture model
    1.5.1. The two approaches
    1.5.2. Classification likelihood
    1.5.3. The CEM algorithm
    1.5.4. Comparison of the two approaches
    1.5.5. Fuzzy clustering
  1.6. Gaussian mixture model
    1.6.1. The model
    1.6.2. CEM algorithm
    1.6.3. Spherical form, identical proportions and volumes
    1.6.4. Spherical form, identical proportions but differing volumes
    1.6.5. Identical covariance matrices and proportions
  1.7. Binary data
    1.7.1. Binary mixture model
    1.7.2. Parsimonious model
    1.7.3. Examples of application
  1.8. Categorical variables
    1.8.1. Multinomial mixture model
    1.8.2. Parsimonious model
  1.9. Contingency tables
    1.9.1. MNDKI2 algorithm
    1.9.2. Model-based approach
    1.9.3. Illustration
  1.10. Implementation
    1.10.1. Choice of model and of the number of classes
    1.10.2. Strategies for use
    1.10.3. Extension to particular situations
  1.11. Conclusion
Chapter 2. Model-Based Co-Clustering
  2.1. Metric approach
  2.2. Probabilistic models
  2.3. Latent block model
    2.3.1. Definition
    2.3.2. Link with the mixture model
    2.3.3. Log-likelihoods
    2.3.4. A complex model
  2.4. Maximum likelihood estimation and algorithms
    2.4.1. Variational EM approach
    2.4.2. Classification EM approach
    2.4.3. Stochastic EM-Gibbs approach
  2.5. Bayesian approach
  2.6. Conclusion and miscellaneous developments
Chapter 3. Co-Clustering of Binary and Categorical Data
  3.1. Example and notation
  3.2. Metric approach
  3.3. Bernoulli latent block model and algorithms
    3.3.1. The model
    3.3.2. Model identifiability
    3.3.3. Binary LBVEM and LBCEM algorithms
  3.4. Parsimonious Bernoulli LBMs
  3.5. Categorical data
  3.6. Bayesian inference
  3.7. Model selection
    3.7.1. The integrated completed log-likelihood (ICL)
    3.7.2. Penalized information criteria
  3.8. Illustrative experiments
    3.8.1. Townships
    3.8.2. Mero
  3.9. Conclusion
Chapter 4. Co-Clustering of Contingency Tables
  4.1. Measures of association
    4.1.1. Phi-squared coefficient
    4.1.2. Mutual information
  4.2. Contingency table associated with a couple of partitions
    4.2.1. Associated distributions
    4.2.2. Associated measures of association
  4.3. Co-clustering of contingency table
    4.3.1. Two equivalent approaches
    4.3.2. Parameter modification of criteria
    4.3.3. Co-clustering with the phi-squared coefficient
    4.3.4. Co-clustering with the mutual information
  4.4. Model-based co-clustering
    4.4.1. Block model for contingency tables
    4.4.2. Poisson latent block model
    4.4.3. Poisson LBVEM and LBCEM algorithms
  4.5. Comparison of all algorithms
    4.5.1. CROKI2 versus CROINFO
    4.5.2. CROINFO versus Poisson LBCEM
    4.5.3. Poisson LBVEM versus Poisson LBCEM
    4.5.4. Behavior of CROKI2, CROINFO, LBCEM and LBVEM
  4.6. Conclusion
Chapter 5. Co-Clustering of Continuous Data
  5.1. Metric approach
    5.1.1. Measure of information
    5.1.2. Summarized data associated with partitions
    5.1.3. Objective function
    5.1.4. CROEUC algorithm
  5.2. Gaussian latent block model
    5.2.1. The model
    5.2.2. Gaussian LBVEM and LBCEM algorithms
    5.2.3. Parsimonious Gaussian latent block models
  5.3. Illustrative example
  5.4. Gaussian block mixture model
    5.4.1. The model
    5.4.2. GBEM algorithm
  5.5. Numerical experiments
    5.5.1. GBEM versus CROEUC and EM
    5.5.2. Effect of the size of data
  5.6. Conclusion
Bibliography
Index

About the Authors:
Gérard Govaert is Professor at the University of Technology of Compiègne, France. Mohamed Nadif is Professor at the University of Paris-Descartes, France.


Product Details
  • ISBN-13: 9781118649503
  • Publisher: John Wiley & Sons Inc
  • Publisher Imprint: Wiley-ISTE
  • Language: English
  • Subtitle: Models, Algorithms and Applications
  • ISBN-10: 1118649508
  • Publication Date: 11 Dec 2013
  • Binding: Digital (delivered electronically)
  • No. of Pages: 256



