Estimation and Selection in Regression Clustering

Guoqi Qian; Yuehua Wu

Estimation and Selection in Regression Clustering

Authors

Guoqi Qian The University of Melbourne
Yuehua Wu York University

Keywords:

Regression clustering, Least squares, Model Selection

Abstract

Regression clustering is an important model-based clustering tool having applications in a variety of disciplines. It discovers and reconstructs the hidden structure for a data set which is a random sample from a population comprising a fixed, but unknown, number of sub-populations, each of which is characterized by a class-specific regression hyperplane. An essential objective, as well as a preliminary step, in most clustering techniques including regression clustering, is to determine the underlying number of clusters in the

data. In this paper, we briefly review regression clustering methods and discuss how to determine the underlying number of clusters by using model selection techniques, in particular, the information-based technique. A computing algorithm is developed for estimating the number of clusters and other parameters in regression clustering. Simulation studies are also provided to show the performance of the algorithm.

Author Biographies

Guoqi Qian, The University of Melbourne

Department of Mathematics and Statistics
Yuehua Wu, York University

Department of Mathematics and Statistics

Downloads

Additional Files

Published

2011-11-27

Issue

Vol. 4 No. 4: (October 2011)

Section

Econometrics and Statistics

License

Upon acceptance of an article by the European Journal of Pure and Applied Mathematics, the author(s) retain the copyright to the article. However, by submitting your work, you agree that the article will be published under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). This license allows others to copy, distribute, and adapt your work, provided proper attribution is given to the original author(s) and source. However, the work cannot be used for commercial purposes.

By agreeing to this statement, you acknowledge that:

You retain full copyright over your work.
The European Journal of Pure and Applied Mathematics will publish your work under the Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
This license allows others to use and share your work for non-commercial purposes, provided they give appropriate credit to the original author(s) and source.

How to Cite

Estimation and Selection in Regression Clustering. (2011). European Journal of Pure and Applied Mathematics, 4(4), 455-466. https://www.ejpam.com/index.php/ejpam/article/view/1184

Download Citation

Estimation and Selection in Regression Clustering

Authors

Keywords:

Abstract

Author Biographies

Downloads

Additional Files

Published

Issue

Section

License

How to Cite

submit a manuscript

Information

right_block_image

affiliated_journal_block

formatting_package