Date of Award
4-22-2008
Degree Type
Thesis
Degree Name
Master of Science (MS)
Department
Mathematics and Statistics
First Advisor
Jiawei Liu - Co-Chair
Second Advisor
Yu-sheng Hsu - Co-Chair
Third Advisor
Jeff Qin
Abstract
The study aims to estimate the ability of different grouping techniques on categorical response. We try to find out how well do they work? Do they really find clusters when clusters exist? We use Cancer Problems in Living Scales from the ACS as our categorical data variables and lung cancer survivors as our studying group. Five methods of cluster analysis are examined for their accuracy in clustering on both real CPILS dataset and simulated data. The methods include hierarchical cluster analysis (Ward's method), model-based clustering of raw data, model-based clustering of the factors scores from a maximum likelihood factor analysis, model-based clustering of the predicted scores from independent factor analysis, and the method of latent class clustering. The results from each of the five methods are then compared to actual classifications. The performance of model-based clustering on raw data is poorer than that of the other methods and the latent class clustering method is most appropriate for the specific categorical data examined. These results are discussed and recommendations are made regarding future directions for cluster analysis research.
DOI
https://doi.org/10.57709/1059706
Recommended Citation
Guo, Ling, ""Clustering Categorical Response" Application to Lung Cancer Problems in Living Scales." Thesis, Georgia State University, 2008.
doi: https://doi.org/10.57709/1059706