You searched for
+publisher:"University of Connecticut" +contributor:("Kevin Brown, Ph.D., Ion Mandiou, Ph.D., Yong-Jun Shin, M.D., Ph.D."). One record found.
No search limiters apply to these results.
University of Connecticut
Yankee, Tara N.
Rank Aggregation of Feature Scoring Methods for Unsupervised Learning.
Degree: M. Eng., Biomedical Engineering, 2017, University of Connecticut
The ability to collect and store large amounts of data is transforming data-driven discovery; recent technological advances in biology allow systematic data production and storage at a previously unattainable scale. It is common for biological Big Data to have an order of magnitude or more features than samples. Feature scoring with selection is therefore an essential pre-processing step to finding meaningful clusters in these data. Many feature scoring algorithms have been proposed; they are based on dramatically different ideas about what constitutes a “good” or “important” feature. Motivated by studies in data classification, we use a rank aggregation (RANKAGG) method to combine estimates of feature importance from multiple sources and use a subset of the highest scoring features for subsequent clustering. We demonstrate the performance of RANKAGG on five real-world biological data-sets, and compare the clustering performance of RANKAGG to the thirteen individual feature scoring methods comprising RANKAGG. The rank aggregated features have a mean perfor- mance across the five data-sets equal to the best individual feature scoring method but with lower variance, indicating robust performance across a variety of data. We carefully consider if there is any systematic way to remove rankers from RANKAGG to improve clustering performance. We demonstrate that rank aggregated feature selection yields excellent performance in clustering problems and possibly more im- portantly, greatly limits the risk of choosing a method that is sub-optimal for a given data-set.
Advisors/Committee Members: Kevin Brown, Ph.D., Ion Mandiou, Ph.D., Yong-Jun Shin, M.D., Ph.D., Kevin Brown, Ph.D..
Subjects/Keywords: clustering; ensemble learning; feature selection; unsupervised learning
to Zotero / EndNote / Reference
APA (6th Edition):
Yankee, T. N. (2017). Rank Aggregation of Feature Scoring Methods for Unsupervised Learning. (Masters Thesis). University of Connecticut. Retrieved from https://opencommons.uconn.edu/gs_theses/1123
Chicago Manual of Style (16th Edition):
Yankee, Tara N. “Rank Aggregation of Feature Scoring Methods for Unsupervised Learning.” 2017. Masters Thesis, University of Connecticut. Accessed June 20, 2019.
MLA Handbook (7th Edition):
Yankee, Tara N. “Rank Aggregation of Feature Scoring Methods for Unsupervised Learning.” 2017. Web. 20 Jun 2019.
Yankee TN. Rank Aggregation of Feature Scoring Methods for Unsupervised Learning. [Internet] [Masters thesis]. University of Connecticut; 2017. [cited 2019 Jun 20].
Available from: https://opencommons.uconn.edu/gs_theses/1123.
Council of Science Editors:
Yankee TN. Rank Aggregation of Feature Scoring Methods for Unsupervised Learning. [Masters Thesis]. University of Connecticut; 2017. Available from: https://opencommons.uconn.edu/gs_theses/1123