You searched for subject:(gavagai). One record found.

Linköping University

1. Gränsbo, Gustav. Word Clustering in an Interactive Text Analysis Tool.

Degree: Human-Centered systems, 2019, Linköping University

A central operation of users of the text analysis tool Gavagai Explorer is to look through a list of words and arrange them in groups. This thesis explores the use of word clustering to automatically arrange the words in groups intended to help users. A new word clustering algorithm is introduced, which attempts to produce word clusters tailored to be small enough for a user to quickly grasp the common theme of the words. The proposed algorithm computes similarities among words using word embeddings, and clusters them using hierarchical graph clustering. Multiple variants of the algorithm are evaluated in an unsupervised manner by analysing the clusters they produce when applied to 110 data sets previously analysed by users of Gavagai Explorer. A supervised evaluation is performed to compare clusters to the groups of words previously created by users of Gavagai Explorer. Results show that it was possible to choose a set of hyperparameters deemed to perform well across most data sets in the unsupervised evaluation. These hyperparameters also performed among the best on the supervised evaluation. It was concluded that the choice of word embedding and graph clustering algorithm had little impact on the behaviour of the algorithm. Rather, limiting the maximum size of clusters and filtering out similarities between words had a much larger impact on behaviour.
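The pipeline the abstract describes (embedding-based similarities, filtering of weak similarities, graph clustering, and a cap on cluster size) can be illustrated with a minimal sketch. This is not the thesis's algorithm: the connected-components splitting used here is a simple stand-in for hierarchical graph clustering, and the function names, the parameters `min_sim`, `max_size`, and `step`, and the toy vectors are all assumptions made for illustration.

```python
import math
from itertools import combinations


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def components(words, sims, threshold):
    """Connected components of the graph whose edges have similarity >= threshold."""
    neighbours = {w: set() for w in words}
    for (a, b), s in sims.items():
        if s >= threshold and a in neighbours and b in neighbours:
            neighbours[a].add(b)
            neighbours[b].add(a)
    seen, result = set(), []
    for w in words:
        if w in seen:
            continue
        seen.add(w)
        stack, comp = [w], []
        while stack:
            x = stack.pop()
            comp.append(x)
            for y in neighbours[x]:
                if y not in seen:
                    seen.add(y)
                    stack.append(y)
        result.append(sorted(comp))
    return result


def cluster_words(embeddings, min_sim=0.5, max_size=3, step=0.1):
    """Group words; re-split any group larger than max_size at a stricter threshold.

    Hypothetical parameters: min_sim filters out weak similarities,
    max_size caps cluster size, step controls how fast the threshold rises.
    A group that is still too large near threshold 1.0 is returned as-is.
    """
    words = sorted(embeddings)
    sims = {
        (a, b): cosine(embeddings[a], embeddings[b])
        for a, b in combinations(words, 2)
    }

    def split(group, threshold):
        out = []
        for comp in components(group, sims, threshold):
            if len(comp) > max_size and threshold + step <= 1.0:
                out.extend(split(comp, threshold + step))
            else:
                out.append(comp)
        return out

    return sorted(split(words, min_sim))


# Toy two-dimensional "embeddings", invented for this example.
toy = {
    "cat": [1.0, 0.0],
    "dog": [0.9, 0.1],
    "car": [0.0, 1.0],
    "bus": [0.1, 0.9],
}
clusters = cluster_words(toy, min_sim=0.5, max_size=3)
```

In this sketch, raising `min_sim` or lowering `max_size` directly changes how the groups come out, which mirrors the abstract's observation that these two choices mattered more than the embedding or clustering method.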

Subjects/Keywords: word clustering; word embedding; distributional semantics; hierarchical clustering; text analytics; language technology; natural language processing; gavagai; Language Technology (Computational Linguistics); Språkteknologi (språkvetenskaplig databehandling)


APA (6th Edition):

Gränsbo, G. (2019). Word Clustering in an Interactive Text Analysis Tool. (Thesis). Linköping University. Retrieved from http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-157497

Note: this citation may be lacking information needed for this citation format:
Not specified: Masters Thesis or Doctoral Dissertation

Chicago Manual of Style (16th Edition):

Gränsbo, Gustav. “Word Clustering in an Interactive Text Analysis Tool.” 2019. Thesis, Linköping University. Accessed August 13, 2020. http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-157497.

Note: this citation may be lacking information needed for this citation format:
Not specified: Masters Thesis or Doctoral Dissertation

MLA Handbook (7th Edition):

Gränsbo, Gustav. “Word Clustering in an Interactive Text Analysis Tool.” 2019. Web. 13 Aug 2020.

Vancouver:

Gränsbo G. Word Clustering in an Interactive Text Analysis Tool. [Internet] [Thesis]. Linköping University; 2019. [cited 2020 Aug 13]. Available from: http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-157497.

Note: this citation may be lacking information needed for this citation format:
Not specified: Masters Thesis or Doctoral Dissertation

Council of Science Editors:

Gränsbo G. Word Clustering in an Interactive Text Analysis Tool. [Thesis]. Linköping University; 2019. Available from: http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-157497

Note: this citation may be lacking information needed for this citation format:
Not specified: Masters Thesis or Doctoral Dissertation
