• Login
    View Item 
    •   MUT Research Archive
    • Journal Articles
    • School of Computing and IT (JA)
    • Journal Articles (CI)
    • View Item
    •   MUT Research Archive
    • Journal Articles
    • School of Computing and IT (JA)
    • Journal Articles (CI)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Quadratic Approach for Fast Topic Selection in Modelling Big Text Analytics

    Thumbnail
    View/Open
    Quadratic Approach for Fast Topic Selection in Modelling Big Text Analytics.pdf (2.145Mb)
    Date
    2018
    Author
    Wambugu, Geoffrey M
    Onyango, George
    Kimani, Stephen
    Metadata
    Show full item record
    Abstract
    One challenging issue in application of Latent Dirichlet Allocation (LDA) is to select the optimal number of topics which must depend on both the corpus itself and user modeling goals. This paper presents a topic selection method which models the minimum perplexity against number of topics for any given dataset. The research set up scikit-learn and graphlab on jupyter notebook in the google cloud compute engine’s custom machine and then developed python code to manipulate selected existing datasets. Results indicate that the graph of perplexity against number of topics (K) has a strong quadratic behaviour around a turning point and opens upwards. This means that the graph has a minimum perplexity point that optimizes K. The paper presents a model of the optimum K in an identified interval and guides the calculation of this value of K within three iterations using quadratic approach and differential calculus. This improves inferential speed of number of topics and hyper parameter alpha thereby enhancing LDA application in big data.
    URI
    http://hdl.handle.net/123456789/3596
    Collections
    • Journal Articles (CI) [118]

    MUT Library copyright © 2017-2024  MUT Library Website
    Contact Us | Send Feedback
     

     

    Browse

    All of Research ArchiveCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    LoginRegister

    MUT Library copyright © 2017-2024  MUT Library Website
    Contact Us | Send Feedback