An Interactive Visualization Platform for Interpreting Topic Models

Carson Sievert (joint work with Kenny Shirley)
9-26-2013

The ugly truth

Overview

  1. What is a topic model?
  2. Fitting topic models via Latent Dirichlet Allocation (LDA).
  3. Fitting an LDA model with the R package LDAviz.
  4. Interpreting model output.
    • Topic specific keywords.
    • 'Global' keywords.
  5. Measuring similarity between topics.
  6. Visualizing model output.

Live Demo

  • These ideas (and some others) are integrated into a fully interactive web application.
  • This instance makes use of:
    • The R package shiny
    • The JavaScript library D3
  • By integrating these tools we can harness the best of both worlds!
    • R for statistical algorithms
    • JavaScript for interacting with graphics
library(shiny) #run application from R
runGitHub("LDAviz", "kshirley", subdir="inst/shiny/hover")

Acknowledgements

  • Thanks to Kenny Shirley for being a great mentor.
  • Thanks to Carlos Scheidegger for sharing his love & hatred for JavaScript.
  • Thanks to all the members of Statistics Research for helpful feedback and discussions.

Questions/Comments/Compliments?