TOPIC MODEL - 主题模型

主题模型的意义

Topic modeling provides methods for automatically organizing, understanding, searching, and summarizing large electronic archives.

  1. Discover the hidden themes that pervade the collection.
  2. Annotate the documents according to those themes.
  3. Use annotations to organize, summarize, and search the texts.

Latent Dirichlet allocation(LDA)

参考文献

  1. Probabilistic Topic Models, ICML2012 Tutorial: http://www.cs.columbia.edu/~blei/talks/Blei_ICML_2012.pdf
    2.