Planning Document for Topic Modeling
Last edited by Alan Liu
Last updated: Oct. 22, 2014
Research Method
Topic Modeling
Introductions to the Idea of Topic Modeling
Topic Modeling Tools (see DH Toychest for fuller list)
- MALLET (command-line version; download and install in a folder called "mallet" directly in the root directory of your computer)
- Java GUI version of Mallet (aka "GUI Topic Modeling Tool"; download the .jar file and run from your computer)
- LDAvis ("R package for interactive topic model visualization") (example of use)
- MALLET-to-Gephi Data Stacker (online tool that takes "the '--output-doc-topics' output from MALLET and reorganize it into a format that Gephi understands")
- [for other tools related to text-preparation and workflow for topic modeling, see below]
Tutorials for MALLET Topic Modeling
Initial Proof of Concept for a small sample of documents from the WhatEvery1Says corpus
Participants in the project should all perform experimental topic modeling runs on samples from the corpus as they work on particular components of the text-preparation processes outlined above. Ideally, there will be an iterative relation between tweaks to those processes and tweaks to the topic modeling.
- Log of Topic Model Runs (and MALLET commands): collective log of our topic model runs with a record by date of each run and the MALLET commands used, plus commentary on or links to the results (in progress)
- Alan's topic model run of 1 May 2014 (using MALLET): results
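One way to keep the log of runs consistent is to generate each entry from the same values used to build the MALLET commands. The following is only a minimal sketch, not the project's actual workflow: the function names, file names, and log layout are hypothetical, and it assumes MALLET is run from inside its install folder as `bin/mallet` (on Windows the launcher path would differ). The only MALLET options used are `import-dir` and `train-topics` with `--input`, `--output`, `--keep-sequence`, `--remove-stopwords`, `--num-topics`, `--output-topic-keys`, and `--output-doc-topics`. The script builds the command lines and formats a dated log entry; it does not run MALLET itself.

```python
import shlex
from datetime import date


def build_mallet_commands(corpus_dir, num_topics, prefix="run"):
    """Return the import and training command lines for one run, as argv lists.

    corpus_dir and prefix are illustrative names chosen for this sketch.
    """
    model_file = f"{prefix}.mallet"
    import_cmd = [
        "bin/mallet", "import-dir",
        "--input", corpus_dir,
        "--output", model_file,
        "--keep-sequence",
        "--remove-stopwords",
    ]
    train_cmd = [
        "bin/mallet", "train-topics",
        "--input", model_file,
        "--num-topics", str(num_topics),
        "--output-topic-keys", f"{prefix}_keys.txt",
        "--output-doc-topics", f"{prefix}_composition.txt",
    ]
    return [import_cmd, train_cmd]


def format_log_entry(run_date, commands, note=""):
    """Format one log entry: the run date, each command, and an optional note."""
    lines = [f"Run of {run_date.isoformat()}"]
    lines += ["  " + shlex.join(cmd) for cmd in commands]
    if note:
        lines.append("  Note: " + note)
    return "\n".join(lines)


# Hypothetical usage: a 20-topic run on a folder of sample documents.
commands = build_mallet_commands("sample-docs", 20, prefix="sample")
print(format_log_entry(date.today(), commands, "hypothetical example entry"))
```

Because the commands are kept as argument lists, the same values could later be passed to `subprocess.run` to execute a run, while `shlex.join` (Python 3.8 or later) gives a copy-pasteable line for the log.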