Meeting 16 (2015-08-03)

Page history last edited by Alan Liu 8 years, 8 months ago

Progress to Date (and Future Scheduling)

 

  • Status Reports (Developer Task Assignments page)
  • This Week:
    • Wednesday (Aug. 5), 1 pm -- Planning meeting (joined by Scott Kleinman). The goals of this meeting will include:
      * Discussion of scrubbing and other preprocessing steps
      * Discussion of topic modeling strategy
      * Discussion of ideas for eventual public-facing interface

 

 


Setting Priorities for Other Publications to Collect

 

  • What should our priorities be in collecting other publications beyond the ones we already have? 
    For example, here are some possibilities:
    * Other U.S. cities (e.g., Washington Post, LA Times, Chicago Tribune)?
    * Online news/popular media (e.g., Huff Post, Salon)
    * TV/Radio media
    * Middlebrow (e.g., USA Today)
    * Magazines (e.g., New Republic, LA Review of Books)?
    * Economic press (e.g., Forbes, Business Insider, The Economist)?
    * Higher-education press (e.g., Chronicle of Higher Education, Higher Ed)?
    * Campus papers (e.g., Harvard Crimson, Yale Daily News, UCLA Bruin)?
    * Commencement speeches.
    * Articles on "sciences"?
    * Social media

 

  1. Other nations: UK (Canada, Australia, New Zealand, India)?
  2. At least one other U.S. city
  3. Born-digital publications
  4. One higher-ed publication
  5. Economic press
  6. Etc.

 


Observations and Findings From Human Reading of Our Corpus

 

  • General observations?
  • Our research questions? (example topic model of NYT 2014 "humanities") (AntConc analysis of the same files)
    • What are the topics with which the humanities are associated?
      • What are the most important topics with which the humanities are associated?
      • What are the kinds of topics in the corpus? What is their order of importance?
        • To what degree can we see topics that coincide with disciplinary definitions of the humanities (e.g., literature, history, philosophy, classics)?
        • Also examine how the sciences appear in the corpus in relation to the humanities.
        • How are the humanities related to the arts relative to the social sciences?
        • To what degree can we see institutional, political, academic, and other domains of topics?
      • Can we identify the articles that are prevalently about the humanities in order to see what topics have a high weight in them?
      • What are collocates and n-grams of "humanities"?
        • To what degree does searching on collocates and n-grams produce a corpus that overlaps with our existing corpus?
    • How do topics vary by nation and decade?
      • Identify important historical moments and their correlation with the humanities?
      • How does public discourse about the humanities correlate with educational facts on the ground (enrollment, tuition, etc.)?
    • What are the "hot-button" topics in public discourse about the humanities? How do they compare to the way academics and foundations discuss the humanities?
  • Fixes for our scrubbing list ("List of Fixes Needed for Raw Texts")?
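
As a starting point for the question above about collocates and n-grams of "humanities", here is a minimal Python sketch using only the standard library. It is an illustration under stated assumptions, not the project's actual workflow: the function names, the simple regex tokenizer, and the default window size are all hypothetical choices, and the texts are assumed to be already-scrubbed plain text.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (hypothetical, very simple tokenizer)."""
    return re.findall(r"[a-z']+", text.lower())


def collocates(tokens, node="humanities", window=4):
    """Count words occurring within `window` tokens on either side of each `node` occurrence."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == node:
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            # Count every token in the window except the node occurrence itself.
            counts.update(t for j, t in enumerate(tokens[lo:hi], start=lo) if j != i)
    return counts


def ngrams_with(tokens, node="humanities", n=2):
    """Count all n-grams (as tuples of tokens) that contain `node`."""
    grams = zip(*(tokens[k:] for k in range(n)))
    return Counter(g for g in grams if node in g)
```

Calling `collocates(tokenize(article_text)).most_common(20)` on each article would give a rough list of candidate search terms; stopwords are not filtered here, so a stoplist would likely be needed before comparing the results with the existing corpus.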



 

 

 
