Meeting 13 (2015-06-22)

Page history last edited by Alan Liu 9 years ago

 

Academic Senate grant #2!

 

Preparation for June 29th RA Orientation Meeting and July Collection Runs

 

(1) Workstation Preparation

  • Setting up "WE1S" user on Transcriptions machines with a standard development environment:
    • "WE1S" user:
      • Login on Transcriptions machines: Username = TRANSCRIPT-1\WE1S  Password = 
      • Google account username: 4humwe1s (Gmail: 4humwe1s@gmail.com) (same password)
      • LastPass account username: 4humwe1s (same password)
    • Collection Workstation Set-Up
    • Setting up the Mac and Linux machines.
    • Drive space issues
  • CRC machines? Personal laptops?
  • Individual developer accounts for APIs and for Import.io

 


 

(2) July Collection Runs 

  • June 29th RA Orientation Meeting (noon):
    • RAs:
      • Jonathan Callies (English)
      • Ashley Champagne (English)
      • Phillip Cortes (English)
      • Zach Horton (English)
      • (Alex Kulick -- possible, depending on his summer travels) (Sociology)
      • Patrick Mooney (English)
      • Christopher Walker (back after July 7th) (English)
      • Secondary RA orientation in July:
        • Jonathan Callies (English)
        • Christopher Walker (back after July 7th) (English)
    • Materials for Orientation Meeting
      • Example time sheet
      • Hard-copy instructions for the various collection workflows
      • Hard-copy of Google Form for collection manifest
      • To-do list (e.g., get personal accounts for APIs and Import.io)
  • [July ?] Collection Run 1: New York Times
    • Everyone takes a year of the NY Times and collects; repeat until we have the whole corpus.
    • Also: manual work getting the continued pages in the NY Times 
  • [July ?] Collection Run 2: Wall Street Journal
    • Everyone takes a year of the WSJ and collects; repeat until we have the whole corpus.
    • Also: manual work getting the hidden Flash text in the ProQuest versions of the WSJ
  • [July and on] --
    • Collection runs for the Guardian, USA Today, etc.
    • RAs develop workflows for additional publications
    • Begin preprocessing work (development of stoplists; scrubbing of texts using Lexos)
    • Begin topic modeling work
    • Alan attempts to recruit programmer help from MAT and/or Geography for visualization work.
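
The "everyone takes a year ... repeat until we have the whole corpus" plan for Collection Runs 1 and 2 could be tracked with a simple round-robin assignment. The sketch below is only an illustration: the function name `assign_years` and the 2000-2014 year range are hypothetical placeholders, since the notes do not specify the corpus date range, and the RA list is taken from the orientation roster above.

```python
def assign_years(ras, first_year, last_year):
    """Split a range of years into rounds; in each round every RA gets one year.

    Returns a list of dicts mapping RA name -> year. The final round may be
    shorter if the years do not divide evenly among the RAs.
    """
    years = list(range(first_year, last_year + 1))
    rounds = []
    for start in range(0, len(years), len(ras)):
        batch = years[start:start + len(ras)]
        # zip stops at the shorter sequence, so a partial last batch is fine
        rounds.append(dict(zip(ras, batch)))
    return rounds


if __name__ == "__main__":
    # RA names from the orientation list; the year range is a made-up example.
    ras = ["Champagne", "Cortes", "Horton", "Mooney"]
    for number, assignment in enumerate(assign_years(ras, 2000, 2014), start=1):
        print(f"Round {number}:")
        for ra, year in assignment.items():
            print(f"  {ra}: {year}")
```

The same function would serve for the WSJ run; RAs who join after the secondary orientation could simply be added to the list before the next round is generated.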

 


 

 
