Meeting (2016-08-05)


 Meeting Outcomes:

(jump to notebook added after meeting at bottom of page)

 

 

Next full-team meeting date(s)? -- some options:

 


I. Corpus Finalization -- (possible to complete A, B, C, D below by early Sept.?)

 

 

 

 

 

 


II. Manifest/MongoDB Work

 

 

 


III. Topic Modeling & Interpretation -- (start on this in early to late Sept.?)

 

 

 


 

 

 

 

 

 

 

Meeting Outcomes (current to-do's in red)

  • Next full-team meeting: Friday, Aug. 26th, 11:30 a.m. (Pacific)
  • Metadata finalization work
    • RAs to meet in next 2 weeks for shared work session to help make our workflow more uniform.
    • Individual RAs to continue with the way they have been finalizing CSV files. Standardization of variations among the files to be done later (ideally assisted by scripting).
    • We are aiming to complete metadata finalization by early Sept.
  • Plain-text file finalization work:
    • Tyler & Ishjot will create a script to export plain-text files from the CSVs, name them, & store in appropriate locations.
    • Unicode special-character issues will be addressed at the later stage of using Scott's Python scrubbing script on the corpus.
    • Jeremy's continued work on de-duping script has obviated need for RA subgroup to study de-duping results. Instead, Jeremy will continue and build the results of de-duping (with Ishjot & Tyler's help) into our total workflow.
    • Tyler & Ishjot will assist Jeremy in implementing total workflow in the virtual machine environment.
  • Manifest/MongoDB work:
    • Ishjot & Tyler to continue working with Scott on developing the backend (including file upload capability).
  • Preparing for Topic Modeling & Interpretation:
    • Lindsay will work on creating a random-sample corpus from our publications.