Meeting Outcomes (current to-do's in red)
- Next full-team meeting: Friday, Aug. 26th, 11:30 a.m. (Pacific)
- Metadata finalization work:
- RAs to meet in the next 2 weeks for a shared work session to make our workflow more uniform.
- Individual RAs to continue with the way they have been finalizing CSV files. Standardization of variations among the files to be done later (ideally assisted by scripting).
- We are aiming to complete metadata finalization by early Sept.
- Plain-text file finalization work:
- Tyler & Ishjot will create a script to export plain-text files from the CSVs, name them, & store them in the appropriate locations.
- Unicode special-character issues will be addressed at a later stage, when Scott's Python scrubbing script is run on the corpus.
- Jeremy's continued work on the de-duping script has obviated the need for an RA subgroup to study the de-duping results. Instead, Jeremy will continue that work and build the results of de-duping (with Ishjot & Tyler's help) into our total workflow.
- Tyler & Ishjot will assist Jeremy in implementing the total workflow in the virtual machine environment.
- Manifest/MongoDB work:
- Ishjot & Tyler to continue working with Scott on developing the backend (including file upload capability).
- Preparing for Topic Modeling & Interpretation:
- Lindsay will work on creating a random-sample corpus from our publications.