What and how much data should we collect? Principles informing parallel corpus construction
Text classification: what is it?
Questions and discussion
2. Initial Tasks
Everyone: Read, or re-read as the case may be, the following two articles:
Hoyt Long and Richard Jean So's article, "Literary Pattern Recognition: Modernism between Close Reading and Machine Learning," shared via Ryver. Focus on the section titled "The English Haiku as Statistical Pattern"