Pre-processing of Harvested Plain Text


Replace:

 

Tokenize:

 

 

Tokenize for adding to stop list:

 

 

 

Delete (or put in stop list):
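
The steps named in the headings above (replace, tokenize, stop list, delete) could be chained roughly as follows. This is only a minimal sketch: the specific replacement rules, the token pattern, and the stop-list entries are not given in this outline, so `REPLACEMENTS`, `TOKEN_PATTERN`, and `STOP_LIST` below are hypothetical placeholders, as are all function names.

```python
import re

# Hypothetical placeholder rules; the outline does not say what to replace.
REPLACEMENTS = {
    "\u2019": "'",   # right single quotation mark -> ASCII apostrophe
    "\u201c": '"',   # left double quotation mark -> ASCII quote
    "\u201d": '"',   # right double quotation mark -> ASCII quote
    "\u00a0": " ",   # non-breaking space -> ordinary space
}

# Hypothetical token pattern: runs of letters, digits, and apostrophes.
TOKEN_PATTERN = re.compile(r"[A-Za-z0-9']+")

# Hypothetical stop-list entries; extend with whatever should be deleted.
STOP_LIST = {"the", "a", "of"}


def replace_chars(text):
    """Apply each literal replacement rule to the text."""
    for old, new in REPLACEMENTS.items():
        text = text.replace(old, new)
    return text


def tokenize(text):
    """Lowercase the text and split it into tokens."""
    return TOKEN_PATTERN.findall(text.lower())


def remove_stop_words(tokens, stop_list=STOP_LIST):
    """Delete every token that appears in the stop list."""
    return [t for t in tokens if t not in stop_list]


def preprocess(text):
    """Run replace, tokenize, and stop-list deletion in order."""
    return remove_stop_words(tokenize(replace_chars(text)))
```

With these placeholder rules, `preprocess("The cat\u2019s toy")` returns `["cat's", "toy"]`. Keeping the rules in plain dictionaries and sets means items can be moved between "delete" and "stop list" without touching the functions.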