In the Margins

The Scrub Tool

The Scrubbing tool allows you to make document-wide edits, such as the removal of punctuation, digits, and capital letters. Additionally, it also allows you to input a list of stopwords, lemmas, consolidations, and special characters, each of which will be explained in more detail below. The overall goal of this scrubbing is to normalize texts, removing every possible difference except the actual words used, and thereby permitting the application of advanced statistical analysis. Without scrubbing, said statistical analysis may well register false variation between texts.

Tutorial:
The first part of the scrubbing process involves several simple options which effect the entire document. In almost all cases, their names provide ample evidence of their functionality, so the important thing to remember is that they will take effect throughout your selections. If you select 'Remove All Punctuation', every period, comma, colon, and quotation mark will be removed.

Contents of this path:

  1. Lemmas