In the Margins

The Upload Tool

Upload is the standard starting point for the Lexos workflow. When you begin a new session or reset your workspace, you will be automatically re-directed to Upload.

Use of the tool is fairly straightforward. Drag your document files into the box labeled drop files here, or click the Browse button to use your web browser's file browser to locate your files. Most browsers will allow you to shift- or control-click to select multiple files.

There are some restrictions on file upload size in order to prevent the browser from hanging. Nevertheless, upload times may be slow for large files, particularly if you are working over the internet. The maximum file size of 250MB is approximately the size of of nine Webster's Unabridged Dictionaries. If you experience a problem, try uploading smaller files, or, if you are uploading many files, try uploading them in smaller batches.

Lexos accepts files in .txt, .html, .xml, and .sgml. Make sure that your filenames contain these extensions.

Once you have selected your files, they will begin to upload, one at a time. As each upload is complete, you will see a notification at the bottom of the screen shortly after the Ready For Files To Upload progress bar has said "Complete!" The bigger the file the longer it will take to upload and show up on the page. After uploading is complete, each file is considered a document by Lexos. You can activate, de-activate, and re-label, and classify your documents using the Manage tool.

Note on character encoding: Lexos will automatically convert all files to UTF-8 character encoding. If you are uploading HTML, XML, or SGML files that contain special characters, the Scrubber tool will help you to convert them to UTF-8 characters.

The Lexos Beta Web Scraper

At present, your documents must be available as files on your computer. However, Lexos has a Beta web scraper tool, which will allow you to download files off the internet. This is especially useful when you are using files from sources such as Project Gutenberg. To enable the web scraper, click the "Gear" icon in the top right corner of the screen and select the Use Beta functions checkbox. A link to the web scraper tool will appear above the Browse button. Wherever possible, use it to download plain text files since, otherwise, you will download all the HTML markup in a web page (this can be removed using the Scrubber tool). Upload times may vary, depending on internet speeds. If the process seems to hang, try uploading fewer urls. Large-scale web scraping should not be done in Lexos.

This page has paths: