Text Categorization

Manual Installation Instructions



Introduction

This page shows what Java Text Categorization tools installation script does. A manually installation can be performed to replace the installation script. It is NOT recommanded for general users to install the text Categorization tools manually. Please refer to installation instructions for installing the text categorization tools.

Prerequisites

Install

  1. Create a new directory for the Text Categorization tool on your machine.
  2. Download the zip archive: tc2009.tgz, and place it in the newly created directory. "2009" is the version identification (year).
  3. Uncompress or Unzip the archive. Be careful to use a version of unzip that allows long filenames and that will recreate the full directory structure.

    You should now have four files and eight sub-directories under tc2009, as shown in follows:

    • build.xml
    • overview.html
    • ReadMe.txt
    • SPECIALIST_NLP_TOOLS_TERMS_AND_CONDITIONS.txt

    • /bin
    • /data
    • /docs
    • /examples
    • /install
    • /lib
    • /logs
    • /sources

Please refer to Repository Lists for details.

SetUp

  1. Include the ${TC_DIR}/lib/tc2009dist.jar file to your CLASSPATH environment variable
    => tc2009dist.jar includes classes of mlt, jdi, sti, stri, stWsd all tc api classes, and jdbc driver of HSqlDb library

  2. Include the full path of the top directory of tc2009, ${TC_DIR}, to your CLASSPATH environment variable
    => This allows tc to find the default configuration file, data/config/tc.properties

  3. Change the value of variable ROOT_DIR to the full path of the top directory of tc2009 in the configuration file, data/config/tc.properties

  4. Make sure values of database variables are setup appropriately in the configuration file, see configuration setup for details

Now you are ready for running Mlt, Jdi, Sti, Stri, and StWsd