Text Categorization

STRI Output Filter Options (-of:cn~INT)

  • Description:

    This option is used to set the number of cluster when the ST scores are in cluster format. The INT is the specified value of cluster number. The default value is 5.

  • Examples:
    > stri -p -of:c -of:cn~7
    - Please input a term (type "Ctl-d" to quit) >
    steroid
    --> Input: [steroid]
    --- ST scores (x 1) and rank based on word count ---
    horm|T125|Hormone
    1|0.7806|horm|T125|Hormone
    ----------------------------------
    2|0.7062|strd|T110|Steroid
    ----------------------------------
    3|0.7024|nsba|T124|Neuroreactive Substance or Biogenic Amine
    ----------------------------------
    4|0.6777|patf|T046|Pathologic Function
    ----------------------------------
    5|0.6215|aapp|T116|Amino Acid, Peptide, or Protein
    ----------------------------------
    6|0.6152|lbpr|T059|Laboratory Procedure
    ----------------------------------
    7|0.6049|rcpt|T192|Receptor
    ----------------------------------
    --- ST scores (x 1) and rank based on document count ---
    horm|T125|Hormone
    1|0.8248|horm|T125|Hormone
    ----------------------------------
    2|0.7818|strd|T110|Steroid
    ----------------------------------
    3|0.7463|nsba|T124|Neuroreactive Substance or Biogenic Amine
    ----------------------------------
    4|0.7040|patf|T046|Pathologic Function
    ----------------------------------
    5|0.6589|aapp|T116|Amino Acid, Peptide, or Protein
    ----------------------------------
    6|0.6557|lbpr|T059|Laboratory Procedure
    ----------------------------------
    7|0.6371|fndg|T033|Finding
    ----------------------------------
    --- Overall ST rank ---
    horm|T125|Hormone|dc