Text Categorization

STI Output Filter Options (-of:can)~STR

  • Description:

    This option is used to specify the ST candidates. Only those scores associated with the specified ST candidates are display in the output. STR is the specified ST candidates. It is case sensitive. Three formats are used in STR:

    • TUI format:
      Such as "T022"
    • ST Abbreviation:
      Such as "bdsy"
    • ST Full name:
      Such as "Body_System"

    When ST full name format is used, there are characters conflict with command line option syntax and should not be used. Please use the replaced characters in the following table for the TC command line tools:

    Original
    character
    Replaced
    character
    Example
    '  ''_'T082|spco: Spatial Concept
    => Spatial_Concept
    '-''='T068|hcpp: Human-caused Phenomenon or Proces
    => Human=caused_Phenomenon_or_Proces

    The TUI format is recommended for the simplicity reasons. STI also provides -st option to display the Semantic Types list.

  • Examples:
    > sti -of:can~food -of:can~T167 -of:can~"Amino_Acid,_Peptide,_or_Protein" -p
    - Please input a term (type "Ctl-d" to quit) >
    sugar
    --> Input: [sugar]
    --- ST scores (x 1) and rank based on word count ---
    8|0.6986|food|T168|Food
    9|0.6645|aapp|T116|Amino Acid, Peptide, or Protein
    12|0.6284|sbst|T167|Substance
    --- ST scores (x 1) and rank based on document count ---
    8|0.7194|aapp|T116|Amino Acid, Peptide, or Protein
    9|0.7186|food|T168|Food
    12|0.6473|sbst|T167|Substance
    

    Index input and show results only in STs of [Food], [T167], and [Amino Acid, Peptide, or Protein]