Text Categorization

StWsd System Options (-d)

  • Description:

    This option is used to show detail information of

    • Input text
    • Input filtering details
    • Sti scores in both DC and WC.

  • Examples:
    > stWsd -aw:culture -can:idcn:lbpr -p -d
    - Please input a term (type "Ctl-d" to quit) >
    Cultural assessment in home healthcare.
    --> Found best sense for [culture] in the ST of [idcn|Idea or Concept]
    ==================================================
    ------ Input Information ------
    -- Ambiguous word: [culture]
    -- ST candidates: [idcn|lbpr]
    -- In Context: [Cultural assessment in home healthcare.]
    ------ Selected Options ------
    -- ST Index method: [Sti]
    -- Score type: [Document count]
    -- Use Ambiguous Sentences: [false]
    ------ Option Process Details ------
    -- AS Context: [Cultural assessment in home healthcare.]
    -- Forced Legal Words: [culturability|culturable|cultural|culturally|culture|cultured|cultures|culturing]
    ------ Input Filter Details ------
    --> Input text: [Cultural assessment in home healthcare.]
    -- Words after Acronym filter [cultural assessment in home healthcare.], Acronym filter is not used.
    -- W.E. filtered words (4): [cultural assessment home healthcare], W.E. filter is used
    -- Legal words (2): [cultural healthcare]
    ---  Legal words selected options:
       - Min. length: true (3)
       - Remove stopwords: true
       - Restrictwords only: true
       - Min. normalized count: false (2)
       - Max. normalized count: false (792054)
       - Min. WC: false (2)
       - Min. DC: false (2)
       - Illegal words details: 
         - [assessment]: it is a stopword.
         - [home]: it is a stopword.
    -- Unique words (2): [cultural healthcare], unique word filter is not used
    -- Final words (2): [cultural healthcare]
    -- Total final words used: 2
    ------ Sti Scores Details ------
    --- ST scores (x 1) and rank based on word count ---
    15|0.5806|idcn|T078|Idea or Concept
    53|0.1752|lbpr|T059|Laboratory Procedure
    --- ST scores (x 1) and rank based on document count ---
    14|0.6413|idcn|T078|Idea or Concept
    48|0.2093|lbpr|T059|Laboratory Procedure