Because of a lapse in government funding, the information on this website may not be up to date, transactions submitted via the website may not be processed, and the agency may not be able to respond to inquiries until appropriations are enacted. The NIH Clinical Center (the research hospital of NIH) is open. For more details about its operating status, please visit cc.nih.gov. Updates regarding government operating status and resumption of normal operations can be found at OPM.gov.

Text Categorization

STRI Input Filter Options

STRI provides input filter option for users to filter out irrelevant words. Such as word extraction filter to filter out punctuation, stopwords, words not in the restrictwords list, words not in the legal words list, non-unique words, etc.. It also provides a detail filtering message for debugging purpose.

The table below lists all input filter options of STRI:

  • Input Filter option

    Option FlagFeature Descriptions
    -if:aUse Acronyms filter
    -if:dShow detailed information for input filter
    -if:eNot use words extraction filter
    -if:hShow input filter help menu
    -if:uUse unique words filter (remove duplicate words)

  • Legal Word Filter options