Lexical Tools

Input Category Filter

  • Description:

    This option specifies which field of the input holds the syntactic categories of the input term.

    This option affects the following flow components: b, Ct, d, dc, e, i, ici, L, Lp, Ln, R, r, s, y.

    Note: Fields are numbered from 1. If this option is not present, all syntactic categories are applied in the inflection and derivation modules. If this option is present, the specified field must contain a long integer formed by OR'ing (or adding) the following values:

    Value   Variant   Other Symbols
    1       adj       adjective, ADJ
    2       adv       adverb, ADV
    4       aux       auxiliary
    8       compl     complementizer
    16      conj      conjunction, CON, con
    32      det       determiner, DET
    64      modal
    128     noun      NOM, NPR
    256     prep      preposition, PRE, pre
    512     pron      pronoun
    1024    verb      VER, ver
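
    The category values listed above behave as bit flags, so several categories can be packed into one integer. The following is a minimal Python sketch of that arithmetic, not part of the tool itself; the names CATEGORY_VALUES, encode_categories, and decode_categories are hypothetical helpers introduced only for illustration.

```python
# Category values copied from the table above.
CATEGORY_VALUES = {
    "adj": 1, "adv": 2, "aux": 4, "compl": 8, "conj": 16, "det": 32,
    "modal": 64, "noun": 128, "prep": 256, "pron": 512, "verb": 1024,
}


def encode_categories(names):
    """Combine category names into one integer by bitwise OR."""
    value = 0
    for name in names:
        value |= CATEGORY_VALUES[name]
    return value


def decode_categories(value):
    """List the category names whose bit is set in value, lowest bit first."""
    return [name for name, bit in CATEGORY_VALUES.items() if value & bit]


print(encode_categories(["noun", "verb"]))  # 128 | 1024 = 1152
print(decode_categories(1152))
```

    OR'ing and adding give the same result only when each category is used once: OR'ing noun with itself stays 128, whereas adding 128 + 128 yields 256, which is the value for prep.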


  • Features:
    1. Assign the categories specified in the input field to the source category of the input LexItem.


  • Symbol: cf:INT

  • Examples:

    shell> lvg -f:i -cf:2
    left|1
    left|1|left|1|1|i|1|
    left|1|left|1|256|i|1|
    
    left|2
    left|2|left|2|1|i|1|
    left|2|left|2|256|i|1|
    
    left|4
    
    left|128
    left|128|left|128|1|i|1|
    left|128|left|128|8|i|1|
    left|128|left|128|512|i|1|
    left|128|lefts|128|8|i|1|
    
    left|1024
    left|1024|left|1024|32|i|1|
    left|1024|left|1024|64|i|1|
    left|1024|leave|1024|1|i|1|
    left|1024|leave|1024|262144|i|1|
    left|1024|leave|1024|1024|i|1|
    left|1024|leaves|1024|128|i|1|
    left|1024|leaving|1024|16|i|1|
    
  • Implementation Logic:
    1. Get the specified field number of the input category filter.
    2. Get the specified categories from the input term by the specified field number.
    3. Assign the specified categories to the source category of the input LexItem.
    4. Apply the filter to the flows: b, Ct, d, dc, e, i, ici, L, Lp, Ln, R, r, s, y.
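
    The steps above can be sketched roughly as follows in Python. This is an illustration under stated assumptions, not the tool's actual code: the LexItem class here is a hypothetical stand-in with only two fields, the term is assumed to sit in field 1, "all categories" is assumed to be represented as 2047 (every category bit set), and the filter is assumed to keep a candidate when its category shares a bit with the source category.

```python
from dataclasses import dataclass

ALL_CATEGORIES = 2047  # assumption: all eleven category bits set (1 + 2 + ... + 1024)


@dataclass
class LexItem:
    """Hypothetical stand-in for the tool's LexItem; only the fields used here."""
    source_term: str
    source_category: int = ALL_CATEGORIES


def read_lex_item(line, category_field=None, separator="|"):
    """Steps 1-3: read the category from the given 1-based field, if any."""
    fields = line.split(separator)
    item = LexItem(source_term=fields[0])  # assumption: the term is in field 1
    if category_field is not None:
        item.source_category = int(fields[category_field - 1])
    return item


def filter_by_category(item, candidates):
    """Step 4: keep (term, category) pairs that overlap the source category."""
    return [(term, cat) for term, cat in candidates if cat & item.source_category]


# Illustrative candidate records for "left": adjective, adverb, noun, verb.
candidates = [("left", 1), ("left", 2), ("left", 128), ("leave", 1024)]

noun_only = read_lex_item("left|128", category_field=2)
print(filter_by_category(noun_only, candidates))

unfiltered = read_lex_item("left")
print(filter_by_category(unfiltered, candidates))
```

    With the category field set, only the noun record survives; without it, every candidate is kept, which mirrors the note that all syntactic categories apply when the option is absent.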