Lexical Tools

Generate Spelling Variants

  • Short Description: Generate known spelling variants.

  • Full Description:

    This flow tries to preserve the proper inflection as well when reporting the spelling variants. Please note that this flow is case insensitive since it utilizes case insensitive flow of retrieving inflected terms (facts).

    The -m option display the EUI from the lexicon.

    The results are sorted by alphabetical order, frequency of category, inflection, and EUI. Output filter options of combining records by output and EUI, -CR:o or -CR:oe, are suggested to be used with this flow so that all the variant records with same output term and EUI are combined together before displayed.

    The Java version displays all variants by length, then ASCII order while C version displays results in ASCII order.

  • Difference:
    1. The Java version uses new inflection table in the database.
    2. The Java version displays all variants while C version combines variants with same spelling together. Same features can be performed by using the output filter option, -CR:oe, in the Java version.


  • Features:
    1. Generate all inflected forms with same citation form, category and inflection from the database.


  • Symbol: s

  • Examples:
    
    shell> lvg -f:s -m
    color
    color|color|128|1|s|1|E0017902|
    color|color|128|512|s|1|E0017902|
    color|color|1024|1|s|1|E0017903|
    color|color|1024|1024|s|1|E0017903|
    color|color|1024|262144|s|1|E0017903|
    color|color|1|1|s|1|E0792256|
    color|color|1|256|s|1|E0792256|
    color|colour|128|1|s|1|E0017902|
    color|colour|128|512|s|1|E0017902|
    color|colour|1024|1|s|1|E0017903|
    color|colour|1024|1024|s|1|E0017903|
    color|colour|1024|262144|s|1|E0017903|
    color|colour|1|1|s|1|E0792256|
    color|colour|1|256|s|1|E0792256|
    
    resume
    resume|resume|128|1|s|1|E0053099|
    resume|resume|128|512|s|1|E0053099|
    resume|resume|1024|1|s|1|E0053098|
    resume|resume|1024|1024|s|1|E0053098|
    resume|resume|1024|262144|s|1|E0053098|
    resume|resumé|128|1|s|1|E0053099|
    resume|resumé|128|512|s|1|E0053099|
    resume|résumé|128|1|s|1|E0053099|
    resume|résumé|128|512|s|1|E0053099|
    
    shell> lvg -f:s -m -CR:oc
    color
    color|color|1|257|s|1|E0792256|
    color|color|128|513|s|1|E0017902|
    color|color|1024|263169|s|1|E0017903|
    color|colour|1|257|s|1|E0792256|
    color|colour|128|513|s|1|E0017902|
    color|colour|1024|263169|s|1|E0017903|
    
    resume
    resume|resume|128|513|s|1|E0053099|
    resume|resume|1024|263169|s|1|E0053098|
    resume|resumé|128|513|s|1|E0053099|
    resume|résumé|128|513|s|1|E0053099|
    
    More examples

  • Implementation Logic:
    1. Retrieve records form from database Inflection table by using the input as inflected term (case insensitive).
    2. Eliminate records with same EUI, inflection, and category.
    3. Sort results by alphabetical order, category, inflection, EUI.
    4. Generate inflected terms with same EUI, category, and inflection from database.

  • Source Code: ToSpellingVariants.java

  • Hierarchy: Object -> Transformation -> ToGenerateSpellingVariants