The SPECIALIST Lexicon

Derivational Variants Table

I. Inputs

II. Outputs

  • File: deriVarsTable.data
  • Format:
    Base FormCategory of Base FormDerivational VariantCategory of Derivation

    Lowercase and make sure all dPairs are symmetric

III. Algorithm

  • Get all base forms from inflVars.data.ascii
    • shell>flds 5,2 inflVars.data.ascii |sort -u > baseform.data

      Base FormCategory

  • Get derivational variants for all base forms
    shell>GetDerivationalVariantsTableFromLvg ${LVG_CONFIG} baseform.data deriVarsTable.data.org
    • Use ${LVG_CONFIG} to specify which lvg is used
    • Instantiate LvgApi Java Object
      • Domain filter: facts only, OutputFilter.LVG_ONLY
      • Type: zeroD and suffixD, OutputFilter.D_TYPE_ZERO_SUFFIX
      • Negation: no negation, OutputFilter.D_NEGATION_OTHERWISE
    • Read in baseform.data and store "base form|category" in a list
    • Go through each base form of above list
      • get derivational variants
      • check ASCII
      • print out
        • base form (lowercase)
        • category of base form
        • derivational variant (lowercase)
        • category of deirvations

  • Append dPairs from orginial Facts (unverified dPairs):
    This is only for MetaMap, not synchronize with Lexical Tools

  • Make sure symmetric for all dPairs
    This is only for MetaMap, Lexical Tools takes care of the symmetric automatically.

  • Sort and unify:
    shell>sort -u citationFormTable.data.org > citationFormTable.data

  • Check pure ASCII