The SPECIALIST Lexicon

Derivational Variants Table

I. Inputs

II. Outputs

  • File: deriVarsTable.data
  • Format:
    Base Form | Category of Base Form | Derivational Variant | Category of Derivation

    All entries are lowercased, and all dPairs are symmetric.

III. Algorithm

  • Get all base forms from inflVars.data.ascii
    • shell>flds 5,2 inflVars.data.ascii |sort -u > baseform.data

      Base Form | Category
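
`flds` appears to be a local field-selection utility rather than a standard command. Assuming inflVars.data.ascii is pipe-delimited with the base form in field 5 and the category in field 2, and assuming `flds` keeps the pipe separator in its output, a roughly equivalent step with standard tools might look like the sketch below. The function name `extract_base_forms` is hypothetical.

```shell
# Sketch only: a stand-in for `flds 5,2 ... | sort -u` built from awk and sort.
# Assumption: the input is pipe-delimited, field 5 = base form, field 2 = category.
extract_base_forms() {
    # Print "base form|category" for every line, then sort and drop duplicates.
    # LC_ALL=C makes the sort order byte-wise and reproducible across machines.
    awk -F'|' '{ print $5 "|" $2 }' "$1" | LC_ALL=C sort -u
}
```

It could be called as `extract_base_forms inflVars.data.ascii > baseform.data`.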

  • Get derivational variants for all base forms
    shell>GetDerivationalVariantsTableFromLvg ${LVG_CONFIG} baseform.data deriVarsTable.data.org
    • Use ${LVG_CONFIG} to specify which lvg is used
    • Instantiate LvgApi Java Object
      • Domain filter: facts only, OutputFilter.LVG_ONLY
      • Type: zeroD and suffixD, OutputFilter.D_TYPE_ZERO_SUFFIX
      • Negation: no negation, OutputFilter.D_NEGATION_OTHERWISE
    • Read in baseform.data and store "base form|category" in a list
    • For each base form in the list above:
      • get derivational variants
      • check ASCII
      • print out
        • base form (lowercase)
        • category of base form
        • derivational variant (lowercase)
        • category of derivation
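
The lowercasing in the print step can be illustrated outside the Java program. The sketch below does not call LvgApi; it only assumes a pipe-delimited file of four fields (base form, category, derivational variant, category) and lowercases the two term fields. The function name `lowercase_dpairs` is hypothetical.

```shell
# Sketch only: lowercase fields 1 and 3 of a four-field, pipe-delimited file.
# Assumption: fields are base form | category | derivational variant | category.
lowercase_dpairs() {
    # Assigning to $1 and $3 makes awk rebuild the record using OFS,
    # so the pipe separator is preserved in the output.
    awk -F'|' -v OFS='|' '{ $1 = tolower($1); $3 = tolower($3); print }' "$1"
}
```

The category fields are left untouched here, since the steps above only call for lowercasing the base form and the derivational variant.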

  • Append dPairs from original Facts (unverified dPairs):
    This is only for MetaMap and is not synchronized with Lexical Tools.

  • Make sure all dPairs are symmetric
    This is only for MetaMap; Lexical Tools takes care of symmetry automatically.
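
One way to enforce symmetry is to emit every dPair together with its reverse and then drop duplicates. The sketch below assumes the four-field, pipe-delimited layout described under Outputs; the function name `make_symmetric` and the sample categories in the usage note are illustrative, not taken from the actual build scripts.

```shell
# Sketch only: make a dPair table symmetric.
# Assumption: each line is "term A|category A|term B|category B".
make_symmetric() {
    # For every pair A -> B, also print B -> A, then sort and de-duplicate
    # so pairs that were already symmetric appear only once.
    awk -F'|' '{
        print $1 "|" $2 "|" $3 "|" $4
        print $3 "|" $4 "|" $1 "|" $2
    }' "$1" | LC_ALL=C sort -u
}
```

For example, an input containing only `kind|adj|kindness|noun` would yield that line plus `kindness|noun|kind|adj`. The de-duplication overlaps with the later sort step, so it could be dropped if that step always follows.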

  • Sort and unify:
    shell>sort -u deriVarsTable.data.org > deriVarsTable.data

  • Check pure ASCII
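
The page does not say how the ASCII check is performed. A minimal sketch with standard tools is to delete every byte in the ASCII range and count what remains; a count of zero means the file is pure ASCII. The function name `check_pure_ascii` is hypothetical.

```shell
# Sketch only: succeed (exit status 0) if the file contains only ASCII bytes.
check_pure_ascii() {
    # Delete all bytes 0x00-0x7F (octal 000-177); anything left is non-ASCII.
    # The second tr strips padding that some wc implementations add.
    non_ascii=$(LC_ALL=C tr -d '\000-\177' < "$1" | wc -c | tr -d '[:space:]')
    [ "$non_ascii" -eq 0 ]
}
```

It could be used as `check_pure_ascii deriVarsTable.data || echo "non-ASCII bytes found"`.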