Derivational Variants Table
I. Inputs
II. Outputs
- File: deriVarsTable.data
- Format:
Base Form | Category of Base Form | Derivational Variant | Category of Derivation
|
Lowercase and make sure all dPairs are symmetric
III. Algorithm
- Get all base forms from inflVars.data.ascii
shell>flds 5,2 inflVars.data.ascii |sort -u > baseform.data
- Get derivational variants for all base forms
shell>GetDerivationalVariantsTableFromLvg ${LVG_CONFIG} baseform.data deriVarsTable.data.org
- Use ${LVG_CONFIG} to specify which lvg is used
- Instantiate LvgApi Java Object
- Domain filter: facts only, OutputFilter.LVG_ONLY
- Type: zeroD and suffixD, OutputFilter.D_TYPE_ZERO_SUFFIX
- Negation: no negation, OutputFilter.D_NEGATION_OTHERWISE
- Read in baseform.data and store "base form|category" in a list
- Go through each base form of above list
- get derivational variants
- check ASCII
- print out
- base form (lowercase)
- category of base form
- derivational variant (lowercase)
- category of deirvations
- Append dPairs from orginial Facts (unverified dPairs):
This is only for MetaMap, not synchronize with Lexical Tools
- Make sure symmetric for all dPairs
This is only for MetaMap, Lexical Tools takes care of the symmetric automatically.
- Sort and unify:
shell>sort -u citationFormTable.data.org > citationFormTable.data
- Check pure ASCII