
Lexical Tools

Original Derivations Table

I. Original Derivations: Modification Notes

  • The original derivations table in Lexical Tools was developed by linguists and used in the C versions. It includes 4,559 derivation pairs that come from five main sources.
  • In the 2012 release, LSG reviewed the derivations table and filtered out 75 synonyms and 11 mistakes, resulting in 4,473 derivation pairs:
    • total unique data: 4473 records
    • remove 75 synonyms from etc.fct (etc.fct.org.synonyms)
    • remove 11 mistakes (derivation.org.err.removed.txt)
    • In addition, 13 pairs from the original derivations table were modified (derivation.org.err.modify.txt)
  • In the 2013 release, these orgD pairs were excluded from SD-Facts because they lacked EUI information, which is required by the newly designed derivation DB table
  • In the 2014 release, LSG added verified orgD pairs to the DB derivation table:
    • retrieve orgD pairs with valid EUIs in Lexicon.2014 (3,928)
    • identify which of these orgD pairs are already included in the DB derivation table
    • validate the orgD pairs with EUIs that are not yet in the DB derivation table
      • suffixD: 1609
      • prefixD: 1
      • zeroD: 0 (all 220 zeroD pairs are already in the DB derivation table)
      • unknownD: 30
    • add lexical records for the remaining dPairs to the Lexicon (for future releases)
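The 2014 steps above can be sketched as a three-way partition of the orgD pairs. This is only an illustration under stated assumptions: the function name `partition_pairs`, the in-memory lexicon mapping, and every EUI value below are hypothetical placeholders, not the actual LSG tooling or real Lexicon data.

```python
from typing import Dict, List, Set, Tuple

# A derivation pair: (word 1, category 1, word 2, category 2).
Pair = Tuple[str, str, str, str]
# Hypothetical lexicon lookup: (word, category) -> EUI.
Lexicon = Dict[Tuple[str, str], str]


def parse_pair(line: str) -> Pair:
    """Parse one 'word|cat|word|cat' line into a 4-tuple."""
    w1, c1, w2, c2 = line.strip().split("|")
    return (w1, c1, w2, c2)


def partition_pairs(
    pairs: List[Pair], lexicon: Lexicon, db_pairs: Set[Pair]
) -> Tuple[List[Pair], List[Pair], List[Pair]]:
    """Split pairs into (no valid EUI, already in DB, still to validate)."""
    no_eui: List[Pair] = []
    in_db: List[Pair] = []
    to_validate: List[Pair] = []
    for pair in pairs:
        w1, c1, w2, c2 = pair
        if (w1, c1) not in lexicon or (w2, c2) not in lexicon:
            no_eui.append(pair)       # at least one side lacks an EUI
        elif pair in db_pairs:
            in_db.append(pair)        # nothing to add
        else:
            to_validate.append(pair)  # candidate for the DB derivation table
    return no_eui, in_db, to_validate


# Placeholder data: the EUIs and the DB membership are invented for the example.
lexicon = {
    ("pliable", "adj"): "E0000001",
    ("ply", "verb"): "E0000002",
    ("oxidize", "verb"): "E0000003",
    ("oxide", "noun"): "E0000004",
    ("haemolyse", "verb"): "E0000005",
}
pairs = [
    parse_pair("pliable|adj|ply|verb"),
    parse_pair("oxidize|verb|oxide|noun"),
    parse_pair("haemolyse|verb|hemolysis|noun"),
]
db_pairs = {parse_pair("pliable|adj|ply|verb")}

no_eui, in_db, to_validate = partition_pairs(pairs, lexicon, db_pairs)
print(len(no_eui), len(in_db), len(to_validate))
```

Pairs that end up in the first group correspond to the last step above: their lexical records would have to be added to the Lexicon before they can be considered in a future release.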

II. Modification Details

Detailed line counts for the derivations collected from the five main sources:

  • total lines: 10,763 (orgD.raw.data)
    • comment lines (#): 6,229
    • non-comment lines: 4,534
      • empty lines: 59
      • non-empty lines: 4,475
        • duplicate lines: 2
          sulphurise|verb|sulfurization|noun
          sulphurize|verb|sulfurization|noun
        • valid lines: 4,473 (orgD.yes.data, plus 1 empty line)
        • further verification reduces these to 4,467 dPairs (orgD.yes.type.data):
          • removing the following 6 dPairs:
            • apical|adj|apex|noun
            • lend|verb|loan|noun
            • neurotic|adj|nerve|noun
            • ovigerous|adj|ova|noun
            • puric|adj|pus|noun
            • uretic|adj|urine|noun
          • modifying the following:
            • heamolyse|verb|hemolysis|noun => haemolyse|verb|hemolysis|noun
            • heamolyze|verb|hemolysis|noun => haemolyze|verb|hemolysis|noun
            • oxidize|verb|oxygen|noun => oxidize|verb|oxide|noun
            • pliable|adj|ply|noun => pliable|adj|ply|verb
            • pliant|adj|ply|noun => pliant|adj|ply|verb

          • resulting distribution of the 4,467 dPairs by derivation type:
            • prefixD: 4 (orgD.yes.P.data.${YEAR})
            • zeroD: 353 (orgD.yes.Z.data.${YEAR})
            • suffixD: 4,110 (orgD.yes.type.data.${YEAR})
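The line accounting above (comment, empty, duplicate, and valid lines) can be reproduced with a small classifier. The following is a minimal sketch, assuming comment lines start with `#` and each data line has the form `word|category|word|category`; the function name `classify_lines` and the sample lines are illustrative and are not the script used to produce the counts in this section.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def classify_lines(lines: Iterable[str]) -> Tuple[Counter, List[Tuple[str, ...]]]:
    """Count comment, empty, duplicate, and valid lines; return valid pairs."""
    counts: Counter = Counter()
    seen = set()
    valid: List[Tuple[str, ...]] = []
    for raw in lines:
        line = raw.strip()
        counts["total"] += 1
        if line.startswith("#"):
            counts["comment"] += 1    # assumption: '#' marks a comment line
        elif not line:
            counts["empty"] += 1
        elif line in seen:
            counts["duplicate"] += 1  # repeated occurrence of an earlier pair
        else:
            seen.add(line)
            valid.append(tuple(line.split("|")))
            counts["valid"] += 1
    return counts, valid


# Small made-up sample in the same pipe-delimited layout.
sample = [
    "# nomiz.fct",
    "sulphurise|verb|sulfurization|noun",
    "",
    "sulphurise|verb|sulfurization|noun",
    "pliable|adj|ply|verb",
]
counts, valid = classify_lines(sample)
print(dict(counts))
```

With this scheme, the non-comment count is `total - comment`, and the valid count is the non-empty count minus the duplicates, which matches the arithmetic in the list above (4,475 - 2 = 4,473).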

III. Five Main Sources

Detailed distribution of the derivations across the five main sources:

  • convers.fct
    • contains conversions (zero derivations) from adj to verb
    • according to a previous study, rule-based zero derivation from adj to verb has an error rate of about 35%. Thus, it is important to use facts instead of rules for zero derivations.
    • 213 unique derivation pairs
  • nomiz.fct
    • taken from nominalization= slots in the lexicon
    • excludes derivation pairs covered by rules (dm.rul)
    • 700 derivation pairs (2 duplicated pairs):
      • sulphurise|verb|sulfurization|noun
      • sulphurize|verb|sulfurization|noun
    • 698 unique derivation pairs
  • dm.fct
    • derivations that are not covered by rules (dm.rul)
    • 510 unique derivation pairs
  • pd.fct
    • derivation pairs are taken from the public domain dictionary
    • they are sub-words and their headwords
    • excludes derivation pairs already covered by rules and other facts
    • 1,924 unique derivation pairs
  • etc.fct (75 synonyms are removed from etc.fct.org)
    • a miscellaneous fact file of morphologically related words
    • most are suffix derivations; a few are prefix derivations
    • 75 synonyms (## etc.fct.org.synonyms) are removed from etc.fct.org and the result is saved as etc.fct
    • 1,187 non-comment lines in etc.fct
    • 59 are empty lines
    • 1,128 unique pairs

    • includes pairs not specified in the rules and excludes pairs specified in the rules
      • -osis|noun|odial|adj: not productive enough for a rule
      • -ble|adj: irregular facts
      • Three-letter words: derivation suffix rules are not allowed to apply to stems of fewer than three letters.
      • -al|adj: have the meaning "of or pertaining to" the noun
      • -ar|adj: have the meaning "of or pertaining to" the noun
      • -ic|adj: have the meaning "of or pertaining to" the noun
      • other adj suffixes meaning "pertaining to" the noun, based on Dorland's definitions
        • -oid|adj
        • -ile|adj
        • -ine|adj
        • -ate|adj
        • -ive|adj
        • -an|adj
        • -ous|adj
        • -ant|adj
        • -ent|adj
        • -y|adj
        • -ary|adj
        • -ory|adj
      • adj means "characterized by" noun
      • adj means "marked by" noun
      • adj means relating to noun (from Dorland's definitions)
      • adj means resembling noun (from Dorland's definitions)
        • -oid|adj
        • -like|adj
        • -form|adj
      • adj means producing noun (from Dorland's definitions)
        • -genic|adj
        • -poietic|adj
        • -genetic|adj
        • -genous|adj
        • -parous|adj
        • -erous|adj
        • -oviferous|adj
        • -serous|adj
      • adj means derived from noun
        • -ic|adj
        • -genous|adj
      • adj means causing noun
        • -genic|adj
        • -genous|adj
      • adj means caused by noun
        • -genic|adj
        • -genous|adj
      • adj means exhibiting noun
      • verb/adjective rules (Dorland's dictionary definitions)
        • verb means "to render something adj"
        • verb means to make adj (trans)
        • verb means to make or become adj (trans, intrans)
        • verb means to become adj
      • verb/noun rules (Dorland's dictionary definitions)
        • verb means "to treat with noun"
        • verb means "to subject to noun" (trans)
        • verb means to put into a state of
        • verb means "to throw into a state or condition of "
        • verb means to put under the influence of noun
        • verb means to convert into noun (trans)
        • verb means turn something into noun (trans)
        • verb means to be converted into noun (intrans)
        • verb means become converted into noun
        • verb means to form (a) noun or some noun(s)
        • verb means to cause noun
        • verb means to induce noun
        • verb means to perform noun
        • verb means to perform noun upon (trans)
        • verb means to combine with noun (intrans)
        • verb means to cause to combine with noun (trans)
        • verb means combine or cause to combine with (trans, intrans)
        • verb means to impregnate with noun
        • verb means to add noun to
        • verb means to cover with noun
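The relationship between fact files and suffix rules described in this section can be illustrated with a small lookup that consults facts first and then applies rules, skipping any rule whose stem would be shorter than three letters. This is a hedged sketch of one way the two could be combined, not the Lexical Tools implementation: the rule tuple layout, the function names, and the sample `-ize` to `-ization` rule are assumptions made for the example and are not taken from dm.rul.

```python
from typing import Dict, List, Optional, Tuple

Term = Tuple[str, str]            # (word, category)
Rule = Tuple[str, str, str, str]  # (in suffix, in category, out suffix, out category)

MIN_STEM_LENGTH = 3  # suffix rules must not apply to shorter stems


def apply_rule(word: str, cat: str, rule: Rule) -> Optional[Term]:
    """Apply one suffix rule; return None if it does not match or the stem is too short."""
    in_suffix, in_cat, out_suffix, out_cat = rule
    if cat != in_cat or not word.endswith(in_suffix):
        return None
    stem = word[: len(word) - len(in_suffix)]
    if len(stem) < MIN_STEM_LENGTH:
        return None
    return (stem + out_suffix, out_cat)


def derivations(
    word: str, cat: str, facts: Dict[Term, List[Term]], rules: List[Rule]
) -> List[Term]:
    """Collect derivations from facts first, then from rules, without repeats."""
    results = list(facts.get((word, cat), []))
    for rule in rules:
        derived = apply_rule(word, cat, rule)
        if derived is not None and derived not in results:
            results.append(derived)
    return results


# One fact taken from the modified pairs listed earlier, plus one illustrative rule.
facts = {("oxidize", "verb"): [("oxide", "noun")]}
rules = [("ize", "verb", "ization", "noun")]

print(derivations("oxidize", "verb", facts, rules))
print(derivations("size", "verb", facts, rules))
```

In this sketch "size" yields no derivation because stripping "-ize" leaves the one-letter stem "s". Pairs of that kind, where a rule is blocked or would give a wrong result, are the ones that would need to be listed as facts.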