Lexical Tools

Original Derivations Table

I. Original derivations Modifications Notes

  • The original derivations table in Lexical Tools was developed and used in C versions by linguists. It includes 4,559 derivation pairs come from five main sources.
  • In 2012 release, LSG reviewed these derivations tables and 75 synonyms and 11 mistakes were filtered out to results in 4473 pairs of derivations:
    • total unique data: 4473 records
    • remove 75 synonyms from etc.fct (etc.fct.org.synonyms)
    • remove 11 mistakes (derivation.org.err.removed.txt)
    • In addition, 13 modifications (derivation.org.err.modify.txt) from the original derivations table
  • In 2013 release, these orgD pairs are excluded from SD-Facts due to lack of EUI information (which are needed in the new designed derivation DB table)
  • In 2014 release, LSG add verified orgD pairs in DB derivation table:
    • retrieve orgD pairs with valid EUIs in Lexicon.2014 (3,928)
    • identify above orgD pairs are already included DB derivation table
    • validate those orgD with EUIs that are not in DB derivation table
      • suffixD: 1609
      • prefixD: 1
      • zeroD: 0 (all 220 zerpD pairs are already in DB derivation table)
      • unknownD: 30
    • add lexical records of the rest of dPairs to Lexicon (for future releases)

II. Modification Details

The details derivations distribution from five main sources:

  • total line: 10,763 (orgD.raw.data)
    • comment No. (#): 6,229
    • no comment line: 4,534
      • emptyNo (empty line): 59
      • non-empty no: 4,475
        • duplicate No: 2
          sulphurise|verb|sulfurization|noun
          sulphurize|verb|sulfurization|noun
        • valid no: 4,473 (orgD.yes.data + 1 line is empty)
        • Further verification to 4,467 dPairs in (orgD.yes.type.data):
          • removing the following 6 dPairs:
            • apical|adj|apex|noun
            • lend|verb|loan|noun
            • neurotic|adj|nerve|noun
            • ovigerous|adj|ova|noun
            • puric|adj|pus|noun
            • uretic|adj|urine|noun
          • modifying the following:
            • heamolyse|verb|hemolysis|noun => haemolyse|verb|hemolysis|noun
            • heamolyze|verb|hemolysis|noun => haemolyze|verb|hemolysis|noun
            • oxidize|verb|oxygen|noun => oxidize|verb|oxide|noun
            • pliable|adj|ply|noun => pliable|adj|ply|verb
            • pliant|adj|ply|noun => pliant|adj|ply|verb

            • prefixD: 4 (orgD.yes.P.data.${YEAR})
            • zeroD: 353 (orgD.yes.Z.data.${YEAR})
            • suffixD: 4,110 (orgD.yes.type.data.${YEAR})

III. Five Main Sources

The details derivations distribution from five main sources:

  • convers.fct
    • contains conversion (zero derivations) for category from adj to verb
    • according to previous study, there is about 35% error on adj to verb zero derivations conversion. Thus, it is important to use facts instead of rules for zero derivations.
    • 213 unique derivation pairs
  • nomiz.fct
    • taken from nominalization= slots in lexicon
    • exclude derivation pairs covered in rules (dm.rul)
    • 700 derivation pairs (2 duplicated pairs)
        		
      • sulphurise|verb|sulfurization|noun
      • sulphurize|verb|sulfurization|noun
    • 698 unique derivation pairs
  • dm.fct
    • derivations are not covered by rules (dm.rul)
    • 510 unique derivation pairs
  • pd.fct
    • derivation pairs are taken from the public domain dictionary
    • they are sub-words and their headwords
    • exclude derivations pairs from rules and other facts
    • 1924 unique derivation pairs
  • etc.fct (75 synonyms are removed from etc.fct.org)
    • a miscellaneous fact file of morphologically related words
    • most are suffix and few prefix
    • 75 synonyms (## etc.fct.org.synonyms) are removed from etc.fct.org and saved as etc.fct
    • 1187 no comment line in etc.fct
    • 59 are empty line
    • 1128 unique pairs

    • include pairs not specified in the rules and exclude pairs specified in the rules
      • -osis|noun|odial|adj: not productive enough for a rule
      • -ble|adj: irregular facts
      • Three letter words: derivation suffix rules are not allowed to apply to stems of less than three letters.
      • -al|adj: have the meaning "of or pertaining to" the noun
      • -ar|adj: have the meaning "of or pertaining to" the noun
      • -ic|adj: have the meaning "of or pertaining to" the noun
      • other adj suffix means "pertaining to" noun based on Dorland's Definitions
        • -oid|adj
        • -ile|adj
        • -ine|adj
        • -ate|adj
        • -ive|adj
        • -an|adj
        • -ous|adj
        • -ant|adj
        • -ent|adj
        • -y|adj
        • -ary|adj
        • -ory|adj
        • -y|adj
      • adj means "characterized by" noun
      • adj means "marked by" noun
      • adj means relating to noun (derived from Dorlands defs)
      • adj means resembling noun (from Dorland's definitions)
        • -oid|adj
        • -like|adj
        • -form|adj
      • adj means producing noun (from Dorland's definitions)
        • -genic|adj
        • -poietic|adj
        • -genetic|adj
        • -genous|adj
        • -parous|adj
        • -erous|adj
        • -oviferous|adj
        • -serous|adj
      • adj means derived from Noun
        • -ic|adj
        • -genous|adj
      • adj means causing noun
        • -genic|adj
        • -genous|adj
      • adj means caused by noun
        • -genic|adj
        • -genous|adj
          • adj means exhibiting noun
        • verb/adjective rules (Dorland's dictionary definitions)
          • verb means "to render something adj"
          • verb means to make adj (tran)
          • verb means to make or become adj (tran intran)
          • verb means to become adj
        • verb/noun rules (Dorland's dictionary definitions)
          • verb means "to treat with noun"
          • verb means "to subject to noun" (trans)
          • verb means to put into a state of
          • verb means "to throw into a state or condition of "
          • verb means to put under the influence of noun
          • verb means to convert into NOUN (trans)
          • verb means turn something into noun (tran)
          • verb means to convert into NOUN (trans)
          • verb means to be converted into NOUN (intrans)
          • verb means become converted into noun
          • verb means to form (a) noun or some noun(s)
          • verb means to cause noun
          • verb means to induce noun
          • verb means to perform noun
          • verb means to perform noun upon (trans)
          • verb means to combine with noun (intran)
          • verb means to cause to combine with noun (tran)
          • verb means combine or cause to combine with (tran intran)
          • verb means to impregnate with noun
          • verb means to add noun to
          • verb means to cover with noun