Original Derivations Table
I. Original derivations Modifications Notes
- The original derivations table in Lexical Tools was developed and used in C versions by linguists. It includes 4,559 derivation pairs come from five main sources.
- In 2012 release, LSG reviewed these derivations tables and 75 synonyms and 11 mistakes were filtered out to results in 4473 pairs of derivations:
- total unique data: 4473 records
- remove 75 synonyms from etc.fct (etc.fct.org.synonyms)
- remove 11 mistakes (derivation.org.err.removed.txt)
- In addition, 13 modifications (derivation.org.err.modify.txt) from the original derivations table
- In 2013 release, these orgD pairs are excluded from SD-Facts due to lack of EUI information (which are needed in the new designed derivation DB table)
- In 2014 release, LSG add verified orgD pairs in DB derivation table:
- retrieve orgD pairs with valid EUIs in Lexicon.2014 (3,928)
- identify above orgD pairs are already included DB derivation table
- validate those orgD with EUIs that are not in DB derivation table
- suffixD: 1609
- prefixD: 1
- zeroD: 0 (all 220 zerpD pairs are already in DB derivation table)
- unknownD: 30
- add lexical records of the rest of dPairs to Lexicon (for future releases)
II. Modification Details
The details derivations distribution from five main sources:
- total line: 10,763 (orgD.raw.data)
- comment No. (#): 6,229
- no comment line: 4,534
- emptyNo (empty line): 59
- non-empty no: 4,475
- duplicate No: 2
sulphurise|verb|sulfurization|noun
sulphurize|verb|sulfurization|noun
- valid no: 4,473 (orgD.yes.data + 1 line is empty)
- Further verification to 4,467 dPairs in (orgD.yes.type.data):
- removing the following 6 dPairs:
- apical|adj|apex|noun
- lend|verb|loan|noun
- neurotic|adj|nerve|noun
- ovigerous|adj|ova|noun
- puric|adj|pus|noun
- uretic|adj|urine|noun
- modifying the following:
heamolyse|verb|hemolysis|noun
=> haemolyse|verb|hemolysis|noun
heamolyze|verb|hemolysis|noun
=> haemolyze|verb|hemolysis|noun
oxidize|verb|oxygen|noun
=> oxidize|verb|oxide|noun
pliable|adj|ply|noun
=> pliable|adj|ply|verb
pliant|adj|ply|noun
=> pliant|adj|ply|verb
- prefixD: 4 (orgD.yes.P.data.${YEAR})
- zeroD: 353 (orgD.yes.Z.data.${YEAR})
- suffixD: 4,110 (orgD.yes.type.data.${YEAR})
III. Five Main Sources
The details derivations distribution from five main sources:
- convers.fct
- contains conversion (zero derivations) for category from adj to verb
- according to previous study, there is about 35% error on adj to verb zero derivations conversion. Thus, it is important to use facts instead of rules for zero derivations.
- 213 unique derivation pairs
- nomiz.fct
- taken from nominalization= slots in lexicon
- exclude derivation pairs covered in rules (dm.rul)
- 700 derivation pairs (2 duplicated pairs)
- sulphurise|verb|sulfurization|noun
- sulphurize|verb|sulfurization|noun
- 698 unique derivation pairs
dm.fct
- derivations are not covered by rules (dm.rul)
- 510 unique derivation pairs
pd.fct
- derivation pairs are taken from the public domain dictionary
- they are sub-words and their headwords
- exclude derivations pairs from rules and other facts
- 1924 unique derivation pairs
etc.fct (75 synonyms are removed from etc.fct.org)
- a miscellaneous fact file of morphologically related words
- most are suffix and few prefix
- 75 synonyms (## etc.fct.org.synonyms) are removed from etc.fct.org and saved as etc.fct
- 1187 no comment line in etc.fct
- 59 are empty line
- 1128 unique pairs
- include pairs not specified in the rules and exclude pairs specified in the rules
- -osis|noun|odial|adj: not productive enough for a rule
- -ble|adj: irregular facts
- Three letter words: derivation suffix rules are not allowed to apply to stems of less than three letters.
- -al|adj: have the meaning "of or pertaining to" the noun
- -ar|adj: have the meaning "of or pertaining to" the noun
- -ic|adj: have the meaning "of or pertaining to" the noun
- other adj suffix means "pertaining to" noun based on Dorland's Definitions
- -oid|adj
- -ile|adj
- -ine|adj
- -ate|adj
- -ive|adj
- -an|adj
- -ous|adj
- -ant|adj
- -ent|adj
- -y|adj
- -ary|adj
- -ory|adj
- -y|adj
- adj means "characterized by" noun
- adj means "marked by" noun
- adj means relating to noun (derived from Dorlands defs)
- adj means resembling noun (from Dorland's definitions)
- -oid|adj
- -like|adj
- -form|adj
- adj means producing noun (from Dorland's definitions)
- -genic|adj
- -poietic|adj
- -genetic|adj
- -genous|adj
- -parous|adj
- -erous|adj
- -oviferous|adj
- -serous|adj
- adj means derived from Noun
- adj means causing noun
- adj means caused by noun
- -genic|adj
- -genous|adj
- adj means exhibiting noun
- verb/adjective rules (Dorland's dictionary definitions)
- verb means "to render something adj"
- verb means to make adj (tran)
- verb means to make or become adj (tran intran)
- verb means to become adj
- verb/noun rules (Dorland's dictionary definitions)
- verb means "to treat with noun"
- verb means "to subject to noun" (trans)
- verb means to put into a state of
- verb means "to throw into a state or condition of "
- verb means to put under the influence of noun
- verb means to convert into NOUN (trans)
- verb means turn something into noun (tran)
- verb means to convert into NOUN (trans)
- verb means to be converted into NOUN (intrans)
- verb means become converted into noun
- verb means to form (a) noun or some noun(s)
- verb means to cause noun
- verb means to induce noun
- verb means to perform noun
- verb means to perform noun upon (trans)
- verb means to combine with noun (intran)
- verb means to cause to combine with noun (tran)
- verb means combine or cause to combine with (tran intran)
- verb means to impregnate with noun
- verb means to add noun to
- verb means to cover with noun