Lexical Tools

SD-Rule Transaction Details: 2017 to 2020

The SD-Rule transactions are detailed below:

  • The following table shows the transactions for the 11 newly proposed (non-duplicated) SD-Rules in 2020.

    ID      Proposed New Rule          Source   Results  Rank & Rule 2017  Rank & Rule 2020               Type         Count Change  Accu. Count
    Computer Generated SD-Rules
    01-CG1  ion$|noun|ory$|adj         orgD     Good     None              5: ion$|noun|ory$|adj          New in 2020  +1            87
    02-CG2  ability$|noun|ible$|adj    nomD     Good     None              10: ability$|noun|ible$|adj    New in 2020  +1            88
    03-CG3  sable$|adj|zability$|noun  nomD     Good     None              12: sable$|adj|zability$|noun  New in 2020  +1            89
    04-CG4  sability$|noun|zable$|adj  nomD     Good     None              13: sability$|noun|zable$|adj  New in 2020  +1            90
    05-CG5  sis$|noun|ze$|verb         nomD     Good     None              41: sis$|noun|ze$|verb         New in 2020  +1            91
    06-CG6  $|adj|s$|noun              orgD     Good     None              49: $|adj|s$|noun              New in 2020  +1            92
    07-CG7  al$|noun|e$|verb           nomD     Good     None              92: al$|noun|e$|verb           New in 2020  +1            93
    08-CG8  $|verb|age$|noun           orgD     Bad      None              100: $|verb|age$|noun          New in 2020  +0            93
    09-CG9  $|noun|ial$|adj            orgD     Bad      None              106: $|noun|ial$|adj           New in 2020  +0            93
    Expert-Suggested SD-Rules
    10-ES1  $|noun|oid$|noun           Experts  Bad      None              104: $|noun|oid$|noun          New in 2020  +0            93
    11-ES2  $|adj|oid$|adj             Experts  Bad      None              110: $|adj|oid$|adj            New in 2020  +0            93

    All 86 good SD-Rules from 2017 are still evaluated as good rules in 2020. Each one is either kept unchanged or replaced by its parent-rule or child-rules.
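    For readers unfamiliar with the notation, an SD-Rule such as ion$|noun|ory$|adj can be read as two suffix/category pairs, where $ marks the end of the word. The Python sketch below is illustrative only and is not the Lexical Tools implementation. The names SdRule, parse_sd_rule, and apply_sd_rule are hypothetical, the rule is assumed to apply in both directions, and each result is only a candidate that would still need to be checked against the Lexicon.

```python
from typing import List, NamedTuple, Tuple


class SdRule(NamedTuple):
    """One SD-Rule: a suffix and a category on each side (hypothetical container)."""
    suffix_a: str
    cat_a: str
    suffix_b: str
    cat_b: str


def parse_sd_rule(text: str) -> SdRule:
    """Split 'ion$|noun|ory$|adj' into four fields, dropping the '$' end marker."""
    suffix_a, cat_a, suffix_b, cat_b = text.split("|")
    return SdRule(suffix_a.rstrip("$"), cat_a, suffix_b.rstrip("$"), cat_b)


def apply_sd_rule(rule: SdRule, word: str, cat: str) -> List[Tuple[str, str]]:
    """Return candidate (word, category) pairs produced by the rule.

    Assumption: the rule may be applied in either direction.
    """
    candidates = []
    directions = (
        (rule.suffix_a, rule.cat_a, rule.suffix_b, rule.cat_b),
        (rule.suffix_b, rule.cat_b, rule.suffix_a, rule.cat_a),
    )
    for src, src_cat, dst, dst_cat in directions:
        if cat == src_cat and word.endswith(src):
            # Slice by length so an empty source suffix keeps the whole word.
            stem = word[: len(word) - len(src)]
            candidates.append((stem + dst, dst_cat))
    return candidates


rule = parse_sd_rule("ion$|noun|ory$|adj")
print(apply_sd_rule(rule, "migration", "noun"))
```

    An empty suffix, as in $|verb|age$|noun, means that nothing is removed before the other suffix is appended.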

  • Good SD-Rules count in Optimal Set:
    • The optimal set has 86 good rules in 2017 and 93 good rules in 2020.
    • In the evaluation, 7 of the 11 new rules are good and 4 are bad (the 7 duplicated rules are not included). The total number of good SD-Rules increases by 7 (from 86 to 93), because:
      • no duplicated rules are among the new SD-Rules under evaluation
      • no new rule has a parent-child relationship with an existing rule
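    The count arithmetic above can be reproduced with a short tally. This is a sketch that assumes each good rule adds one to the running total and each bad rule adds nothing, which holds here because none of the new rules duplicates or is related to an existing rule. The names RESULTS, BASELINE_2017, and accumulate are hypothetical; the data is copied from the table of proposed rules.

```python
# Evaluation result per proposed rule ID, in table order.
RESULTS = {
    "01-CG1": "Good", "02-CG2": "Good", "03-CG3": "Good", "04-CG4": "Good",
    "05-CG5": "Good", "06-CG6": "Good", "07-CG7": "Good", "08-CG8": "Bad",
    "09-CG9": "Bad", "10-ES1": "Bad", "11-ES2": "Bad",
}
BASELINE_2017 = 86  # good rules in the 2017 optimal set


def accumulate(baseline, results):
    """Return (rule_id, count_change, accumulated_count) rows."""
    rows = []
    total = baseline
    for rule_id, result in results.items():
        change = 1 if result == "Good" else 0
        total += change
        rows.append((rule_id, change, total))
    return rows


rows = accumulate(BASELINE_2017, RESULTS)
for rule_id, change, total in rows:
    print(f"{rule_id}  +{change}  {total}")
```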

  • Good Rules comparison (2017-2020):
    Type                2017  2020  Details
    No Change           84    83    ...
    Good Rule turn bad  0     0     N/A
    Parent-1-Child      2     3     2017                      2020
                                    53: osity$|noun|ous$|adj  2: bility$|noun|ble$|adj
                                    39: graph$|noun|graphy$   25: nce$|noun|nt$|adj
                                                              18: enesis$|noun|enic$|adj
    New in 2020         0     7     • 5: ion$|noun|ory$|adj
                                    • 10: ability$|noun|ible$|adj
                                    • 12: sable$|adj|zability$|noun
                                    • 13: sability$|noun|zable$|adj
                                    • 41: sis$|noun|ze$|verb
                                    • 49: $|adj|s$|noun
                                    • 92: al$|noun|e$|verb
    Total               86    93

  • In our process, we analyze the parent-child hierarchy only for SD-Rules whose parent and child rules co-exist in the collected set, because evaluating all parent-child rules is very expensive (time-consuming). Should we modify the process as follows?
    • Normalize all SD-Rules to their root parent-rules.
    • Analyze the parent-child hierarchy for all SD-Rules.

    In 2020, we spent 2 weeks evaluating 18 parent rules and 12 new rules (root parent-rules). If we switched to this process, there would be 101 parent rules to evaluate, which is very expensive.
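    The root-parent normalization proposed above could be sketched as follows. The sketch assumes that a parent rule is obtained by removing the same leading character from both suffixes of a child rule, so a root parent-rule is reached when the two suffixes no longer share a leading character. That definition is an assumption made for illustration, not one taken from this page, and the function names are hypothetical.

```python
from typing import Optional


def split_rule(rule: str):
    """Split 'bility$|noun|ble$|adj' into (suffix, category, suffix, category)."""
    suffix_a, cat_a, suffix_b, cat_b = rule.split("|")
    return suffix_a.rstrip("$"), cat_a, suffix_b.rstrip("$"), cat_b


def parent_rule(rule: str) -> Optional[str]:
    """Return the next more general rule, or None if the rule is already a root.

    Assumption: the parent drops one shared leading character from both suffixes.
    """
    suffix_a, cat_a, suffix_b, cat_b = split_rule(rule)
    if not suffix_a or not suffix_b or suffix_a[0] != suffix_b[0]:
        return None
    return f"{suffix_a[1:]}$|{cat_a}|{suffix_b[1:]}$|{cat_b}"


def root_parent_rule(rule: str) -> str:
    """Follow parent links until no shared leading character remains."""
    current = rule
    while True:
        parent = parent_rule(current)
        if parent is None:
            return current
        current = parent


for r in ("bility$|noun|ble$|adj", "enesis$|noun|enic$|adj", "sis$|noun|ze$|verb"):
    print(r, "->", root_parent_rule(r))
```

    Under this assumption, every distinct root returned by root_parent_rule would have to be evaluated together with its full chain of child rules, which is where the extra evaluation cost comes from.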

In conclusion, the optimized set of SD-Rules is very stable, as we expected. We believe this is one indication that the Lexicon is a good representative subset of general English.