Comparison of Optimized Sets for 2014, 2015, and 2016 (TBD)
I. New SD-Rules Evaluation Results:
This approach was applied in three releases (2014, 2015, and 2016) to derive the optimized SD-Rule set.
SD-Rule | Rank | Precision | Instances | Source | Decompose | Results |
---|---|---|---|---|---|---|
Duplicated Rules | ||||||
ian$|adj|ia$|noun | 57 | 86.31% | 263 | Suggestions | 1G-Child | Duplicate of good parent-rule an$|adj|a$|noun |
ian$|noun|ia$|noun | 99 | 0.36% | 274 | Suggestions | 1G-Child | Duplicate of bad parent-rule an$|noun|a$|noun |
Good Rules | ||||||
se$|verb|zation$|noun | 2 | 100.00% | 1108 | NOM_D | Root-Parent | Good SD-Rule |
sation$|noun|ze$|verb | 3 | 100.00% | 1071 | NOM_D | Root-Parent | Good SD-Rule |
ility$|noun|le$|adj | 9 | 99.94% | 1625 | NOM_D | Root-Parent | Good SD-Rule |
$|adj|ally$|adv | 15 | 99.08% | 2072 | ORG_D | Root-Parent | Good SD-Rule |
nce$|noun|nt$|adj | 18 | 98.82% | 847 | NOM_D | 1G-Child | Good SD-Rule |
cy$|noun|t$|adj | 19 | 98.77% | 406 | NOM_D | Root-Parent | Good SD-Rule |
e$|verb|ion$|noun | 20 | 98.76% | 2336 | NOM_D | Root-Parent | Good SD-Rule |
ic$|adj|is$|noun | 43 | 91.46% | 281 | ORG_D | 1G-Child | Good SD-Rule |
e$|verb|ing$|noun | 45 | 91.43% | 210 | Suggestions | Root-Parent | Good SD-Rule |
al$|adj|us$|noun | 61 | 84.35% | 262 | Suggestions | Root-Parent | Good SD-Rule |
es$|noun|ic$|adj | 67 | 73.91% | 23 | Suggestions | Root-Parent | Good SD-Rule |
Bad Rules | ||||||
$|noun|ize$|verb | 78 | 59.05% | 442 | Suggestions | Root-Parent | Bad SD-Rule |
es$|noun|ic$|noun | 101 | 0.00% | 19 | Suggestions | Root-Parent | Bad SD-Rule |
SD-Rule | Rank | Precision | Instances | Source | Decompose | Results |
---|---|---|---|---|---|---|
Duplicated Rules | ||||||
e$|verb|ing$|noun | 47 | 91.47% | 211 | NOM_D | Root-Parent | Duplicate of a good rule |
Good Rules | ||||||
genesis$|noun|genic$|adj | 13 | 99.52% | 207 | EXP_SUG | 3G-Child | Good SD-Rule |
se$|verb|sis$|noun | 27 | 97.87% | 141 | NOM_D | 1G-Child | Good SD-Rule |
sia$|noun|tic$|adj | 40 | 94.17% | 103 | ORG_D | Root-Parent | Good SD-Rule |
on$|noun|ve$|adj | 48 | 91.46% | 1253 | ORG_D | Root-Parent | Good SD-Rule |
e$|noun|ic$|adj | 49 | 91.40% | 1267 | ORG_D | Root-Parent | Good SD-Rule |
$|adj|ism$|noun | 51 | 90.79% | 369 | NOM_D | Root-Parent | Good SD-Rule |
ation$|noun|ed$|adj | 67 | 83.95% | 405 | NOM_D | Root-Parent | Good SD-Rule |
$|noun|ship$|noun | 70 | 80.45% | 133 | ORG_D | Root-Parent | Good SD-Rule |
Bad Rules | ||||||
e$|adj|ion$|noun | 88 | 54.60% | 359 | NOM_D | Root-Parent | Bad SD-Rule |
$|noun|age$|noun | 96 | 36.97% | 119 | ORG_D | Root-Parent | Bad SD-Rule |
al$|adj|ine$|noun | 98 | 32.65% | 49 | EXP_SUG | Root-Parent | Bad SD-Rule |
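To make the rule notation in the tables above concrete, here is a minimal Python sketch of how one SD-Rule string could be parsed and applied to a word. It assumes each rule reads as in-suffix | in-category | out-suffix | out-category, with `$` marking the end of the word. The names `SdRule`, `parse_sd_rule`, and `apply_sd_rule` are hypothetical and are not part of the Lexical Tools API.

```python
from typing import NamedTuple, Optional, Tuple


class SdRule(NamedTuple):
    """One SD-Rule, read as: in-suffix | in-category | out-suffix | out-category."""
    in_suffix: str
    in_cat: str
    out_suffix: str
    out_cat: str


def parse_sd_rule(text: str) -> SdRule:
    """Split a rule string such as 'ility$|noun|le$|adj' into its four fields."""
    fields = text.split("|")
    if len(fields) != 4:
        raise ValueError(f"expected 4 '|'-separated fields, got {len(fields)}: {text!r}")
    in_suffix, in_cat, out_suffix, out_cat = fields
    # Assumption: the trailing '$' only anchors the suffix at the word end, so drop it.
    return SdRule(in_suffix.rstrip("$"), in_cat, out_suffix.rstrip("$"), out_cat)


def apply_sd_rule(rule: SdRule, word: str, cat: str) -> Optional[Tuple[str, str]]:
    """Return the derived (word, category) candidate, or None if the rule does not match."""
    if cat != rule.in_cat or not word.endswith(rule.in_suffix):
        return None
    # Remove the in-suffix (possibly empty) and attach the out-suffix.
    stem = word[: len(word) - len(rule.in_suffix)]
    return stem + rule.out_suffix, rule.out_cat


if __name__ == "__main__":
    rule = parse_sd_rule("ility$|noun|le$|adj")
    print(apply_sd_rule(rule, "ability", "noun"))
```

A rule applied this way only proposes a candidate pair. Whether the candidate is a valid derivation still has to be checked, which is presumably what the Precision and Instances columns summarize for each rule.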
II. Comparison of SD-Rule set:
Item | 2014 | 2015 | 2016 |
---|---|---|---|
Baseline Set (includes parent-child rules) | 107 | 120 | 132 |
Total Unique Rules | 96 | 101 | 111 |
Total Good Rules | 73 | 76 | 82 |
Total Valid SD-pairs (SD-Facts: Relevant) | 42,552 | 46,950 | 50,814 |
Opti. System Precision | 95.30% | 95.22% | 95.00% |
Opti. System Recall | 95.01% | 95.70% | 95.26% |
Opti. System Performance | 1.9031 | 1.9093 | 1.9026 |
Cutoff Rule | ar$|adj|e$|noun | ar$|adj|e$|noun | $|noun|ist$|noun |
Optimized Set | 2014 Optimized Set | 2015 Optimized Set | 2016 Optimized Set |
For the Optimal set:
ar$|adj|e$|noun
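The system performance figures in the comparison table appear to be the sum of system precision and system recall (for example, 0.9530 + 0.9501 = 1.9031 for 2014). That definition is inferred from the numbers and is not stated in the text, so the short Python sketch below treats it as an assumption and checks it against the reported values. The function names are hypothetical.

```python
# Precision, recall, and performance copied from the comparison table,
# with the percentages written as fractions.
REPORTED = {
    2014: {"precision": 0.9530, "recall": 0.9501, "performance": 1.9031},
    2015: {"precision": 0.9522, "recall": 0.9570, "performance": 1.9093},
    2016: {"precision": 0.9500, "recall": 0.9526, "performance": 1.9026},
}


def performance(precision, recall):
    """Assumed definition: performance = precision + recall (maximum 2.0)."""
    return precision + recall


def check(tolerance=2e-4):
    """Return, per year, whether the assumed formula matches the reported value."""
    results = {}
    for year, row in REPORTED.items():
        computed = performance(row["precision"], row["recall"])
        # The tolerance allows for rounding of the published percentages.
        results[year] = abs(computed - row["performance"]) <= tolerance
    return results


if __name__ == "__main__":
    for year, ok in check().items():
        print(year, "consistent" if ok else "inconsistent")
```

The 2015 value differs from the plain sum by about 0.0001, which is consistent with the published percentages being rounded to two decimals.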
III. Transaction History:
Release | Baseline (collected candidate SD-Rules) | Unique Rules (child-rules removed from Baseline) | Good Rules (used in Lexical Tools SD-Rule set) |
---|---|---|---|
2014 | 107 | 96 | 73 |
New Rules | 15 | | |
2015 | 120 | | |
Details:
In conclusion, the optimized SD-Rule set is very stable across releases, as expected: system precision and recall both stay at about 95% in 2014, 2015, and 2016.