Comparison of the Optimized Sets for 2014 and 2015
I. From 2014 to 2015:
The 2014 optimized set is based on 2013 SD-Rule data and serves as the baseline for 2015. Fifteen new SD-Rules were added to the 2014 SD-Rule set, evaluated, and used for the 2015 release. Of these, 11 are evaluated as good rules in the optimized set, 2 are bad rules, and 2 are duplicates (child rules of existing rules). In addition, 2 of the proposed rules are replaced in the optimized set by their child rules.
SD-Rule | Precision | Instances | Source | Results |
---|---|---|---|---|
Good Rules | ||||
se$|verb|zation$|noun | 100.00% | 1108 | NOM_D | Good SD-Rule |
sation$|noun|ze$|verb | 100.00% | 1071 | NOM_D | Good SD-Rule |
ility$|noun|le$|adj | 99.94% | 1625 | NOM_D | Good SD-Rule |
$|adj|ally$|adv | 99.08% | 2072 | ORG_D | Good SD-Rule |
ce$|noun|t$|adj | 98.82% | 847 | NOM_D | Child rule nce$|noun|nt$|adj is used |
cy$|noun|t$|adj | 98.77% | 406 | NOM_D | Good SD-Rule |
e$|verb|ion$|noun | 98.76% | 2336 | NOM_D | Good SD-Rule |
c$|adj|s$|noun | 91.46% | 281 | ORG_D | Child rule ic$|adj|is$|noun is used |
e$|verb|ing$|noun | 91.43% | 210 | Suggestions | Good SD-Rule |
ian$|adj|ia$|noun | 86.31% | 263 | Suggestions | Duplicated, parent rule an$|adj|a$|noun is used |
al$|adj|us$|noun | 84.35% | 262 | Suggestions | Good SD-Rule |
es$|noun|ic$|adj | 73.91% | 23 | Suggestions | Good SD-Rule |
Bad Rules | ||||
$|noun|ize$|verb | 59.05% | 442 | Suggestions | Bad SD-Rule |
ian$|noun|ia$|noun | 0.36% | 274 | Suggestions | Duplicated, parent rule an$|noun|a$|noun is a bad SD-Rule |
es$|noun|ic$|noun | 0.00% | 19 | Suggestions | Bad SD-Rule |
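The rule notation and the parent/child relation used in the table can be illustrated with a minimal sketch. It assumes that each rule reads as `suffix|category|suffix|category` with `$` marking the end of the word, and that a child rule is a parent rule with the same extra letters prepended to both suffixes. These readings are inferred from the rules listed above; the names `SDRule`, `parse_rule`, `derive`, and `is_child` are hypothetical and are not part of any released tool.

```python
from typing import NamedTuple, Optional


class SDRule(NamedTuple):
    """One suffix-derivation rule: a suffix and a category on each side."""
    suffix_a: str
    cat_a: str
    suffix_b: str
    cat_b: str


def parse_rule(text: str) -> SDRule:
    """Parse 'se$|verb|zation$|noun' into four fields (assumed format)."""
    suffix_a, cat_a, suffix_b, cat_b = text.strip().split("|")
    # '$' only marks the end of the word, so it is dropped from the suffix.
    return SDRule(suffix_a.rstrip("$"), cat_a, suffix_b.rstrip("$"), cat_b)


def derive(word: str, rule: SDRule) -> Optional[str]:
    """Swap suffix_a for suffix_b; return None if the word lacks suffix_a."""
    if not word.endswith(rule.suffix_a):
        return None
    stem = word[: len(word) - len(rule.suffix_a)]
    return stem + rule.suffix_b


def is_child(child: SDRule, parent: SDRule) -> bool:
    """True if child equals parent with the same letters prepended to both suffixes."""
    if (child.cat_a, child.cat_b) != (parent.cat_a, parent.cat_b):
        return False
    if not child.suffix_a.endswith(parent.suffix_a):
        return False
    extra = child.suffix_a[: len(child.suffix_a) - len(parent.suffix_a)]
    return bool(extra) and child.suffix_b == extra + parent.suffix_b


proposed = parse_rule("ce$|noun|t$|adj")
used = parse_rule("nce$|noun|nt$|adj")
print(is_child(used, proposed))       # child rule replaces the proposed rule
print(derive("importance", used))     # noun -> adjective candidate
```

A candidate produced by `derive` is only a string; whether it is a real derivation still has to be checked against the Lexicon, which is what the precision column measures.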
II. Comparison of SD-Rule sets:
Item | 2014 | 2015 |
---|---|---|
Total Unique Rules | 96 | 101 |
Total Good Rules | 73 | 76 |
Opti. System Precision | 95.30% | 95.22% |
Opti. System Recall | 95.01% | 95.70% |
Opti. System Performance | 1.9031 | 1.9093 |
Cutoff Rule | ar$|adj|e$|noun | ar$|adj|e$|noun |
Optimized Set | 2014 Optimized Set | 2015 Optimized Set |
Optimized Diagram |
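The performance figures in the table are consistent with the sum of precision and recall, each expressed as a fraction. The sketch below checks that reading. The definition is an assumption inferred from the numbers, not stated in the table, and `system_performance` is a hypothetical helper name.

```python
def system_performance(precision_pct: float, recall_pct: float) -> float:
    """Assumed definition: precision + recall, with percentages converted to fractions."""
    return precision_pct / 100 + recall_pct / 100


# Precision and recall as reported in the comparison table.
reported = {"2014": (95.30, 95.01), "2015": (95.22, 95.70)}

for year, (precision, recall) in reported.items():
    print(year, round(system_performance(precision, recall), 4))
```

Under this assumption the 2014 value reproduces the reported 1.9031 exactly. The 2015 value comes out at 1.9092 against the reported 1.9093, a gap that is plausibly due to the percentages being rounded to two decimals before display.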
For the optimal set:
III. Transaction Details:
The transactions for the SD-Rules are detailed below:
Type | 2014 | 2015 | Details |
---|---|---|---|
No Change | 65 | 65 | ... |
Parent-1-Child | 4 | 4 | |
Parent-2-Child | 4 | 2 | |
New in 2015 | 0 | 5 | |
Total | 73 | 76 | |
ID | Proposed New Rule | Source | Results | Rank & Rule 2015 | Rank & Rule 2014 | Type | Count Change | Accu. Count |
---|---|---|---|---|---|---|---|---|
Computer Generated SD-Rules | ||||||||
01-CG1 | se$|verb|zation$|noun | nomD | Good | 02: se$|verb|zation$|noun | None | New in 2015 | +1 | 74 |
02-CG2 | sation$|noun|ze$|verb | nomD | Good | 03: sation$|noun|ze$|verb | None | New in 2015 | +1 | 75 |
03-CG3 | ility$|noun|le$|adj | nomD | Good | 09: ility$|noun|le$|adj | 02: ability$|noun|able$|adj | Parent-1-Child | +0 | 75 |
04-CG4 | $|adj|ally$|adv | orgD | Good | 15: $|adj|ally$|adv | 08: ic$|adj|ically$|adv | Parent-1-Child | +0 | 75 |
05-CG5 | nce$|noun|nt$|adj | nomD | Good | 18: nce$|noun|nt$|adj | 16: ance$|noun|ant$|adj; 18: ence$|noun|ent$|adj | Parent-2-Child | -1 | 74 |
06-CG6 | cy$|noun|t$|adj | nomD | Good | 19: cy$|noun|t$|adj | 21: ency$|noun|ent$|adj | Parent-1-Child | +0 | 74 |
07-CG7 | e$|verb|ion$|noun | nomD | Good | 20: e$|verb|ion$|noun | 10: ate$|verb|ation$|noun; 63: se$|verb|sion$|noun | Parent-2-Child | -1 | 73 |
08-CG8 | c$|adj|s$|noun | orgD | Good | 43: ic$|adj|is$|noun | 41: ic$|adj|is$|noun | Child | +0 | 73 |
Expert-Suggested SD-Rules | ||||||||
09-ES1 | e$|verb|ing$|noun | Experts | Good | 45: e$|verb|ing$|noun | None | New in 2015 | +1 | 74 |
10-ES2 | al$|adj|us$|noun | Experts | Good | 61: al$|adj|us$|noun | None | New in 2015 | +1 | 75 |
11-ES3 | es$|noun|ic$|adj | Experts | Good | 67: es$|noun|ic$|adj | None | New in 2015 | +1 | 76 |
12-ES4 | $|noun|ize$|verb | Experts | Bad | 78: $|noun|ize$|verb | None | New | +0 | 76 |
13-ES5 | es$|noun|ic$|noun | Experts | Bad | 101: es$|noun|ic$|noun | None | New | +0 | 76 |
14-ES6 | ian$|adj|ia$|noun | Experts | Good | 57: a$|noun|an$|adj | 53: a$|noun|an$|adj | Duplicated-Child | +0 | 76 |
15-ES7 | ian$|noun|ia$|noun | Experts | Bad | 99: a$|noun|an$|noun | 93: a$|noun|an$|noun | Duplicated-Child | +0 | 76 |
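The Accu. Count column is a running total that starts from the 73 good rules of the 2014 optimized set and applies each Count Change in table order. A minimal sketch that reproduces the column from the values above (the variable names are illustrative only):

```python
from itertools import accumulate

BASELINE_2014 = 73  # good rules in the 2014 optimized set

# (ID, Count Change) in table order, copied from the table above.
changes = [
    ("01-CG1", +1), ("02-CG2", +1), ("03-CG3", 0), ("04-CG4", 0),
    ("05-CG5", -1), ("06-CG6", 0), ("07-CG7", -1), ("08-CG8", 0),
    ("09-ES1", +1), ("10-ES2", +1), ("11-ES3", +1), ("12-ES4", 0),
    ("13-ES5", 0), ("14-ES6", 0), ("15-ES7", 0),
]

# accumulate(..., initial=...) yields the baseline first, so drop that entry.
deltas = (delta for _, delta in changes)
running = list(accumulate(deltas, initial=BASELINE_2014))[1:]

for (rule_id, delta), total in zip(changes, running):
    print(f"{rule_id}  {delta:+d}  {total}")
```

The final value is 76, matching the 2015 total of good rules: five new rules add 5, and the two Parent-2-Child replacements each merge two 2014 rules into one, removing 2.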
In conclusion, the optimized set of SD-Rules is very stable, as expected: 65 of the 73 good rules from 2014 are unchanged in 2015. Does this imply that the Lexicon is a good representative subset of general English?