Lexical Tools

Results of Proposed Rules - 2016

I. Results

Eleven SD-Rules were proposed for addition to the SD-Rule set and evaluated. The results are as follows:

  • Good/bad rules:
    • 8 are evaluated as good rules in the optimized set.
    • 3 are bad rules.

  • Experts' suggestions: 50% (1/2) are good.
  • Computation rules: 78% (7/9) are good.
    • NOM_D: 75% (3/4) are good.
    • ORG_D: 80% (4/5) are good.

  • Also, in the optimized set, 2 child rules replace the proposed root-parent rules:
    • genesis$|noun|genic$|adj from esis$|noun|ic$|adj
    • se$|verb|sis$|noun from e$|verb|is$|noun
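The parent/child relation between the two pairs of rules above can be checked mechanically. The following is a minimal sketch, not the Lexical Tools implementation: it assumes that a child rule is formed by prepending the same leading characters to both suffixes of its parent, and that the number of prepended characters is the child's generation. The helper names parse_sd_rule and child_generation are hypothetical.

```python
def parse_sd_rule(rule):
    """Split 'suffix1$|cat1|suffix2$|cat2' into (suffix1, cat1, suffix2, cat2).

    The trailing '$' (end-of-word marker) is dropped from each suffix.
    """
    in_suffix, in_cat, out_suffix, out_cat = rule.split("|")
    return in_suffix.rstrip("$"), in_cat, out_suffix.rstrip("$"), out_cat


def child_generation(child, parent):
    """Return how many shared leading characters turn `parent` into `child`.

    Returns None when `child` is not a child of `parent` under the
    assumption stated above (same categories, same prepended characters).
    """
    c_in, c_cat1, c_out, c_cat2 = parse_sd_rule(child)
    p_in, p_cat1, p_out, p_cat2 = parse_sd_rule(parent)
    if (c_cat1, c_cat2) != (p_cat1, p_cat2):
        return None
    if not (c_in.endswith(p_in) and c_out.endswith(p_out)):
        return None
    added_in = c_in[: len(c_in) - len(p_in)]
    added_out = c_out[: len(c_out) - len(p_out)]
    if not added_in or added_in != added_out:
        return None
    return len(added_in)


print(child_generation("genesis$|noun|genic$|adj", "esis$|noun|ic$|adj"))
print(child_generation("se$|verb|sis$|noun", "e$|verb|is$|noun"))
```

Under this reading, the first pair differs by the three characters "gen" and the second by the single character "s".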

SD-Rule                    Rank  Precision  Instances  Source   Decompose    Results
Good Rules
genesis$|noun|genic$|adj   13    99.52%     207        EXP_SUG  3G-Child     Good SD-Rule
se$|verb|sis$|noun         27    97.87%     141        NOM_D    1G-Child     Good SD-Rule
sia$|noun|tic$|adj         40    94.17%     103        ORG_D    Root-Parent  Good SD-Rule
on$|noun|ve$|adj           48    91.46%     1253       ORG_D    Root-Parent  Good SD-Rule
e$|noun|ic$|adj            49    91.40%     1267       ORG_D    Root-Parent  Good SD-Rule
$|adj|ism$|noun            51    90.79%     369        NOM_D    Root-Parent  Good SD-Rule
ation$|noun|ed$|adj        67    83.95%     405        NOM_D    Root-Parent  Good SD-Rule
$|noun|ship$|noun          70    80.45%     133        ORG_D    Root-Parent  Good SD-Rule
Bad Rules
e$|adj|ion$|noun           88    54.60%     359        NOM_D    Root-Parent  Bad SD-Rule
$|noun|age$|noun           96    36.97%     119        ORG_D    Root-Parent  Bad SD-Rule
al$|adj|ine$|noun          98    32.65%     49         EXP_SUG  Root-Parent  Bad SD-Rule
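The per-source percentages quoted in the summary can be recomputed from the table. The sketch below transcribes each rule's source and good/bad result from the table above; the ROWS layout and the tally helper are illustrative assumptions, not part of Lexical Tools.

```python
from collections import defaultdict

# (SD-Rule, source, is_good), transcribed from the table above.
ROWS = [
    ("genesis$|noun|genic$|adj", "EXP_SUG", True),
    ("se$|verb|sis$|noun", "NOM_D", True),
    ("sia$|noun|tic$|adj", "ORG_D", True),
    ("on$|noun|ve$|adj", "ORG_D", True),
    ("e$|noun|ic$|adj", "ORG_D", True),
    ("$|adj|ism$|noun", "NOM_D", True),
    ("ation$|noun|ed$|adj", "NOM_D", True),
    ("$|noun|ship$|noun", "ORG_D", True),
    ("e$|adj|ion$|noun", "NOM_D", False),
    ("$|noun|age$|noun", "ORG_D", False),
    ("al$|adj|ine$|noun", "EXP_SUG", False),
]


def tally(rows):
    """Return {source: (good_count, total_count)}."""
    counts = defaultdict(lambda: [0, 0])
    for _rule, source, is_good in rows:
        counts[source][1] += 1
        if is_good:
            counts[source][0] += 1
    return {source: (good, total) for source, (good, total) in counts.items()}


counts = tally(ROWS)
for source, (good, total) in sorted(counts.items()):
    print(f"{source}: {good}/{total} = {round(100 * good / total)}%")

# Computation rules are the NOM_D and ORG_D rules combined.
comp_good = counts["NOM_D"][0] + counts["ORG_D"][0]
comp_total = counts["NOM_D"][1] + counts["ORG_D"][1]
print(f"Computation rules: {comp_good}/{comp_total} = {round(100 * comp_good / comp_total)}%")
```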

II. Further Observation on NOM_D

The top 22 SD-Rules generated from NOM_D were evaluated; 95.5% (21/22) are good SD-Rules.

ID  SD-Rule                 Rank  Precision  Instances  Notes
Evaluated in 2015: Frequency > 200, Instance coverage > 1.00%, Accum. coverage > 80.0%
1   $|adj|ness$|noun        1     100.00%    2735       2013
2   se$|verb|zation$|noun   2     100.00%    1109       2015
3   sation$|noun|ze$|verb   3     100.00%    1072       2015
4   ility$|noun|le$|adj     9     99.94%     1629       2015
5   iness$|noun|y$|adj      10    99.82%     546        2013
6   ation$|noun|e$|verb     15    99.25%     2519       2013
7   ce$|noun|t$|adj         19    98.83%     854        2015, nce$|noun|nt$|adj is used
8   cy$|noun|t$|adj         20    98.77%     407        2015
9   e$|verb|ion$|noun       21    98.76%     2337       2015
10  $|verb|ment$|noun       22    98.35%     607        2013
11  ication$|noun|y$|verb   23    98.33%     300        2013
12  ed$|adj|ion$|noun       24    98.15%     704        2013
13  $|adj|ity$|noun         26    97.89%     2082       2013
14  e$|adj|ity$|noun        31    97.55%     897        2013
15  $|verb|ion$|noun        41    93.72%     573        2013
16  $|verb|ing$|noun        44    92.56%     524        2013
17  $|verb|ation$|noun      52    90.29%     340        2013
Evaluated in 2016: Frequency > 100, Instance coverage > 0.40%, Accum. coverage > 83.36%
18  e$|verb|ing$|noun       47    91.47%     211        2015, expert-suggested
19  ation$|noun|ed$|adj     67    83.95%     405        2016
20  $|adj|ism$|noun         51    90.79%     369        2016
21  e$|adj|ion$|noun        88    54.60%     359        2016, bad (cutoff at Rank of 82)
22  e$|verb|is$|noun        27    97.87%     141        2016, se$|verb|sis$|noun is used
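The note "bad (cutoff at Rank of 82)" suggests that a rule's good/bad status follows from its rank in the optimized set. A minimal sketch of that check on the rules marked 2016 in the table above is given here. It assumes the cutoff is inclusive (rank 82 itself counts as good), which the source does not state; the is_good helper is hypothetical.

```python
RANK_CUTOFF = 82  # from the table note; inclusiveness is an assumption

# (SD-Rule, rank) for the NOM_D rules marked 2016 in the table above.
NOM_D_2016 = [
    ("ation$|noun|ed$|adj", 67),
    ("$|adj|ism$|noun", 51),
    ("e$|adj|ion$|noun", 88),
    ("e$|verb|is$|noun", 27),
]


def is_good(rank, cutoff=RANK_CUTOFF):
    """A rule is treated as good when its rank is at or above the cutoff line."""
    return rank <= cutoff


bad_rules = [rule for rule, rank in NOM_D_2016 if not is_good(rank)]
print(bad_rules)
```

With these ranks, only e$|adj|ion$|noun (rank 88) falls below the cutoff, which agrees with the table.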

III. Further Observation on ORG_D

The top 6 SD-Rules generated from ORG_D are all good SD-Rules.

ID  SD-Rule              Rank  Precision  Instances  Status
Evaluated in 2015: Frequency > 40, Instance coverage > 1.00%, Accum. coverage > 11.50%
1   $|noun|less$|adj     11    99.64%     561        2013
2   $|verb|ion$|noun     41    93.72%     573        2013, also from NOM_D
3   ist$|noun|y$|noun    37    95.48%     509        2013
4   $|adj|ally$|adv      17    99.08%     2072       2015
5   $|noun|ful$|adj      54    89.21%     139        2013
6   c$|adj|s$|noun       55    89.04%     292        2015
Evaluated in 2016: Frequency >= 35, Accum. coverage > 16.00%, Ind. coverage > 0.80%
7   on$|noun|ve$|adj     48    91.46%     1253       2016
8   $|noun|ship$|noun    70    80.45%     133        2016
9   $|noun|age$|noun     96    36.97%     119        2016, bad (cutoff at Rank of 82)
10  e$|noun|ic$|adj      49    91.40%     1267       2016
11  sia$|noun|tic$|adj   40    94.17%     103        2016
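To make the rule notation in these tables concrete, the sketch below applies a rule to a single word. It assumes the usual reading of suffix1$|cat1|suffix2$|cat2: a word of category cat1 that ends in suffix1 yields a word of category cat2 by swapping suffix1 for suffix2. The function apply_sd_rule is a hypothetical illustration, not a Lexical Tools API, and it applies the rule in the left-to-right direction only.

```python
def apply_sd_rule(word, category, rule):
    """Apply an SD-Rule to (word, category), left to right.

    Returns (derived_word, derived_category), or None when the rule
    does not match the word's category or ending.
    """
    in_field, in_cat, out_field, out_cat = rule.split("|")
    in_suffix = in_field.rstrip("$")    # '$' marks the end of the word
    out_suffix = out_field.rstrip("$")
    if category != in_cat or not word.endswith(in_suffix):
        return None
    stem = word[: len(word) - len(in_suffix)]
    return stem + out_suffix, out_cat


print(apply_sd_rule("friend", "noun", "$|noun|ship$|noun"))
print(apply_sd_rule("analyse", "verb", "se$|verb|sis$|noun"))
print(apply_sd_rule("friend", "verb", "$|noun|ship$|noun"))
```

A rule that matches a word's ending can still produce a non-word or an unrelated word, which is why each rule's precision is measured against real instances.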

IV. Future Work

Evaluate more SD-Rules from NOM_D and ORG_D further down the list.