Lexical Tools

Optimizing 2020 SD-Rule Set - Optimum Log

I. Criteria:

  • The total number of valid SD-Pairs from the baseline (parent-only rules) is 53,440
  • Candidate child rules are:
    • Decompose occurrence rate >= 40% (default)
    • Candidate child rules: occurrence rate >= 25% (default)
    • Candidate child rules: precision is decided by the methodology of optimization
    • Find the one with max. precision and recall
    • If the child rule has lower precision and recall than its parent, it is not a good candidate, even if its recall is over 25%.

      For example, in Cases 15 and 16, the system performance is worse because both precision and recall are lower than the parent's. There is no need to run the program for these two cases.

      If the performance of a child rule is worse, then the next-generation child rules will also be worse. There is no need to run through the following generations (although we still run through them to keep the log complete).

  • Find the best set by comparing parent vs. child rules:
    • Only apply when the child rule's precision is better than the parent rule's
    • Higher system performance
    • If System Performance is the same,
      • Use Precision
      • Use Recall
      • Use Linguistic knowledge

      • Use parent rule to replace child rules.
      • If no parent-child rules are involved, use more rules
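
The criteria above can be sketched in code. This is a minimal Python sketch under stated assumptions, not the actual Lexical Tools implementation: the names `RuleStats`, `SystemResult`, `is_candidate`, and `pick_rule_set` are hypothetical, and the tie-break by linguistic knowledge is left out because it needs human judgment. The sample figures are taken from entry 16.1 and the baseline in the log in Section II.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RuleStats:
    """Counts for one SD-Rule (hypothetical container)."""
    rule: str
    total: int  # occurrences of the rule (the Occr. count)
    yes: int    # occurrences marked Yes

    @property
    def precision(self) -> float:
        return self.yes / self.total


@dataclass(frozen=True)
class SystemResult:
    """System-level figures for one rule set, as reported in the log."""
    label: str
    precision: float
    recall: float
    performance: float


def is_candidate(child: RuleStats, parent: RuleStats,
                 min_occurrence_rate: float = 0.25) -> bool:
    """A child must be frequent enough and more precise than its parent."""
    # Assumption: the occurrence rate is the child's count over the parent's.
    occurrence_rate = child.total / parent.total
    return (occurrence_rate >= min_occurrence_rate
            and child.precision > parent.precision)


def pick_rule_set(parent: SystemResult, child: SystemResult) -> SystemResult:
    """Prefer higher performance, then precision, then recall.

    On a complete tie the parent rule set is kept.
    """
    def key(result: SystemResult):
        return (result.performance, result.precision, result.recall)

    return child if key(child) > key(parent) else parent


parent_rule = RuleStats("ility$|noun|le$|adj", total=1634, yes=1633)
child_rule = RuleStats("bility$|noun|ble$|adj", total=1632, yes=1632)
baseline = SystemResult("baseline", 0.9505, 0.9426, 1.8931)
case_16_1 = SystemResult("16.1", 0.9500, 0.9448, 1.8948)

print(is_candidate(child_rule, parent_rule))
print(pick_rule_set(baseline, case_16_1).label)
```

Comparing tuples keeps the tie-break order (performance, precision, recall) explicit in one place, and the strict `>` means the parent rule set wins whenever all three figures are equal.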

II. Iterative Optimization Log:

Source:

  • Dir: ${SUFFIX_DIR}/data/${YEAR}/dataR/SdRulesOptimum/*/
  • File: sdRules.stats.out.html

Each entry lists the following fields, in order:

  • ID
  • Rank: Parent-Rule
  • Rank: Candidate Child-Rules
  • Cutoff SD-Rule (Rule No., A. Rate, Occr., Yes, No, SD-Rule)
  • Sys. Accu. Rate (Precision)
  • Sys. Cover. Rate (Recall)
  • Sys. Perf.
  • Notes

0 Rank in Baseline (all Rank)
Parent-rule only - Baseline
Rank
No child-Rule
92|63.14%|331|209|122|$|noun|ist$|noun  95.05%  94.26%  1.8931  Baseline
1.1 22 (32):
0|2073|2054|19|$|adj|ally$|adv|99.08%|100.00%
22:
1|1955|1954|1|c$|adj|cally$|adv|99.95%|94.31%
92|63.14%|331|209|122|$|noun|ist$|noun  95.05%  94.26%  1.8931  Same
1.2 22 (32):
0|2073|2054|19|$|adj|ally$|adv|99.08%|100.00%
15:
2|1950|1949|1|ic$|adj|ically$|adv|99.95%|94.07%
92|63.14%|331|209|122|$|noun|ist$|noun  95.07%  94.06%  1.8914  Worse
2.1 30 (43):
0|2085|2041|44|$|adj|ity$|noun|97.89%|100.00%
20, 32:
1|949|942|7|c$|adj|city$|noun|99.26%|45.52%
1|728|712|16|l$|adj|lity$|noun|97.80%|34.92%
93|63.14%|331|209|122|$|noun|ist$|noun  95.05%  93.53%  1.8859  Worse
2.2 30 (43):
0|2081|2037|44|$|adj|ity$|noun|97.89%|100.00%
19, 32:
2|948|942|6|ic$|adj|icity$|noun|99.37%|45.47%
1|728|712|16|l$|adj|lity$|noun|97.80%|34.92%
93|63.14%|331|209|122|$|noun|ist$|noun  95.06%  93.53%  1.8859  Worse
3.1 86 (107):
0|1326|968|358|$|noun|al$|adj|73.00%|100.00%
78:
1|673|557|116|n$|noun|nal$|adj|82.76%|50.75%
94|61.29%|93|57|36|$|noun|ish$|adj  95.27%  93.82%  1.8910  Worse
3.2 86 (107):
0|1326|968|358|$|noun|al$|adj|73.00%|100.00%
75:
2|621|533|88|on$|noun|onal$|adj|85.83%|46.83%
94|61.29%|93|57|36|$|noun|ish$|adj  95.32%  93.78%  1.8910  Worse
3.3 86 (107):
0|1326|968|358|$|noun|al$|adj|73.00%|100.00%
74:
3|577|497|80|ion$|noun|ional$|adj|86.14%|43.51%
94|61.29%|93|57|36|$|noun|ish$|adj  95.33%  93.71%  1.8904  Worse
3.4 86 (107):
0|1326|968|358|$|noun|al$|adj|73.00%|100.00%
72:
4|472|408|64|tion$|noun|tional$|adj|86.44%|35.60%
94|61.29%|93|57|36|$|noun|ish$|adj  95.36%  93.54%  1.8890  Worse
4.1 101 (122):
0|664|343|321|$|noun|y$|noun|51.66%|100.00%
52:
1|253|234|19|h$|noun|hy$|noun|92.49%|38.10%
92|63.14%|331|209|122|$|noun|ist$|noun  95.04%  93.78%  1.8882  Worse
5.1 48 (65):
0|573|537|36|$|verb|ion$|noun|93.72%|100.00%
39:
1|449|434|15|t$|verb|tion$|noun|96.66%|78.36%
91|67.01%|97|65|32|al$|noun|e$|verb  95.10%  90.34%  1.8544  Worse
5.2 48 (65):
0|573|537|36|$|verb|ion$|noun|93.72%|100.00%
19:
2|322|320|2|ct$|verb|ction$|noun|99.38%|56.20%
91|67.01%|97|65|32|al$|noun|e$|verb  95.11%  90.12%  1.8524  Worse
5.3 48 (65):
0|573|537|36|$|verb|ion$|noun|93.72%|100.00%
6:
3|186|186|0|ect$|verb|ection$|noun|100.00%|32.46%
91|67.01%|97|65|32|al$|noun|e$|verb  95.10%  89.87%  1.8498  Worse
6.1 72 (92):
0|264|228|36|a$|noun|an$|adj|86.36%|100.00%
No candidate child rules found!
92|63.14%|331|209|122|$|noun|ist$|noun  95.05%  94.26%  1.8931  Same
7.1 127 (150):
0|277|2|275|a$|noun|an$|noun|0.72%|100.00%
127:
1|137|1|136|ia$|noun|ian$|noun|0.73%|49.46%
92|63.14%|331|209|122|$|noun|ist$|noun  95.05%  94.26%  1.8931  Same
8.1 68 (88):
0|137|120|17|a$|noun|ar$|adj|87.59%|100.00%
58:
1|115|105|10|la$|noun|lar$|adj|91.30%|83.94%
92|63.14%|331|209|122|$|noun|ist$|noun  95.06%  94.23%  1.8929  Worse
8.2 68 (88):
0|137|120|17|a$|noun|ar$|adj|87.59%|100.00%
46:
2|69|65|4|ula$|noun|ular$|adj|94.20%|50.36%
92|63.14%|331|209|122|$|noun|ist$|noun  95.07%  94.15%  1.8922  Worse
9.1 20 (29):
0|2529|2510|19|ation$|noun|e$|verb|99.25%|100.00%
17, 2:
1|1062|1061|1|sation$|noun|se$|verb|99.91%|41.99%
1|1257|1257|0|zation$|noun|ze$|verb|100.00%|49.70%
93|63.14%|331|209|122|$|noun|ist$|noun  95.07%  93.90%  1.8896  Worse
9.2 20 (29):
0|2529|2510|19|ation$|noun|e$|verb|99.25%|100.00%
5, 2:
2|1038|1038|0|isation$|noun|ise$|verb|100.00%|41.04%
2|1250|1250|0|ization$|noun|ize$|verb|100.00%|49.43%
93|63.14%|331|209|122|$|noun|ist$|noun  95.07%  93.84%  1.8891  Worse
9.3 20 (29):
0|2529|2510|19|ation$|noun|e$|verb|99.25%|100.00%
5, 2:
2|1038|1038|0|isation$|noun|ise$|verb|100.00%|41.04%
1|1257|1257|0|zation$|noun|ze$|verb|100.00%|49.70%
93|63.14%|331|209|122|$|noun|ist$|noun  95.07%  93.85%  1.8892  Worse
10.1 64 (84):
0|294|262|32|c$|adj|s$|noun|89.12%|100.00%
57:
1|284|260|24|ic$|adj|is$|noun|91.55%|96.60%
92|63.14%|331|209|122|$|noun|ist$|noun  95.07%  94.25%  1.8932  Better
10.2 64 (84):
0|294|262|32|c$|adj|s$|noun|89.12%|100.00%
46:
2|192|182|10|tic$|adj|tis$|noun|94.79%|65.31%
92|63.14%|331|209|122|$|noun|ist$|noun  95.08%  94.11%  1.8919  Worse
10.3 64 (84):
0|294|262|32|c$|adj|s$|noun|89.12%|100.00%
33:
3|174|170|4|itic$|adj|itis$|noun|97.70%|59.18%
92|63.14%|331|209|122|$|noun|ist$|noun  95.09%  94.08%  1.8918  Worse
11.1 30 (43):
0|882|861|21|ce$|noun|t$|adj|97.62%|100.00%
24:
1|871|861|10|nce$|noun|nt$|adj|98.85%|98.75%
92|63.14%|331|209|122|$|noun|ist$|noun  95.09%  94.25%  1.8934  Better
11.2 30 (43):
0|882|861|21|ce$|noun|t$|adj|97.62%|100.00%
22, 27:
2|333|330|3|ance$|noun|ant$|adj|99.10%|37.76%
2|538|531|7|ence$|noun|ent$|adj|98.70%|61.00%
93|63.14%|331|209|122|$|noun|ist$|noun  95.09%  94.25%  1.8934  Same
12.1 25 (36):
0|416|411|5|cy$|noun|t$|adj|98.80%|100.00%
24:
1|415|410|5|ncy$|noun|nt$|adj|98.80%|99.76%
92|63.14%|331|209|122|$|noun|ist$|noun  95.09%  94.25%  1.8934  Same
13.1 24 (35):
0|2348|2320|28|e$|verb|ion$|noun|98.81%|100.00%
18:
1|2211|2203|8|te$|verb|tion$|noun|99.64%|94.17%
92|63.14%|331|209|122|$|noun|ist$|noun  95.11%  94.03%  1.8915  Worse
13.2 24 (35):
0|2348|2320|28|e$|verb|ion$|noun|98.81%|100.00%
17:
2|2107|2103|4|ate$|verb|ation$|noun|99.81%|89.74%
92|63.14%|331|209|122|$|noun|ist$|noun  95.11%  93.85%  1.8896  Worse
13.3 24 (35):
0|2348|2320|28|e$|verb|ion$|noun|98.81%|100.00%
4:
3|602|602|0|late$|verb|lation$|noun|100.00%|25.64%
91|67.01%|97|65|32|al$|noun|e$|verb  95.18%  90.65%  1.8583  Worse
14.1 42 (57):
0|144|138|6|e$|verb|is$|noun|95.83%|100.00%
32:
1|141|138|3|se$|verb|sis$|noun|97.87%|97.92%
92|63.14%|331|209|122|$|noun|ist$|noun  95.09%  94.25%  1.8934  Same
14.2 42 (57):
0|144|138|6|e$|verb|is$|noun|95.83%|100.00%
41, 10:
2|54|52|2|ose$|verb|osis$|noun|96.30%|37.50%
2|59|59|0|yse$|verb|ysis$|noun|100.00%|40.97%
93|63.14%|331|209|122|$|noun|ist$|noun  95.09%  94.20%  1.8929  Worse
14.3 42 (57):
0|144|138|6|e$|verb|is$|noun|95.83%|100.00%
41, 10:
2|54|52|2|ose$|verb|osis$|noun|96.30%|37.50%
3|58|58|0|lyse$|verb|lysis$|noun|100.00%|40.28%
93|63.14%|331|209|122|$|noun|ist$|noun  95.09%  94.20%  1.8929  Worse
15.1 53 (70):
0|224|207|17|esis$|noun|ic$|adj|92.41%|100.00%
27:
1|209|206|3|nesis$|noun|nic$|adj|98.56%|93.30%
92|63.14%|331|209|122|$|noun|ist$|noun  95.11%  94.25%  1.8936  Better
15.2 53 (70):
0|224|207|17|esis$|noun|ic$|adj|92.41%|100.00%
18:
2|207|206|1|enesis$|noun|enic$|adj|99.52%|92.41%
92|63.14%|331|209|122|$|noun|ist$|noun  95.11%  94.25%  1.8937  Better
15.3 53 (70):
0|224|207|17|esis$|noun|ic$|adj|92.41%|100.00%
18:
3|207|206|1|genesis$|noun|genic$|adj|99.52%|92.41%
92|63.14%|331|209|122|$|noun|ist$|noun  95.11%  94.25%  1.8937  Same
15.4 53 (70):
0|224|207|17|esis$|noun|ic$|adj|92.41%|100.00%
6:
4|181|181|0|ogenesis$|noun|ogenic$|adj|100.00%|80.80%
92|63.14%|331|209|122|$|noun|ist$|noun  95.11%  94.20%  1.8932  Worse
16.1 15 (20):
0|1634|1633|1|ility$|noun|le$|adj|99.94%|100.00%
2:
1|1632|1632|0|bility$|noun|ble$|adj|100.00%|99.88%
93|63.02%|192|121|71|ar$|adj|e$|noun  95.00%  94.48%  1.8948  Best
16.2 15 (20):
0|1634|1633|1|ility$|noun|le$|adj|99.94%|100.00%
2:
2|1294|1294|0|ability$|noun|able$|adj|100.00%|79.19%
92|63.14%|331|209|122|$|noun|ist$|noun  95.09%  93.62%  1.8870  Worse
17.1 18 (25):
0|1017|1012|5|sis$|noun|tic$|adj|99.51%|100.00%
21, 18, 7:
1|336|334|2|esis$|noun|etic$|adj|99.40%|33.04%
1|369|368|1|osis$|noun|otic$|adj|99.73%|36.28%
1|216|216|0|ysis$|noun|ytic$|adj|100.00%|21.24%
94|63.14%|331|209|122|$|noun|ist$|noun  95.11%  94.07%  1.8919  Worse
18.1 65 (85):
0|101|90|11|sity$|noun|us$|adj|89.11%|100.00%
58:
1|100|90|10|osity$|noun|ous$|adj|90.00%|99.01%
93|63.02%|192|121|71|ar$|adj|e$|noun  95.00%  94.48%  1.8948  Same
22.1 39 (52):
0|60|58|2|sis$|noun|ze$|verb|96.67%|100.00%
11:
1|57|57|0|ysis$|noun|yze$|verb|100.00%|95.00%
94|63.02%|192|121|71|ar$|adj|e$|noun  95.00%  94.47%  1.8948  Same
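
To make the record layout concrete, here is a minimal Python sketch that parses one pipe-delimited rule record from the log and re-derives its percentages. The field names (`depth`, `total`, `yes`, `no`) and the function names are hypothetical. Three readings are assumptions inferred from the numbers in this log rather than documented definitions: the first field as the child-rule generation, the ninth as Yes divided by the total, and the tenth as the child's occurrences relative to its parent. Treating Sys. Perf. as precision plus recall is likewise an inference.

```python
from typing import NamedTuple


class RuleRecord(NamedTuple):
    depth: int        # assumed: 0 = parent rule, n = n-th generation child
    total: int        # occurrences
    yes: int          # occurrences marked Yes
    no: int           # occurrences marked No
    rule: str         # e.g. "c$|adj|cally$|adv"
    precision: float  # fraction parsed from the 9th field
    rate: float       # fraction parsed from the 10th field


def parse_percent(text: str) -> float:
    """Convert a string such as '99.95%' to the fraction 0.9995."""
    return float(text.rstrip("%")) / 100.0


def parse_record(line: str) -> RuleRecord:
    """Split one 10-field record; the rule itself spans fields 5 to 8."""
    parts = line.strip().split("|")
    if len(parts) != 10:
        raise ValueError(f"expected 10 fields, got {len(parts)}: {line!r}")
    depth, total, yes, no = (int(p) for p in parts[:4])
    return RuleRecord(depth, total, yes, no, "|".join(parts[4:8]),
                      parse_percent(parts[8]), parse_percent(parts[9]))


def system_performance(precision: float, recall: float) -> float:
    """Assumed definition: system precision plus system recall."""
    return precision + recall


# Entry 1.1: parent rule and its candidate child rule.
parent = parse_record("0|2073|2054|19|$|adj|ally$|adv|99.08%|100.00%")
child = parse_record("1|1955|1954|1|c$|adj|cally$|adv|99.95%|94.31%")

TOLERANCE = 1e-4  # the log reports percentages to two decimal places
print(child.yes + child.no == child.total)
print(abs(child.yes / child.total - child.precision) < TOLERANCE)
print(abs(child.total / parent.total - child.rate) < TOLERANCE)
print(abs(system_performance(0.9505, 0.9426) - 1.8931) < TOLERANCE)
```

Some entries differ from the sum of the rounded percentages by 0.0001 (for example, entry 1.2 reports 1.8914 for 95.07% and 94.06%), which would be consistent with the log summing unrounded values; the tolerance above allows for that.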