Retrieve New SD-Rules from NomD, 2015
I. Description
A set of computer programs (FindSdRulesFromDPairs.java
) are developed to find the SD-Rules from a set of suffixD pairs. It identifies and eliminates the same starting characters of a SD-pair and then generates the SD-Rules automatically. Please note that only root-parent SD-Rules is generated in this program. Two sets of SD-pairs are used for this task. This page details the new SD-Rules selected from nomD.
II. Procedures
- Directory: ${SUFFIXD_DIR}
- Programs:
shell>cd ${SUFFIXD_DIR}/bin
shell>GetSdRule ${YEAR}
2
nomD
shell>GetSdRule ${YEAR}
3
nomD
III. Results
- These are SD-Pairs from nominalizations of Lexicon.2015
- The file of ${NOM_D_DIR}/data/nomD.yes.S.data is used as input
- There are 23,384 SD-Pairs to generate 1,017 SD-Rules, see
sdRulesFromSdPairs.rpt
- All generated SD-Rules are root-parent rules (without parent-rule).
- Rules with following criteria are selected:
- High frequency (>= 200):
- Accumulate coverage: 80.34% (> 80.00%)
- Individual coverage: 1.26% (> 1.00%)
- SD-Rules meet above criteria (total instance No. 23,384):
SD-RulesInstances No. | Accu. No. | Notes
| $|adj|ness$|noun | 2733 (11.69%) | 2733 (11.69%) | Exists
| ation$|noun|e$|verb | 2472 10.57% | 5205 (22.26%) | Exists
| e$|verb|ion$|noun | 2298 (9.83%) | 7503 (32.09%) | New, has child rules exist
| $|adj|ity$|noun | 2036 (8.71%) | 9539 (40.79%) | Exists
| ility$|noun|le$|adj | 1609 (6.88%) | 11148 (47.67%) | New, has child rules exist
| se$|verb|zation$|noun | 1100 (4.70%) | 12248 (52.38%) | New, has no child rules exist
| sation$|noun|ze$|verb | 1064 (4.55%) | 13312 (56.93%) | New, has no child rules exist
| ce$|noun|t$|adj | 836 (3.58%) | 14148 (60.50%) | New, has child rules exist
| e$|adj|ity$|noun | 830 (3.55%) | 14978 (64.05%) | Exists
| ed$|adj|ion$|noun | 675 (2.89%) | 15653 (66.94%) | Exists
| $|verb|ment$|noun | 574 (2.45%) | 16227 (69.39%) | Exists
| iness$|noun|y$|adj | 544 (2.33%) | 16771 (71.72%) | Exists
| $|verb|ion$|noun | 535 (2.29%) | 17306 (74.01%) | Exists
| $|verb|ing$|noun | 478 (2.04%) | 17784 (76.05%) | Exists
| cy$|noun|t$|adj | 400 (1.71%) | 18184 (77.76%) | New, has child rules exist
| $|verb|ation$|noun | 307 (1.31%) | 18491 (79.08%) | Exists
| ication$|noun|y$|verb | 295 (1.26%) | 18786 (80.34%) | Exists
| Frequency > 200, Instance coverage > 1.00% , Accum. Coverage > 80.0%
|
---|
|
---|
- New SD-Rules with childred rules
New rules | Instances | Examples | Child-rules
|
---|
e$|verb|ion$|noun | 2298
| - evocate|verb|E0538633|evocation|noun|E0417865
|
ate$|verb|ation$|noun|2013|ORG_RULE|SELF
se$|verb|sion$|noun|2013|ORG_RULE|SELF
|
ility$|noun|le$|adj | 1609
| - appliability|noun|E0541203|appliable|adj|E0541202
|
ability$|noun|able$|adj|2013|ORG_RULE|SELF
|
ce$|noun|t$|adj | 836
| - equivalence|noun|E0025964|equivalent|adj|E0025966
|
ance$|noun|ant$|adj|2013|ORG_RULE|PARENT
ence$|noun|ent$|adj|2013|ORG_RULE|SELF
iance$|noun|iant$|adj|2013|ORG_RULE|CHILD
|
cy$|noun|t$|adj | 400
| - reluctancy|noun|E0514595|reluctant|adj|E0052653
|
ency$|noun|ent$|adj|2013|ORG_RULE|PARENT
iency$|noun|ient$|adj|2013|ORG_RULE|CHILD
|
- New SD-Rules without child-rule
New rules | Instances | Examples | Child-rules
|
---|
se$|verb|zation$|noun | 1100 |
- fertilise|verb|E0027636|fertilization|noun|E0027634
- sanitise|verb|E0054320|sanitization|noun|E0054319
| N/A
|
sation$|noun|ze$|verb | 1064 |
- manualisation|noun|E0579348|manualize|verb|E0579347
- professionalisation|noun|E0050195|professionalize|verb|E0050196
| N/A
|