Previous Candidate Lists
This page describes the analysis and aggregation on all previous Lexicon candidate lists. These lists include valid and invalid candidates from various models as described bellows. The numbers are based on real-time data. In other words, this program needs to be re-run to get the latest number when:
The stats is based on the following implmentation
1
2
3
4
Algorithm:
Year | Acronym Expansions | Abbreviation Expansions | ||||
---|---|---|---|---|---|---|
Total | Valid | Invalid | Total | Valid | Invalid | |
2015 | 908 | 881 (97.03%) | 27 (2.97%) | 62 | 40 (64.52%) | 22 (35.48%) |
2016 | 59 | 59 (100.00%) | 0 (0.00%) | 183 | 180 (98.36%) | 3 (1.64%) |
2017 | 39 | 39 (100.00%) | 0 (0.00%) | 22 | 19 (86.36%) | 3 (13.64%) |
2018 | 17 | 16 (94.12%) | 1 (5.88) | 28 | 26 (92.86%) | 2 (7.14%) |
2019 | 151 | 142 (94.04%) | 9 (5.96%) | 13 | 12 (92.31%) | 1 (7.69%) |
Year | Total | Valid | Invalid | |||
2020 | 148 | 112 (75.68%) | 36 (24.32%) | |||
2021 | 158 | 129 (81.65%) | 29 (18.35%) | |||
2022 | 94 | 53 (56.38%) | 41 (43.62%) | |||
2023 | 2 | 2 (100.00%) | 0 (0.00%) | |||
Accu. | Total: 1808 | Valid: 1636 (90.49%) | Invalid: 172 (9.51%) |
Year | Total | Valid | Invalid | Notes |
---|---|---|---|---|
2015 | 4994 | 3679(73.67%) | 1315 (26.33%) | |
2016 | 360 | 200 (55.56%) | 160 (44.44%) | |
2017 | 1855 | 1316 (70.94%) | 539 (29.06%) |
|
2018 | 808 | 604 (74.75%) | 204 (25.25%) |
|
2019 | 1081 | 663 (61.33%) | 418 (38.67%) |
|
2020 | 1061 | 786 (74.08%) | 275 (25.92%) |
|
2021 | 1262 | XXX (XX.XX%) | XXX (XX.XX%) |
|
Accu. | 9816 | 7056 (71.88%) | 2760 (28.12%) |
Year | Total | Valid | Invalid | Notes |
---|---|---|---|---|
2018 | 557 | 38 (6.82%) | 519 (93.18%) | 6.82% became valid |
2019 | 2533 | 231 (9.12%) | 2302 (90.88%) | 9.12% became valid |
2020 | 2771 | 53 (1.91%) | 2718 (98.09%) | 1.91% became valid Very consistent (small percentage). |
Year | Total | Valid | Invalid | Notes | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
2016 | 6370 | 5725 (89.87%) | 645 (10.13%) |
| ||||||||||
2017 | 1945 | 1764 (90.69%) | 181 (9.31%) |
| ||||||||||
2018 | 819 | 703 (85.84%) | 116 (14.16%) |
| ||||||||||
2019 | 2918 | 2588 (88.69%) | 330 (11.31%) |
| ||||||||||
2020 | 2846 | 2489 (87.46%) | 357 (12.54%) |
| ||||||||||
2021 | 1550 | TBD (87.46%) | TBD (12.54%) |
|
Year | Total | Valid | Invalid | Notes |
---|---|---|---|---|
2017 | 1034 | 393 (38.01%) | 641 (61.99%) | 38.01% become valid Main reason is some candidates were not tagged |
2018 | 953 | 133 (13.96%) | 820 (86.04%) | 13.96% become valid Clean up |
2019 | 984 | 50 (5.08%) | 934 (94.92%) | 5.08% become valid Small percentage is consistent! |
2020 | 1291 | 24 (1.86%) | 1267 (98.14%) | 1.86% become valid Small percentage is consistent! |
Year | Word Count | Total | Valid | Invalid | Accu. P |
---|---|---|---|---|---|
2015 | 1000000 | 3368 | 2397 (71.17%) | 971 (28.83%) | 71.17% |
100000 | 2218 | 1520 (68.53%) | 698 (31.47%) | 70.12% | |
10000 | 895 | 605 (67.60%) | 290 (32.40%) | 69.77% | |
1000 | 588 | 249 (42.35%) | 339 (57.65%) | 67.49% | |
100 | 538 | 119 (22.12%) | 419 (77.88%) | 64.28% | |
Accu. | Accu. | 7607 | 4890 (64.28%) | 2712 (35.72%) | 64.28% |
Models | Total | Valid | Invalid | Notes |
---|---|---|---|---|
zeroD, CUI | 322 | 322 (100.00%) | 0 (0.00%) | WordNetCand.ZD.cui.2021 |
zeroD, no CUI | 626 | 601 (96.01%) | 25 (3.99%) | WordNetCand.ZD.noCui.2021 |
aPairs | 1912 | 1412 (73.85%) | 500 (26.15%) | WordNetCand.AP.2021 |
Accu. | 2858 | 2333 (81.63%) | 525 (18.37%) |
1
2
3
4
5
, only used for non-routine word Cand
Date | Total | Valid | Invalid | Notes - completed candList |
---|---|---|---|---|
2018-11-15 | 21955 | 16096 (73.31%) | 5859 (26.69%) | 2.MNSMatcherParAcr, 2017 |
2019-01-03 | 22763 | 16687 (73.31%) | 6076 (26.69%) | 2.MNSMatcherParAcr, 2018 |
2019-07-19 | 24856 | 18915 (76.10%) | 5941 (23.90%) | 1.LexiconAbbAcrExpansion, 2020 |
2019-08-02 | 25675 | 19608 (76.37%) | 6067 (23.63%) | 3.DMNSMatcherCuiEndWord, 2018 |
2019-10-16 | 26756 | 20429 (76.35%) | 6327 (23.65%) | 2.MNSMatcherParAcr, 2019 |
2020-06-12 | 29674 | 23041 (77.65%) | 6633 (22.35%) | 3.DMNSMatcherCuiEndWord, 2019 |
2020-07-17 | 29832 | 23192 (77.74%) | 6640 (22.26%) | 1.LexiconAbbAcrExpansion, 2021 |
2020-08-18 | 30892 | 23999 (77.69%) | 6893 (22.32%) | 2.MNSMatcherParAcr, 2020 |
2021-03-01 | 33737 | 26512 (78.58%) | 7225 (21.42%) | 3.DMNSMatcherCuiEndWord, 2020 |
2021-07-13 | 33831 | 26571 (78.54%) | 7260 (21.46%) | 1.LexiconAbbAcrExpansion, 2022 |
2022-01-10 | 34128 | 26868 (78.73%) | 7260 (21.27%) | 8.WordNetCand.ZD.cui.2021 |
2022-01-10 | 34754 | 27466 (79.03%) | 7288 (20.97%) | 8.WordNetCand.ZD.noCui.2021 |
2022-07-06 | 34756 | 27471 (79.04%) | 7285 (20.96%) | 1.LexiconAbbAcrExpansion, 2023 |
2022-09-27 | 36649 | 28865 (78.76%) | 7784 (21.24%) | 8.WordNetCand.AP.2021 |