Exclusive Filter: A Term with min. Document Count and Word Count
The MEDLINE n-gram set is used to retrieve the DC and WC. It uses 30 as the minimum WC and 1 and the min. DC. There are lots of multiwords in Lexcion are not in the n-gram set due to:
Description | FilterType | Notes |
---|---|---|
get DC|WC from n-gram | FT_TBD | |
if not in the MEDLINE n-gram set | FT_WC_DC_NOT_FOUND |
|
Check if (dc < minDc) or (wc < minWc) | FT_WC_DC_INV_LES |
|
FilterType.FT_WC_DC_INV_LESS
Lexicon | Filter | Sample No | Pass No | Trap No | Exp No | Pass-Rate |
---|---|---|---|---|---|---|
2023 | FT_WC_DC_INV_LESS | 1001867 | 1001867 | 0 | 621582 | 100.0000% |
2022 | FT_WC_DC_INV_LESS | 998845 | 998845 | 0 | 623591 | 100.0000% |
2021 | FT_WC_DC_INV_LESS | 992545 | 992545 | 0 | 626830 | 100.0000% |
2020 | FT_WC_DC_INV_LESS | 983420 | 983420 | 0 | 629088 | 100.0000% |
2019 | FT_WC_DC_INV_LESS | 972721 | 972721 | 0 | 630890 | 100.0000% |
2018 | FT_WC_DC_INV_LESS | 955564 | 955564 | 0 | 625175 | 100.0000% |
2017 | FT_WC_DC_INV_LESS | 935276 | 935276 | 0 | 618346 | 100.0000% |
2016 | FT_WC_DC_INV_LESS | 915583 | 915583 | 0 | 618966 | 100.0000% |
2015 | FT_WC_DC_INV_LESS | 896213 | 896213 | 0 | 612316 | 100.0000% |
2014 | FT_WC_DC_INV_LESS | 875090 | 875090 | 0 | 603592 | 100.0000% |