The SPECIALIST Lexicon

Example: New Element Words from MEDLINE

I. Cutoff Word Count

New element words (EWT_NEW) with high frequency are sent to linguists for review so that the Lexicon covers the single words and multiwords in MEDLINE. For the first attempt, we chose a word count (WC) cutoff of 1500, which covers 97.58% of single words in MEDLINE.2014. (The coverage rate of multiwords cannot be calculated because the complete set of legitimate multiwords in MEDLINE is unknown.) The frequency spectrum for element words (single words) of MEDLINE.2014 is shown in the accompanying figure.
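The relationship between a WC cutoff and a coverage rate can be illustrated with a minimal sketch. It assumes that coverage means the share of all word occurrences accounted for by words whose WC is at or above the cutoff; that definition, the function names, and the toy counts are illustrative assumptions, not the Lexicon team's actual procedure or data.

```python
def coverage_at_cutoff(word_counts, cutoff):
    """Fraction of all word occurrences contributed by words with WC >= cutoff.

    Assumption: this is how "coverage" is defined; the source does not say.
    """
    total = sum(word_counts.values())
    if total == 0:
        return 0.0
    covered = sum(wc for wc in word_counts.values() if wc >= cutoff)
    return covered / total


def cumulative_spectrum(word_counts):
    """Rows of (word, WC, cumulative coverage), sorted by descending WC."""
    total = sum(word_counts.values())
    running = 0
    rows = []
    for word, wc in sorted(word_counts.items(), key=lambda kv: (-kv[1], kv[0])):
        running += wc
        rows.append((word, wc, running / total))
    return rows


# Toy counts for illustration only (not MEDLINE data).
toy_counts = {"the": 5000, "cell": 3000, "cdh": 1500, "mfi": 400, "zzz": 100}

print(f"{coverage_at_cutoff(toy_counts, 1500):.2%}")
for word, wc, cum in cumulative_spectrum(toy_counts):
    print(f"{word}|{wc}|{cum:.4%}")
```

Lowering the cutoff raises coverage but also increases the number of candidate words that linguists must review, which is the trade-off behind choosing a single threshold.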

II. Examples

New element words (WC >= 1500) are sent to linguists for review. New lexical records of single words and multiwords are found, as shown in the following three examples:

  • Ex-1: New element word, "cdh" (9982 WC), leads to 44 new lexical records with 78 single words (such as "CDH", "CDH1", and "cadherin1") and 23 multiwords (such as "chronic daily headache", "cervical disc herniation", and "cadherin 1") of base forms.

    Element Word: cdh|9982|93.8584%
    Associated New Lexical Records
    • E0742227|chronic daily headache
    • E0742228|cellobiose dehydrogenase
    • E0742229|cervical disc herniation|cervical disk herniation
    • E0742230|CDH
    • E0742231|CDH|Cdh|cdh
    • E0742232|cadherin1|cadherin 1|cadherin-1
    • E0742233|CDH1|Cdh1|CDH-1
    • E0742234|cadherin2|cadherin 2|cadherin-2
    • E0742235|CDH2|Cdh2
    • E0742236|cadherin 13|cadherin-13
    • E0742237|CDH13|
    • E0742238|CDH23|Cdh23|cdh23
    • E0742239|cadherin 17|cadherin-17
    • E0742240|CDH17|Cdh17|cdh17
    • E0742241|cadherin 3|cadherin-3
    • E0742242|CDH3|Cdh3
    • E0742243|cadherin 11|cadherin-11
    • E0742244|CDH11|Cdh11|CDH-11|Cdh-11
    • E0742245|cadherin 5|cadherin-5
    • E0742246|CDH5|Cdh5|cdh5
    • E0742247|cadherin 8|cadherin-8
    • E0742248|CDH8|Cdh8
    • E0742249|cadherin 6|cadherin-6
    • E0742250|CDH6|Cdh6|cdh6
    • E0742251|cadherin 16|cadherin-16
    • E0742252|CDH16|Cdh16
    • E0742253|cadherin4|cadherin 4|cadherin-4
    • E0742254|CDH4|Cdh4|cdh4
    • E0742255|cadherin7|cadherin 7|cadherin-7
    • E0742256|CDH7|Cdh7|cdh7
    • E0742257|cadherin 22|cadherin-22
    • E0742258|CDH22|Cdh22
    • E0742259|cadherin 12|cadherin-12
    • E0742260|CDH12|Cdh12
    • E0742261|cadherin 9|cadherin-9
    • E0742262|CDH9|Cdh9
    • E0742263|cadherin 10|cadherin-10
    • E0742264|CDH10|Cdh10
    • E0742265|cadherin 18|cadherin-18
    • E0742266|CDH18|Cdh18
    • E0742267|cadherin 15|cadherin-15
    • E0742268|CDH15|Cdh15
    • E0742269|cadherin19|cadherin 19|cadherin-19
    • E0742270|CDH19|Cdh19|cdh19
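Record lists in this format can be tallied with a short script. The sketch below assumes each line is a record ID followed by pipe-separated terms, and that a term counts as a multiword when it contains a space (so hyphenated forms such as "cadherin-1" count as single words). Both the field interpretation and the helper names are assumptions for illustration, and the sample is only a four-line subset of the list above.

```python
def parse_record(line):
    """Split one 'ID|term|term...' line into (record_id, [terms]).

    A leading bullet and empty fields (e.g. from a trailing '|') are dropped.
    """
    fields = line.strip().lstrip("•").strip().split("|")
    record_id = fields[0]
    terms = [f.strip() for f in fields[1:] if f.strip()]
    return record_id, terms


def is_multiword(term):
    """Assumed rule: a term containing a space is a multiword."""
    return " " in term


def summarize(lines):
    """Count records, single-word terms, and multiword terms."""
    summary = {"records": 0, "single_words": 0, "multiwords": 0}
    for line in lines:
        if not line.strip():
            continue
        _, terms = parse_record(line)
        summary["records"] += 1
        for term in terms:
            key = "multiwords" if is_multiword(term) else "single_words"
            summary[key] += 1
    return summary


# A subset of the records listed above, copied verbatim.
sample = [
    "• E0742227|chronic daily headache",
    "• E0742229|cervical disc herniation|cervical disk herniation",
    "• E0742232|cadherin1|cadherin 1|cadherin-1",
    "• E0742237|CDH13|",
]

print(summarize(sample))
```

This counts every listed term, including spelling variants, and does not deduplicate terms shared between records, so its totals need not match the summary figures quoted in the examples.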

  • Ex-2: New element word, "mfi" (3428 WC), leads to 22 new lexical records with 4 single words (such as "MFI") and 22 multiwords (such as "mean fluorescence intensity", "microvascular flow index", and "maternal floor infarction") of base forms.

    Element Word: mfi|3428|96.3818%
    Associated New Lexical Records
    • E0743581|mean fluorescence intensity
    • E0743582|median fluorescence intensity
    • E0743583|mean fluorescent intensity
    • E0743584|median fluorescent intensity
    • E0743585|microvascular flow index
    • E0743586|microcirculatory flow index
    • E0743587|myofibrillar fragmentation index
    • E0743588|myofibril fragmentation index
    • E0743589|mean fluorescence index
    • E0743590|mechanical fragility index
    • E0743591|Multidimensional Fatigue Inventory
    • E0743592|microflow imaging|micro flow imaging|micro-flow imaging
    • E0743593|metastasis-free interval
    • E0743594|maternal floor infarction
    • E0743595|MFI
    • E0743619|Multidimensional Fatigue Inventory 20|Multidimensional Fatigue Inventory-20
    • E0743620|MFI-20
    • E0743621|MFI type zeolite|MFI-type zeolite
    • E0743622|meso-MFI zeolite
    • E0743623|Meso-MFI
    • E0743624|MFI zeolite|MFI-zeolite

  • Ex-3: New element word, "lumo" (3495 WC), leads to 23 new lexical records with 6 single words (such as "E(LUMO)" and "E(HOMO)") and 28 multiwords (such as "lowest unoccupied molecular orbital", "LUMO energy", and "HOMO energy") of base forms. Note that records associated with the element word "homo" (WC 1217), which is an existing element word in the Lexicon, are also updated during this process.

    Element Word: lumo|3495|96.3443%
    Associated New Lexical Records
    • E0743465|lowest unoccupied molecular orbital
    • E0743466|lowest unoccupied molecular orbital energy|lowest-unoccupied-molecular-orbital energy
    • E0743467|E(LUMO)|
    • E0743468|LUMO energy
    • E0743469|highest occupied molecular orbital-lowest unoccupied molecular orbital gap
    • E0743470|HOMO-LUMO gap|HOMO/LUMO gap
    • E0743471|highest occupied molecular orbital-lowest unoccupied molecular orbital energy gap
    • E0743472|HOMO-LUMO energy gap|HOMO/LUMO energy gap
    • E0743473|highest occupied molecular orbital-lowest unoccupied molecular orbital energy
    • E0743474|HOMO-LUMO energy|HOMO/LUMO energy|HOMO, LUMO energy
    • E0743475|HOMO-LUMO|HOMO/LUMO
    • E0743476|HOMO-LUMO transition|HOMO/LUMO transition
    • E0743477|lowest unoccupied molecular orbital energy|lowest-unoccupied-molecular-orbital energy
    • E0743478|LUMO energy
    • E0743479|lowest unoccupied molecular orbital level
    • E0743480|LUMO level
    • E0743481|HOMO-LUMO analysis|HOMO/LUMO analysis|HOMO, LUMO analysis
    • E0743482|LUMO
    • E0743483|highest occupied molecular orbital|highest-occupied molecular orbital
    • E0743484|HOMO
    • E0743485|highest occupied molecular orbital energy|highest-occupied-molecular-orbital energy
    • E0743486|E(HOMO)
    • E0743487|HOMO energy