Sub-Term Mapping Tools

UMLS-Core: Normalized term to CUI Mapping

  • Descriptions:
    • A simple mapping from normalized term to CUI (concept)
    • File: MRXNS_ENG.RRF (fields 2 and 3)

  • Examples - Test Cases:
    Input                    | Output   | Notes
    disease's                | C0012634 | Genitive
    disease(s)               | C0012634 | parenthetical plural forms
    :disease.&*              | C0012634 | Punctuation
    of disease               | C0012634 | Stop Words
    DisEASE                  | C0012634 | cases
    diseases                 | C0012634 | inflectional variants
    :Disease(S).&*           | C0012634 | Combination
    Colours                  | C0009393 | spelling variants
    Cortical Cystic Diseases | C0431713 | Multi words

  • Algorithm:
    • Normalize the input term with Lvg Norm
    • Mapping
      • It is a one-to-many mapping, such as:
        • tree trumpet|C1460803
        • tree trumpet|C1492035
      • Use Java Hashtable
                    | Key             | Values
        Type        | String          | Vector<String>
        Description | Normalized term | Mapped CUIs
      • Remove duplicate key|value pairs (same normTerm|CUI with different LUI and SUI) from MRXNS_ENG.RRF.2011AB
      • MRXNS_ENG.RRF.2011AB has 9,537,805 lines, so loading it into a Hashtable uses a lot of memory. The alternative is to use a database

    • This is a one-to-many mapping: one normalized term can map to several CUIs
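
  • Example - Sketch:
    The mapping step above can be sketched in Java as follows. This is a minimal illustration, not the tool's actual source: the class name NormTermCuiMap and its methods are hypothetical, it uses HashMap and Set rather than the Hashtable and Vector<String> described above so that duplicate normTerm|CUI pairs are dropped on insert, and the LUI and SUI values in the sample lines are made-up placeholders. It assumes the input term has already been normalized by Lvg Norm and that each line follows the MRXNS_ENG.RRF layout, with the normalized term in field 2 and the CUI in field 3.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical helper class for illustration; not part of the Sub-Term Mapping Tools.
class NormTermCuiMap {
    // Key: normalized term. Value: the distinct CUIs mapped to that term.
    private final Map<String, Set<String>> table = new HashMap<>();

    // Adds one pipe-delimited line in MRXNS_ENG.RRF style.
    // Field 2 is the normalized term and field 3 is the CUI.
    public void addLine(String line) {
        String[] fields = line.split("\\|");
        if (fields.length < 3) {
            return; // skip malformed lines
        }
        String normTerm = fields[1];
        String cui = fields[2];
        // A Set silently ignores a repeated normTerm|CUI pair that differs
        // only in LUI and SUI.
        table.computeIfAbsent(normTerm, k -> new LinkedHashSet<>()).add(cui);
    }

    // Returns all CUIs for an already-normalized term (empty set if none).
    public Set<String> lookup(String normTerm) {
        return table.getOrDefault(normTerm, Collections.emptySet());
    }

    public static void main(String[] args) {
        NormTermCuiMap map = new NormTermCuiMap();
        // LUI and SUI values below are placeholders, not real identifiers.
        map.addLine("ENG|tree trumpet|C1460803|L0000001|S0000001|");
        map.addLine("ENG|tree trumpet|C1460803|L0000002|S0000002|");
        map.addLine("ENG|tree trumpet|C1492035|L0000003|S0000003|");
        System.out.println(map.lookup("tree trumpet"));
    }
}
```

    In this sketch the second sample line adds nothing, because its normTerm|CUI pair is already present, so the lookup yields the two distinct CUIs for "tree trumpet". For the full file, the same addLine call could be applied to each line read from MRXNS_ENG.RRF, subject to the memory concern noted above.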