Because of a lapse in government funding, the information on this website may not be up to date, transactions submitted via the website may not be processed, and the agency may not be able to respond to inquiries until appropriations are enacted. The NIH Clinical Center (the research hospital of NIH) is open. For more details about its operating status, please visit cc.nih.gov. Updates regarding government operating status and resumption of normal operations can be found at OPM.gov.

Sub-Term Mapping Tools

UMLS-Core: Normalized term to CUI Mapping

  • Descriptions:
    • A simple mapping from normalized term to CUI (concept)
    • File: MRXNS_ENG.RRF (fields 2 and 3)

  • Examples - Test Cases:
    InputOutputNotes
    disease'sC0012634Genitive
    disease(s)C0012634parenthetical plural forms
    :disease.&*C0012634Punctuation
    of diseaseC0012634Stop Words
    DisEASEC0012634cases
    diseasesC0012634inflectional variants
    :Disease(S).&*C0012634Combination
    ColoursC0009393spelling variants
    Cortical Cystic DiseasesC0431713Multi words

  • Algorithm:
    • normalized input term by Lvg Norm
    • Mapping
      • It is one to many mapping, such as:
        • tree trumpet|C1460803
        • tree trumpet|C1492035
      • Use Java Hashtable
         keyValues
        TypeStringVector<String>
        DescriptionNormalized termMapped CUIs
      • Removed duplicated key|value (same normTerm|CUI with different LUI and SUI) in MRXNS_ENG.RRF.2011AB
      • MRXNS_ENG.RRF.2011AB has 9,537,805 lines. It uses lots of memory. The alternative is to use database

    • This is a one to many mapping