Because of a lapse in government funding, the information on this website may not be up to date, transactions submitted via the website may not be processed, and the agency may not be able to respond to inquiries until appropriations are enacted. The NIH Clinical Center (the research hospital of NIH) is open. For more details about its operating status, please visit cc.nih.gov. Updates regarding government operating status and resumption of normal operations can be found at OPM.gov.

The SPECIALIST Lexicon

Generating Synonyms from the Original Lexical Tools Synonyms

I. Introduction

Before the new synonym approach in 2017 (2016-), the "SM.DB" in the Lexicon (2016) is a static file (with miminum updates since 2002), which includes 5058 sPairs. The "synonyms.data" file in the Lexical Tools (2016) were actually used for most application with very limited updated (since 2002) includes 5198 sPairs. In other words, "synonyms.data" contains "SM.DB" and latest updates. Thus, the sPairs in the "synonyms.data" are marked as NLP_LVG and kept in the new release. sPairs in "synonyms.data" are updated annaully by moving those sPairs that are found from Meta or Nom.

II. Process

  • Directory: ${LEXICON_SYNONYMS}
  • program: ./Synonym/GenSynonymFromLvg.java
  • Inputs:
    • ./inData/synonyms.data.lvg (from Lvg.2016)
    • ./outData/Results/synonymFromMeta.data
    • ./outData/Results/synonymFromNom.data
  • Algorithm:
    • Go though all sPairs from original synonyms.data from Lvg
    • The synonyms.data (5,198) form Lvg contains the latest updates and thus was used. All 5,058 sPairs in the SM.DB are in the synonyms.data form the Lvg.
    • Exclude those are already in synonymMeta and synonymNom
    • print to output file
  • Program:
    • shell> cd ./bin
    • shell> GetSynonyms ${YEAR} > log.15
      15
  • Outputs:
    • ./outData/Results/synonymFromLvg.data.dup (exist sPairs)
    • ./outData/Results/synonymFromLvg.data
    • Format:
      NpLc Synonym-1Synonym-1Pos-1Synonym-2Pos-2ORG_LVG