The SPECIALIST Lexicon

Antonym - Processes for Annual Release and Stats Reports

I. Set Up

  • base directory: ${ANTONYM_DIR}
  • binary scripts: ./bin
  • data: ./data
    • 0.Antonym
  • Pre-requirements:
    Must complete updates on aPairs from LEX, SD, PD, (TT), CC, SN
shell>cd ${ANTONYM_DIR}/bin
shell>GetAntonyms ${YEAR}

II. Processes

  • Generate aPairs, negation cue words, and antonym files
    Option and DescriptioninputOutputNotes
    1
    • generate aPairs from tagged candidates
    • Antonym.GenAPairsFromTagCand.java
    • ${ANT_DIR}/input/antCand.data.tag.${YEAR}
    • ${ANT_DIR}/input/domain.data
    • ${LEX_DIR}/input/LRSPL
    • ./output/aPairs.data
    • This program generates aPairs with all spVars
    • This program removes duplicated aPairs by spVars fromdifferent sources
    • The result include some duplicated aPairs from the different order of aPairs from different sources. They are taken cared in Step-3.
    • This is the antonym file contains unique aPairs.
    • manually copy aPairs.data to aPairs.data.${YEAR}
    2
    • generate negation cue words from tagged candidates
    • Antonym.GenNegCueWordsFromTagCand.java
    • ${ANT_DIR}/input/antCand.data.tag.${YEAR}
    • ${LEX_DIR}/input/LRSPL
    • ./output/negCueWords.data
    • This is the negation cue word file (unique).
    • manually copy negCueWords.data to negCueWords.data.${YEAR}
    3
    • Gen antonyms release file from results of step-1 (DB table for Lexical Tools)
    • Antonym.GenAntFromAPairs.java
    • ./output/aPairs.data.${YEAR}
    • ./output/antonyms.data

    • ./output/antonyms.data.tagConflict
      => Must be 0, if not, manully fix
    • ./output/antonyms.data.tagDuplicate
      => Must be 0, if not, manully review/fix
    • ./output/antonyms.data.srcConflict
      => Must be known exceptions, if not, manully review/updates
      known exeptions (3):
      • aerobic|E0007609|anaerobic|E0008760|adj|UB|O|quality|CC
      • complete|E0018156|incomplete|E0034025|adj|B|O|quality|SN
      • invisible|E0035728|visible|E0064742|adj|B|O|physical_property|SN

    • This is the antonym release (also DB table for Lexical tools).
    • manually copy antonyms.data to antonyms.data.${YEAR}
    5
    • Get stats on tagged antonym candidate file
    • ${ANT_DIR}/input/antCand.data.tag.${YEAR}
    • ./output/analysis/antCand.data.tag.stats
    • ./output/analysis/domain.out.cand
    • If run the first time, shell> mkdir ${OUTPUT}/analysis
    • Generate stats and domains from antonym candidate tagged file
    6
    • Get stats on canonical antonym from tagged candidate file
    • ${ANT_DIR}/input/antCand.data.tag.${YEAR}
    • ./output/analysis/antCand.data.tag.canon.stats
    • ./output/analysis/domain.out.cand.canon
    • Generate stats and domains from canonical antonym in tagged file
    7
    • Get stats on antonym file
    • ./output/antonyms.data
    • ./output/antonyms.data.2-10

    • ./output/analysis/antonym.data.stats
    • ./output/analysis/domain.out.antonym
    • Generate stats and domains from antonym file
    • This file is used to update antonym growth.