LexCheck

ValidateContentFromTextFile

  • Descriptions:
    Validates the contents of lexical records read from a text file. Each lexical record is validated on its own (no cross-reference checks among lexical records):
    • Check Reg (glreg & regd):
      • Check inflection rules on glreg for nouns
      • Check inflection rules on regd for verbs and adjs
    • Check/fix duplicated terms:
      • Check duplicated base & spelling variants
      • Check duplicated acronyms
      • Check duplicated abbreviations
      • Check duplicated nominalizations in noun, verb, adj
      • Check duplicated variant (irreg) in noun, verb, adj, adv, etc.
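
The duplicate check/fix listed above can be pictured with a minimal Java sketch. It is not LexCheck's actual code: the class and method names are hypothetical, and it assumes that two terms count as duplicates only when their strings are exactly equal and that the first occurrence is the one kept.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;

// Illustrative only: DuplicateTermSketch and its methods are hypothetical names,
// not part of the LexCheck API.
class DuplicateTermSketch {
    // Reports whether any term occurs more than once (exact string match, an assumption).
    static boolean hasDuplicates(List<String> terms) {
        return new LinkedHashSet<>(terms).size() != terms.size();
    }

    // Returns the terms in their original order, keeping only the first occurrence of each.
    static List<String> removeDuplicates(List<String> terms) {
        return new ArrayList<>(new LinkedHashSet<>(terms));
    }

    public static void main(String[] args) {
        // Spelling variants of one record, with "colour" listed twice.
        List<String> spellingVars = List.of("colour", "color", "colour");
        System.out.println(hasDuplicates(spellingVars));    // prints true
        System.out.println(removeDuplicates(spellingVars)); // prints [colour, color]
    }
}
```

The same pattern would apply to each of the listed fields (acronyms, abbreviations, nominalizations, irregular variants), one list per field within a single record.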

  • Usage:
    shell> ValidateContentFromTextFile <inFile> <outFile> <prepositionFile> <particleFile> <irregExpFile> [-v]
    • inFile: lexical record in text format
    • outFile: auto-fixed lexical record in text format
    • prepositionFile: the preposition file
      default: use the prepositions.data included in the lexCheck${YEAR}api.jar or lexCheck${YEAR}dist.jar
      => prepositions are used by the class Compl.CheckPreposition.java
    • particleFile: the particle file
      default: use the particles.data included in the lexCheck${YEAR}api.jar or lexCheck${YEAR}dist.jar
      => particles are used by the class Compl.CheckParticle.java
    • irregExpFile: the irregExceptions.data file
      default: if the specified file does not exist, use the irregExceptions.data included in the lexCheck${YEAR}api.jar or lexCheck${YEAR}dist.jar
    • -v: set verbose to true, default: false
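
The fallback for irregExpFile and the -v default described above might look like the following Java sketch. It is an assumption-laden illustration, not the tool's implementation: the class name, method names, and the classpath location "/irregExceptions.data" are hypothetical, since this page does not say where the jar stores the file.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;

// Illustrative only: names below are hypothetical, not part of the LexCheck API.
class IrregExpFileSketch {
    // Assumed classpath location of the bundled default file.
    static final String DEFAULT_RESOURCE = "/irregExceptions.data";

    // True when the path names an existing regular file.
    static boolean useSpecifiedFile(String path) {
        return path != null && Files.isRegularFile(Paths.get(path));
    }

    // True when "-v" appears among the arguments; verbose stays false otherwise.
    static boolean isVerbose(String[] args) {
        for (String arg : args) {
            if ("-v".equals(arg)) {
                return true;
            }
        }
        return false;
    }

    // Opens the specified file if it exists, else the bundled default.
    // Returns null if the default resource is not on the classpath.
    static InputStream open(String path) throws IOException {
        if (useSpecifiedFile(path)) {
            return Files.newInputStream(Paths.get(path));
        }
        return IrregExpFileSketch.class.getResourceAsStream(DEFAULT_RESOURCE);
    }

    public static void main(String[] args) {
        // The fifth positional argument is irregExpFile in the usage line.
        String irregExpFile = args.length > 4 ? args[4] : null;
        System.out.println("use specified irregExpFile: " + useSpecifiedFile(irregExpFile));
        System.out.println("verbose: " + isVerbose(args));
    }
}
```

Checking for the file before opening it keeps the decision (specified file or bundled default) separate from the I/O, which makes the choice easy to log when verbose is on.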

  • Outputs:
    • On screen message:
      • Confirmation message if all records are valid
      • Otherwise, an error message
    • outFile: auto-fixed lexicon (duplicates removed)

  • Notes:
    • Must include:
      • lexCheck${YEAR}dist.jar (for LVG APIs) or
      • lexCheck${YEAR}api.jar and lvg${YEAR}api.jar
    • Benchmark run time for Lexicon to generate both Lexicon and inflVars: 10 ~ 15 sec.

  • Examples:
    • shell> ValidateContentFromTextFile lexicon.txt lexicon.fixed ./data/Files/prepositions.data ./data/Files/particles.data ./data/Files/irregExceptions.data -v
    • shell> ValidateContentFromTextFile lexicon.txt lexicon.fixed ./data/Files/prepositions.data ./data/Files/particles.data ./data/Files/irregExceptions.data