The SPECIALIST Lexicon

Generate Multiwords from Verb Complements: Process

This section describes the processes to retrieve multiwords from verb complement types form associated verb in the Lexicon.

I. Setup and Inputs

  • Directory: ${MULTIWORDS}/data/${YEAR}/outData/14.VerbComplements
  • Program: ${MULTIWORDS}/bin/14.VerbComplements ${YEAR}

II. Processes

StepDescirptionInputsOutputsNotes
1Get raw LVCs
  • ./inData/LEXICON.release
  • lightVerbs.raw
  • use the latest LEXICON
2i
  • Add tags (from previous tags and WordNet) to raw LVCs
  • Generate multiword candidates for tagging
  • lightVerbs.raw
  • ./inData/lightVerbs.data.tag
  • ./inData/WnIndexWords.data.3.0.mw
  • lightVerbs.cand
  • lightVerbs.tag
  • Send lightVerbs.cand to linguists for tagging
3Verify linguist tagsi
  • lightVerbs.cand.tag.${YEAR}
i
None
  • copy tagged file to lightVerbs.cand.tag.${YEAR}
  • append lightVerbs.cand.tag.${YEAR} to ./inData/lightVerbs.data.tag
  • rerun step 2 until lightVerbs.cand is 0
4i
  • Get multiwords from tagged LVCs
  • Get stats reports
  • lightVerbs.tag
  • inData/inflVars.data
  • lightVerbs.data
  • lightVerbs.infl
  • lightVerbs.stats
  • Use LVC type in the script
11Get raw VPCs
  • ./inData/LEXICON.release
  • verbParticles.raw
  • use the latest LEXICON
12
  • Add tags (from previous tags and WordNet) to raw VPCs
  • Generate multiword candidates for tagging
  • verbParticles.raw
  • ./inData/verbParticles.data.tag
  • ./inData/WnIndexWords.data.3.0.mw
  • verbParticles.cand
  • verbParticles.tag
  • Send verbParticles.cand to linguists for tagging
13Verify linguist tags
  • verbParticles.cand.tag.${YEAR}
Nonenbsp;
  • copy tagged file to verbParticles.cand.tag.${YEAR}
  • append verbParticles.cand.tag.${YEAR} to ./inData/verbParticles.data.tag
  • rerun step 2 until verbParticles.cand is 0
14
  • Get multiwords from tagged VPCs
  • Get stats reports
  • verbParticles.tag
  • inData/inflVars.data
  • verbParticles.data
  • verbParticles.infl
  • verbParticles.stats
  • Use VPC type in the script
20 Combine LVCs and VPCs
  • verbComplements.data
  • verbComplements.infl
  • verbComplements.stats