SPECIALIST Lexicon

Antonym Generation for PD Model

shell>cd ${ANTONYM_DIR}/bin
shell>GetAntonyms ${YEAR}

PD model: Prefix Derivations with negations
use the latest prefixD
antCand from previous prefixD are not completed yet (~7,320 for 2024 release)

Option	Description	input	Output	Notes	Option
30	get antonym candidates from prefixD derivation.getantcandfromprefixd.java	${PD_DIR}/input/derivation.data ${LEX_DIR}/input/inflvars.data ${ANT_DIR}/input/antcand.data.tag.${year} ${ANT_DIR}/input/domain.data	./output/cand/antcandprefixd.data ./output/cand/antcandprefixd.data.tag => aPairs alreayd tagged ./output/cand/antcandprefixd.data.tbd => aPairs to be done, need to be 0 ./output/candtagged/antcandprefixd.data.tag.tagged	if the first time: mkdir ./${year}/output/cand mkdir ./${year}/output/candtagged use updated derivation.data and inflvars.data send antcandprefixd.data.tbd to linguist to complete the tags as for 2025 release, there are 7k+ tbd aPairs needs to be tagged. This number is expected to be much less (only for the annual growth of the prefixD) during the annual release once this is completed tagged.	30
31	Validate and fix tags of antonym candidates (PD) Antonym.ValidateTaggedCand.java	./output/candTagged/antCandPrefixD.data.tag.tagged ${ANT_DIR}/input/domain.data	./output/candTagged/antCandPrefixD.data.tag.fixed	Append linguist's tag to ${PD_DIR}/output/candTagged/antCandPrefixD.data.tag.tagged Run this step until the tag and fixed file are the same Fixed file is the auto-fixes on [TYPE_TBD] and [DOMAIN_TBD] to [NA] and [DOMAIN_NONE]. shell> sort -u antCandPrefixD.data.tag.fixed > antCandPrefixD.data.tag.fixed.uSort Manually copy the sorted-fixed file to tagged file Manually copy antCandPrefixD.data.tag.tagged to antCandPrefixD.data.tag.tagged.${YEAR}.${NO}	31
32	Update release antonyms tagged file form PD Antonym.UpdateAllTaggedFile	./output/candTagged/antCandPrefixD.data.tag.tagged.${YEAR} ${ANT_DIR}/input/antCand.data.tag.${YEAR} ${ANT_DIR}/input/domain.data	${ANT_DIR}/input/antCand.data.tag.updated ${ANT_DIR}/input/antCand.data.tag.updated.srcConflict ${ANT_DIR}/input/antCand.data.tag.updated.tarConflict	This step auto-update all antonym candidate tag file Manully copy antCand.data.tag.updated to antCand.data.tag.updated.3.PD Manully copy/link antCand.data.tag.updated to antCand.data.tag.${YEAR} src could be conflicted (form different sources), for example: activate\|E0007090\|deactivate\|E0417566\|verb\|Y\|UB\|BN2\|quality\|SN activate\|E0007090\|deactivate\|E0417566\|verb\|Y\|UB\|BN2\|quality\|PD The output file is used to generate antonym and negation files for the release. Re-run steps 30-32 until it passes all steps.	32

The SPECIALIST Lexicon