Analysis: Get units from Lexicon by specifying LeadUnit or EndUnit
I. Introduction
This section describes an analysis tool that retrieve all units from Lexicon by specifying the lead-unit and end-unit. The results are used to verify invalid/valid leadUnit and endUnit for case study.
II. Processes
- directory: ${MULTIWORDS_DIR}/bin
- program: 3.InvalidLeadEndUnit
- Run program:
shell> ./3.InvalidLeadEndUnit ${YEAR}
- Processes:
Step | Description | IO | Notes - Examples
|
---|
10 | Get multiwords from Lexicon by the speficfied lead-unitGetTermFromAFileByLeadTerm.java - Get units from a file by
- the speficied lead-unit
- exclude lead-end-unit candidates
- exclude child lead-unit candidates
Inputs:- ./outData/3.InvalidLeadEndTerm/lexMultiwords.data
- ./outData/3.InvalidLeadEndTerm/invalidLeadEndTermCandidates.data
- ./outData/3.InvalidLeadEndTerm/invalidLeadTermsCandidatesChild.data
Outputs: - ./outData/3.InvalidLeadEndTerm/LeadTermTest/unitByLeadTerm.${LEAD_TERM}.out
| - 10 sec.
- Results include three level of units
11 | Get multiwords from Lexicon by the speficfied end-unitGetTermFromAFileByEndTerm.java - Get units from a file by
- the speficied end-unit
- exclude lead-end-unit candidates
- exclude child end-unit candidates
Inputs:- ./outData/3.InvalidLeadEndTerm/lexMultiwords.data
- ./outData/3.InvalidLeadEndTerm/invalidLeadEndTermCandidates.data
- ./outData/3.InvalidLeadEndTerm/invalidEndTermsCandidatesChild.data
Outputs: - ./outData/3.InvalidLeadEndTerm/LeadTermTest/termByEndTerm.${LEAD_TERM}.out
| - 10 sec.
- Results include three level of units
|
|
|
|