The SPECIALIST Lexicon
Prefix Table
I. Inputs
- LEXICON.ascii
- normInflVarEui.data
II. Outputs
- File: prefixTable.data
- Format:
normalized inflVar | EUI | Citation Form | spelling var 1 | spelling var 2 | ...
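As an illustration of the record layout above, here is a minimal sketch of how one line of prefixTable.data might be split into fields. It assumes the fields are separated by "|" and that any padding around a field is not significant; the names PrefixRecord and parse_prefix_line are hypothetical, and the EUI in the usage note is made up.

```python
from typing import NamedTuple, Tuple


class PrefixRecord(NamedTuple):
    """One row of prefixTable.data (field names are hypothetical)."""
    norm_infl_var: str
    eui: str
    citation_form: str
    spelling_vars: Tuple[str, ...]


def parse_prefix_line(line: str) -> PrefixRecord:
    # Split on the field separator and trim padding around each field
    # (assumption: surrounding whitespace is not part of the data).
    fields = [f.strip() for f in line.rstrip("\n").split("|")]
    if len(fields) < 3:
        raise ValueError(f"expected at least 3 fields, got {len(fields)}: {line!r}")
    # Everything after the third field is a spelling variant; empty fields
    # left by a trailing separator are dropped.
    variants = tuple(f for f in fields[3:] if f)
    return PrefixRecord(fields[0], fields[1], fields[2], variants)
```

For example, parse_prefix_line("color|E0000000|color|colour") would give a record whose citation_form is "color" and whose spelling_vars is ("colour",).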
III. Algorithm
- Get Prefix table:
shell>GetPrefixTable normInflVarEui.data LEXICON.ascii prefixTable.data.org
- Read in LEXICON.ascii and load it into Hashtable<EUI, LexRecordObj>
- Read in normInflVarEui.data (ASCII)
- This table is used to replace prefix table
- It is linked from ${STMT_DIR}/PreProcess/data/Lex/${YEAR}Ascii
- It is done by ${STMT_DIR}/PreProcess/bin/1.PreStmt
- Go through each line of normInflVarEui.data and print the corresponding record to prefixTable.data.org
- Sort and remove duplicates:
shell>sort -u prefixTable.data.org > prefixTable.data
- Check that the output is pure ASCII
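The steps above can be sketched roughly as follows. This is not the GetPrefixTable program itself, only an illustration under stated assumptions: each line of normInflVarEui.data is taken to be "normalized inflVar|EUI", the Hashtable<EUI, LexRecordObj> is stood in for by a plain dict mapping an EUI to a (citation form, spelling variants) pair, and rows whose EUI has no record are skipped. All function names are hypothetical.

```python
def build_prefix_lines(norm_lines, records):
    """Join normInflVarEui rows to lexical records keyed by EUI.

    norm_lines: iterable of "normalized inflVar|EUI" strings (assumed layout).
    records:    dict mapping EUI -> (citation_form, [spelling variants]),
                a stand-in for the table loaded from LEXICON.ascii.
    """
    out = []
    for line in norm_lines:
        line = line.rstrip("\n")
        if not line:
            continue
        # Raises ValueError if the line has fewer than two fields.
        norm, eui = line.split("|")[:2]
        if eui not in records:
            continue  # assumption: rows without a matching record are skipped
        citation, variants = records[eui]
        out.append("|".join([norm, eui, citation, *variants]))
    return out


def sort_unique(lines):
    # For ASCII data this matches `sort -u` under LC_ALL=C:
    # drop duplicate lines, then order by code point.
    return sorted(set(lines))


def non_ascii_lines(lines):
    # Return (line number, line) pairs containing any non-ASCII character.
    return [(n, l) for n, l in enumerate(lines, start=1) if not l.isascii()]
```

An empty result from non_ascii_lines(sort_unique(rows)) would correspond to the final check passing. Keeping the three steps as separate functions mirrors the list above, so each one can be compared against the shell pipeline on its own.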