2 | Get raw synonym candidates from MRCONSO.RRF
Meta.GetSynonymCandidates.java
- same CUI
- English term: Filed-2, LAT = ENG
- not disallowed STI, such as Chemicals & Drugs, defined in SemGroups.filter.txt, use MRSTY.RRF to map CUI to STI
- Must known to Lexicon
- Must have POS of adj, noun, or verb, infl is base
- Remove acronym => it drops precision
- Remove spVars => will add them in Post-process
- Remove nominalization => will add them in Post-process
- Remove class with only single candidates => remove pure spVar & nom
|
- same CUI (definition of synonym, same concept)
- Filed-2, LAT = ENG (English only)
- Terms are normalized into lowercased core-terms (strip initial and final punctuation, then lowercased) as key in lookup mapping for Lexical rcord
- known to Lexicon (design spec.)
- have POS of adj, noun, or verb (design spec.)
- infl is base (design spec.)
- Base form are used n the output
|
|
3 |
- Analyze and check raw synonym candidate list
- Read in and print out, then compare if they are the same.
|
- ./outData/Candidates/synonymCan.data
|
- ./outData/Candidates/synonymCan.raw.data.out
- ./outData/Candidates/diff (must be 0)
next, go to step-10
|