Lexical Tools, Java 8.0, UTF-8, 2025 Release:
12/18/2024
Lexical Tools is a set of fundamental core NLP tools for retrieving lexical variants. Functions (flow components) include retrieving inflectional variants, uninflectional forms, spelling variants, derivational variants, synonyms, antonyms, fruitful variants, normalization, UTF-8 to ASCII conversion, lowercase, abbreviations and acronyms, etc. There are over 64
flow components and 37
options offered in this tools set.
This release includes the latest data integration from
the SPECIALIST LEXICON along with completion of software change requests (SCRs). They are briefly described as follows. Please refer to
release notes
for details.
Feature Enhancements & Bugs Fixes
- Distributed both full version and lite version of lvg.2025
- Upgraded to ICU4J 75.1 (International Components for Unicode)
- Upgraded to HSqlDb 2.7.3-jdk8 (HyperSonic SQL DB)
- Kept JRE, 1.8.0_202
- Integrated with data from the SPECIALIST LEXICON, 2025
Component | Amount | Notes
|
---|
Words (forms)
| 1,007,634 | including: single words and multiwords
|
derivations
| 158,409 | including: zeroD, prefixD, suffixD
|
synonyms
| 282,602 | Sources including: CUI, EUI, NLP
|
antonyms
| 13,076 | Source models including: LEX, SD, PD, CC, SN
|