WordInd is used to create word indexes. It breaks up a string into a unique list of lowercased "words". WordInd follows the UMLS definition of word, a sequence of one or more alphanumeric characters.
WordInd uses UTF-8 for the input and output since 2004 release.
Follow the installation instructions to install lexical tool and run the wordInd program. Check on the following items only if you don't use the provided script to install Lexical tools.
Enter the command:
shell> wordInd -p - Please input a term (type "Ctl-d" to quit) > aaaa bbbb:cccc aaaa bbbb cccc
where:
WordInd reads from standard input and writes to standard output, one line per word. The fields in the output are in the order of the -F options.
Please refer to design document
Please refer to design document