public class ToLuiNormalize extends Transformation implements java.lang.Cloneable
If the flow component B generates multiple outputs in LuiNorm, the results are determined as shown in the following two cases:
History:
ToUnicodeCoreNorm,
ToRemoveGenitive,
ToRemoveS,
ToReplacePunctuationWithSpace,
ToStripStopWords,
ToLowerCase,
ToUninflectWords,
ToCanonicalize,
ToStripMapUnicode,
ToSortWordsByOrderNO_MUTATE_INFO, UPDATE| Constructor and Description |
|---|
ToLuiNormalize() |
| Modifier and Type | Method and Description |
|---|---|
static void |
main(java.lang.String[] args)
A unit test driver for this flow component.
|
static java.util.Vector<LexItem> |
Mutate(LexItem in,
int maxTerm,
java.util.Vector<java.lang.String> stopWords,
java.sql.Connection conn,
RamTrie trie,
java.util.Hashtable<java.lang.Character,java.lang.String> symbolMap,
java.util.Hashtable<java.lang.Character,java.lang.String> unicodeMap,
java.util.Hashtable<java.lang.Character,java.lang.String> ligatureMap,
java.util.Hashtable<java.lang.Character,java.lang.Character> diacriticMap,
java.util.Hashtable<java.lang.Character,java.lang.String> nonStripMap,
RTrieTree removeSTree,
boolean detailsFlag,
boolean mutateFlag)
Performs the mutation of this flow component.
|
GetTestStr, PrintResult, PrintResults, UpdateLexItem, UpdateLexItem, UpdateLexItempublic static java.util.Vector<LexItem> Mutate(LexItem in, int maxTerm, java.util.Vector<java.lang.String> stopWords, java.sql.Connection conn, RamTrie trie, java.util.Hashtable<java.lang.Character,java.lang.String> symbolMap, java.util.Hashtable<java.lang.Character,java.lang.String> unicodeMap, java.util.Hashtable<java.lang.Character,java.lang.String> ligatureMap, java.util.Hashtable<java.lang.Character,java.lang.Character> diacriticMap, java.util.Hashtable<java.lang.Character,java.lang.String> nonStripMap, RTrieTree removeSTree, boolean detailsFlag, boolean mutateFlag) throws java.sql.SQLException
in - a LexItem as the input for this flow componentmaxTerm - tthe maxinum number of permutation term (uninflect)stopWords - A Vector of String - stop wrods listconn - LVG database connectiontrie - LVG ram triesymbolMap - a hash table contains the unicode symbols mappingunicodeMap - a hash table contains the unicode mappingligatureMap - a hash table contains the mapping of ligaturesdiacriticMap - a hash table contains the mapping of diacriticsnonStripMap - a hash table contains the non-Strip map unicoderemoveSTree - a reverse trie tree of removeS pattern rulesdetailsFlag - a boolean flag for processing details informationmutateFlag - a boolean flag for processing mutate informationjava.sql.SQLException - if errors occurr while connect to LVG database.DbBasepublic static void main(java.lang.String[] args)
args - arguments