public class ToUnicodeCoreNorm extends Transformation implements java.lang.Cloneable
Four mapping tables are used in this flow:
History:
NO_MUTATE_INFO, UPDATE
Constructor and Description |
---|
ToUnicodeCoreNorm() |
Modifier and Type | Method and Description |
---|---|
static java.lang.String |
GetCoreNormStr(java.lang.String inStr,
java.util.Hashtable<java.lang.Character,java.lang.String> symbolMap,
java.util.Hashtable<java.lang.Character,java.lang.String> unicodeMap,
java.util.Hashtable<java.lang.Character,java.lang.String> ligatureMap,
java.util.Hashtable<java.lang.Character,java.lang.Character> diacriticMap)
Get unicode synonym
|
static void |
main(java.lang.String[] args)
A unit test driver for this flow component.
|
static java.util.Vector<LexItem> |
Mutate(LexItem in,
java.util.Hashtable<java.lang.Character,java.lang.String> symbolMap,
java.util.Hashtable<java.lang.Character,java.lang.String> unicodeMap,
java.util.Hashtable<java.lang.Character,java.lang.String> ligatureMap,
java.util.Hashtable<java.lang.Character,java.lang.Character> diacriticMap,
boolean detailsFlag,
boolean mutateFlag)
Performs the mutation of this flow component.
|
GetTestStr, PrintResult, PrintResults, UpdateLexItem, UpdateLexItem, UpdateLexItem
public static java.util.Vector<LexItem> Mutate(LexItem in, java.util.Hashtable<java.lang.Character,java.lang.String> symbolMap, java.util.Hashtable<java.lang.Character,java.lang.String> unicodeMap, java.util.Hashtable<java.lang.Character,java.lang.String> ligatureMap, java.util.Hashtable<java.lang.Character,java.lang.Character> diacriticMap, boolean detailsFlag, boolean mutateFlag)
in
- a LexItem as the input for this flow componentsymbolMap
- a hash table contain the unicode symbols mappingunicodeMap
- a hash table contain the unicode mappingligatureMap
- a hash table contain the ligatures mappingdiacriticMap
- a hash table contain the diacritics mappingdetailsFlag
- a boolean flag for processing details informationmutateFlag
- a boolean flag for processing mutate informationpublic static java.lang.String GetCoreNormStr(java.lang.String inStr, java.util.Hashtable<java.lang.Character,java.lang.String> symbolMap, java.util.Hashtable<java.lang.Character,java.lang.String> unicodeMap, java.util.Hashtable<java.lang.Character,java.lang.String> ligatureMap, java.util.Hashtable<java.lang.Character,java.lang.Character> diacriticMap)
inStr
- an input stringsymbolMap
- a hash table contain the unicode symbols mappingunicodeMap
- a hash table contain the unicode mappingligatureMap
- a hash table contain the ligatures mappingdiacriticMap
- a hash table contain the diacritics mappingpublic static void main(java.lang.String[] args)
args
- arguments