Genitive Spelling Variants
I. Introduction
Genitive spVars are spVars with genitive pattern (genitive-noun plus main noun).
Genitive spVars are a subset of spVars. Accordingly, they must meet the criteria of spVars (same meaning, POS, syntax, and pronunciation) as well as match the genitive pattern:
- Genitive patten: is the possession that includes [genitive noun] followed by a [main noun].
The genetive pattern includes following types:
Type | Genitive Pattern | Examples
|
---|
['s] | AAA -> AAA's |
- Bayes theorem -> Bayes's theorem
- Down Syndrome -> Down's Syndrome
|
[s'] | AAAs -> AAAs' |
- Bayes theorem -> Bayes' theorem
- Wellens syndrome -> Wellens' syndrome
|
[x'] | AAAx -> AAAx' | - Cox' proportioal hazzard model
|
[z'] | AAAz -> AAAz' |
- Alvarez disease -> Alvarez' disease
- Duroziez sign -> Duroziez' sign
|
- Same pronunciation:
to meet the pronunciation criteria of spVar, the following head noun must starts with sounds of /s/ or /z/ (such as 's', 'z', 'c', 'x').
Otherwise, they are not spVar, even they have same POS and meaning., because the pronunciation will be different.
- [Bloom syndrome] and [Bloom's syndrome] is an example of genitive spVar
- [Johanson operation] and [Johanson's operation], are not spVars because the head following noun (operation) starts with [o], and resulting in different pronunciation. In stead, they are synonyms.
- [trachs] and [trach’s] is not a genitive spVar pair because it does not match genitive pattern (no following noun). Instead, they are inflectional spVars because they are the plural forms of trach (and matches spVar criteria).
- [Addisons disease] and [Addison's disease] is spVar. However, they are identified by Metaphone and Caverphone (not spVarNorm)
- In summary, a genitive spVar pattern includes [genitive noun] and [main noun], where:
- [genitive noun]: [-'s], [-s'], [-x'], and [-z']
- [main noun]: must start with 's', 'ce', 'ci', 'cy', 'x', 'z'
That is
- [-'s s-], [-'s ce-], [-'s ci-], [-'s cy-], [-'s x-], [-'s z-]
- [-s' s-], [-s' ce-], [-s' ci-], [-s' cy-], [-s' x-], [-s' z-]
- [-x'], and [-z'] are not applied in this model
- Genitive spVars are indetified by SpVar models but not included in Lexicon (by mistakes). SpVar model is expected to be implemented in Lexbuild to enhanced Lexicon for avoiding such errors.
II. Studies on Lexicon.2015
- Process:
- 890 pairs that match genitive patterns are found from SpVarNorm (false positive) on Lexicon.2015
- 512 unique pairs left after excluding different POS and duplicates (between inflections and base forms)
- They are sent to linguist to tag [Y|N] for valid and invalid spVars inthe following format:
POS | EUI-1 | Term-1 | EUI-2 | Term-2 | Tag
|
---|
- Linguist combines EUI-1 and EUI-2 if the tag is [Y]
- Discussion:
- LexBuild uses gSpell for close-match for linguists to find terms that are spVars
- gSpell is not able to find these terms are related, thus they wre recroded as separated records
- SpVar model should be implemented to LexBuild for close-match to comprehend the possible spVars
- A good example to show powerful application of spVar model for close match
- Results:
- The following examples are tagged [Y], and also identify by this algorithm
noun|E0581842|alzheimer's cell|E0581841|alzheimer cell
noun|E0211234|ayerza's syndrome|E0011533|ayerza syndrome
noun|E0200376|baker's cyst|E0430026|baker cyst
noun|E0000919|bernard's syndrome|E0238630|bernard syndrome
noun|E0439815|darier's sign|E0439816|darier sign
noun|E0340152|epstein's syndrome|E0238661|epstein syndrome
noun|E0071032|lowe's syndrome|E0216665|lowe syndrome
noun|E0004054|marie's syndrome|E0203125|marie syndrome
noun|E0212994|pfeiffer's syndrome|E0238743|pfeiffer syndrome
noun|E0227503|schirmer's syndrome|E0342672|schirmer syndrome
noun|E0437664|spurling's sign|E0437663|spurling sign
noun|E0665753|tarlov's scale|E0665752|tarlov scale
noun|E0735990|tarlov's score|E0665754|tarlov score
noun|E0071056|wermer's syndrome|E0006203|wermer syndrome
noun|E0238783|west's syndrome|E0006229|west syndrome
noun|E0477289|wickham's stria|E0345929|wickham stria
- The following are identified by this algorithm as spVar, hwoever, they are not because of syntax:
noun|E0732110|ayre's spatula|E0525454|ayre spatula|
noun|E0570882|buckley's centauries|E0570881|buckley centauries|
noun|E0506873|dubowitz's syndrome|E0506874|dubowitz syndrome|
noun|E0227451|franceschetti's syndrome|E0340148|franceschetti syndrome|
noun|E0355922|giemsa's stain|E0002780|giemsa stain|
noun|E0644118|hoffmann-tinel's sign|E0644117|hoffmann-tinel sign|
noun|E0213099|ladd's syndrome|E0423148|ladd syndrome|
noun|E0742867|lch's|E0739251|lch|
noun|E0739251|lch's|E0742867|lch|
noun|E0539798|ledderhose's syndrome|E0539799|ledderhose syndrome|
noun|E0213207|leriche's syndrome|E0238702|leriche syndrome|
noun|E0071030|lermoyez's syndrome|E0238703|lermoyez syndrome|
noun|E0684701|neuman's systems model|E0684700|neuman systems model|
noun|E0212964|opitz's syndrome|E0698976|opitz syndrome|
noun|E0518700|papez's circuit|E0518701|papez circuit|
noun|E0570313|randle's cycle|E0570312|randle cycle|
noun|E0342663|reed's cell|E0342662|reed cell|
noun|E0005381|seidel's scotoma|E0345520|seidel scotoma|
noun|E0342599|sturge's syndrome|E0453725|sturge syndrome|
noun|E0613378|tarlov's cyst|E0342583|tarlov cyst|
noun|E0348742|thornwaldt's cyst|E0348740|thornwaldt cyst|
noun|E0200504|tinel's sign|E0644114|tinel sign|
noun|E0531106|torkildsen's shunt|E0531105|torkildsen shunt|
noun|E0681957|utstein's style|E0681956|utstein style|