Sorting Order: By Alphabetic Order
I. Description:
Finally, all base forms are sorted by alphabetic order so that all base forms are uniquely sorted and citation form is uniquely assigned.
II. Example
Argentinean
is chosen over Argentinian
as the citation form because of the alphabetic order. Two other examples of new citation forms because of alphabetic order are shown in the following table:
2013- | 2014+ |
---|---|
{base=Argentinian spelling_variant=Argentinean entry=E0000435 cat=noun variants=reg } |
{base=Argentinean spelling_variant=Argentinian entry=E0000435 cat=noun variants=reg } |
{base=accessory spelling_variant=accessary entry=E0006710 cat=noun variants=reg compl=pphr(to,np) } |
{base=accessary spelling_variant=accessory entry=E0006710 cat=noun variants=reg compl=pphr(to,np) } |
{base=New Zealand black spelling_variant=New Zealand Black entry=E0004456 cat=noun variants=reg } |
{base=New Zealand Black spelling_variant=New Zealand black entry=E0004456 cat=noun variants=reg } |
III. Impacts (on Norm)
Results of NLP programs use citation forms might change accordingly. For examples, using above LexRecords, the result of Norm (which uses -f:Ct) is changed accordingly between 2013- and 2014+:
Argentinean
or Argentinian
argentinian
argentinean
accessary
or accessory
accessory
accessary
Please note that both New Zealand Black
and New Zealand black
are normalized to black new zealand
because the lowercase flow compoment (-f:l) is processed before -f:Ct in Norm.