The SPECIALIST Lexicon

Sorting Order: By Alphabetic Order

I. Description:
Finally, all base forms are sorted by alphabetic order so that all base forms are uniquely sorted and citation form is uniquely assigned.

II. Example
Argentinean is chosen over Argentinian as the citation form because of the alphabetic order. Two other examples of new citation forms because of alphabetic order are shown in the following table:

2013-2014+
{base=Argentinian
spelling_variant=Argentinean
entry=E0000435
	cat=noun
	variants=reg
}
{base=Argentinean
spelling_variant=Argentinian
entry=E0000435
	cat=noun
	variants=reg
}
{base=accessory
spelling_variant=accessary
entry=E0006710
	cat=noun
	variants=reg
	compl=pphr(to,np)
}
{base=accessary
spelling_variant=accessory
entry=E0006710
	cat=noun
	variants=reg
	compl=pphr(to,np)
}
{base=New Zealand black
spelling_variant=New Zealand Black
entry=E0004456
	cat=noun
	variants=reg
}
{base=New Zealand Black
spelling_variant=New Zealand black
entry=E0004456
	cat=noun
	variants=reg
}

III. Impacts (on Norm)
Results of NLP programs use citation forms might change accordingly. For examples, using above LexRecords, the result of Norm (which uses -f:Ct) is changed accordingly between 2013- and 2014+:

  • Input: Argentinean or Argentinian
  • Results (2013-): argentinian
  • Results (2014+): argentinean
  • Input: accessary or accessory
  • Results (2013-): accessory
  • Results (2014+): accessary

Please note that both New Zealand Black and New Zealand black are normalized to black new zealand because the lowercase flow compoment (-f:l) is processed before -f:Ct in Norm.