The SPECIALIST Lexicon

Spelling Variant Patterns - Test on Lexicon-LRSPL

Norm, MES, and ES are used in a sequential order to retrieve the most spelling variant groups. This model is tested on Lexicon Spelling variants (LRSPL) for the recall. This is the inital test on this model. The results are shown as follows:

  • Results:

    2014

    StepMethodsEdit DistanceCombined No.Total GroupsSpVars GroupsSingle GroupsSpVar No.Group (Recall)
    0Lexicon.2014N/A0N/AN/A0249,231100.00 %
    1NormN/A126,708122,52392,41530,108219,12387.92%
    2MES214,571107,952104,1573,795245,43698.48%
    3ES11,418106,534105,2011,333247,89899.47%
    4MES3145106,389105,3081,081248,15099.57%
    5ES2431105,958105,618340248,89199.86%
    6MES437105,921105,647274248,95799.89%

    2015

    StepMethodsEdit DistanceCombined No.Total GroupsSpVars GroupsSingle GroupsSpVar No.Group (Recall)
    0Lexicon.2015N/A0N/AN/A0260,431100.00 %
    1NormN/A?126,67596,21630,459229,97288.30%
    2MES214,773111,902108,0923,810256,62198.53%
    3ES11,428110,474109,1441,330259,10199.49%
    4MES3144110,330109,2521,078259,35399.59%
    5ES2429109,901109,563338260,09399.87%
    6MES437109,864109,592272260,15999.90%

  • Future Work:
    • Only recall are tested (because the results are used for SpVar Matcher). Should try Lexicon (include no-spVar) to check precision and recall and find the optimum point.
      => The PRF model should be established for:
      • finding the optimal processes of this model
      • a measurement index when enhanced SpVar matcher and its componenet
      • Could be a short paper
    • Try different order in step to gain the best results (precision and recall)