Computer-Aided Revision
A set of computer-aided program is developed to validate and revise the reconciled Brat annotation data. They are described follows:
${C_SPELL}/PostProcess
${C_SPELL}/PostProcess/bin
${C_SPELL}/PostProcess/data/Brat/NewTestRevised/
${C_SPELL}/PostProcess/data/Brat/NewTestRevised/brat
${C_SPELL}/PostProcess/bin/PostBratNewTest
2
1
2
Tag | Check Items |
---|---|
ToSplit |
|
ToSplitOnPunct |
|
ToMerge |
|
Misspelling |
|
Informal |
|
RealWord |
|
OutOfVocabulary |
|
WordExists |
|
Punctuation |
|
Garbage |
|
Unknown |
|
From our experience, there are two types of errors that commonly seen in spelling annotation.
3
Check Brat Tags spans - the purpose of this check is to ensure generate gold standard correctly for the cases of contain, multi-tag and overlap for both non-word and real-word