Generating Gold Standard
${C_SPELL}/PostProcess
${C_SPELL}/PostProcess/bin
${C_SPELL}/PostProcess/data/Brat/NewTestRevised/
${C_SPELL}/PostProcess/data/Brat/NewTestRevised/bratRev
${C_SPELL}/PostProcess/bin/PostBratNewTest
2
20
(real-word)
21
(non-word)
ArrayList<BratTagObj> bratTagList
Core Algorithm
bto.ToUidStr()
as key for finding container and containee
fileId|tagId|type|startPos|endPos|srcTxt|orgTarTxt|
bto.ToReportStr()
as key for keyBto
fileId|tagId|type|startPos|endPos|srcTxt|orgTarTxt|newTarTxt|
Tag Type | Correction Text | Number |
---|---|---|
ToSplit | Annotation Notes | 164 |
ToSplitOnP | Auto-generated | 320 |
ToMerge | Auto-generated | 27 |
Misspell | Annotation Notes | 438 |
Informal | Annotation Notes (only correctable) | 413 (ABB: 102; ACR: 107; ':106; Correctable: 98) |
RealWord | Annotation Notes | 223 (= 1195 + 14 - 986) |
Punctuation | N/A | 246 |
WordExists | N/A | 79 |
Unknown | N/A | 21 |
Garbage | N/A | 15 |
Oov | N/A | 0 |