CSpell

Errors on Tags - Source Error

I. Introduction

This is the detailed log for revising the baseline Brat tags. This page lists terms (tokens) that are tagged in Brat but whose original source terms are spelled correctly.

II. Revision Program

  • Program: ${POST_PROCESS}/bin/PostBrat
  • Input Data: ${POST_PROCESS_DATA}/Brat/Baseline/brat
  • Output Data: ${POST_PROCESS_DATA}/Brat/BaselineRevised/bratRevised

  • In Brat annotation - correct spelling for original terms
    • shell> PostBrat
      1
      21
    • Manually go through each finding (errOnNonWordTags.data.src.tag)

      Tag        Instance No  Notes
      Updated in Brat
      TAG_ER_ER  1            The original tag is an error. The original term is correct (it does not need to be corrected).
      TAG_ER_SV  6            The original tag is an error. The original term is correct (spelling variants).
      Add RealWord Tag in Brat
      TAG_OK_AA  24           The original tag is OK (the term is a correct spelling - abbreviations or acronyms)
      TAG_OK_RW  18           The original tag is OK (the term is a correct spelling - realWord)
      Do nothing
      TAG_OK_OK  3            The original tag is OK (correction is split)
      TAG_OK_ST  24           The original tag is OK (correction is split)
      TAG_OK_MG  2            The original tag is OK (correction is merge)
      TAG_TBD    0            To be done
    • Document the findings here (html)
    • Manually apply the updates to the revised Brat (BaselineRevised)
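The instance counts in the tag list can be cross-checked against the sizes of the three lists in the Revision Log. The following is a minimal sketch, not part of PostBrat: it assumes each flattened entry splits into a tag name followed by a count (for example, TAG_ER_ER and 1), and it assumes the Error list collects both TAG_ER_ tags. The names `tag_counts`, `log_sections`, and `total_for_prefix` are hypothetical.

```python
# Instance counts as read from the tag list
# (assumed split of each entry into tag name and count).
tag_counts = {
    "TAG_ER_ER": 1,
    "TAG_ER_SV": 6,
    "TAG_OK_AA": 24,
    "TAG_OK_RW": 18,
    "TAG_OK_OK": 3,
    "TAG_OK_ST": 24,
    "TAG_OK_MG": 2,
    "TAG_TBD": 0,
}

# Sizes of the three Revision Log lists, keyed by the tag prefix
# assumed to feed each list (Error, Real-Words, Abbreviations/Acronyms).
log_sections = {"TAG_ER_": 7, "TAG_OK_RW": 18, "TAG_OK_AA": 24}


def total_for_prefix(prefix, counts):
    """Sum the counts of all tags whose name starts with the prefix."""
    return sum(n for tag, n in counts.items() if tag.startswith(prefix))


for prefix, expected in log_sections.items():
    actual = total_for_prefix(prefix, tag_counts)
    status = "ok" if actual == expected else "mismatch"
    print(f"{prefix}: tags={actual} log={expected} {status}")
```

Under these assumptions the three totals agree with the list sizes stated in the Revision Log headings, which supports reading the trailing digits of each entry as the instance count.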

III. Revision Log

  • Error (7) - (errOnNonWordTags.data.src.ER)
    File ID  ErrorType    Position   Original Term   Corrected Term  Notes
    10714    Punctuation  117|121    dr.s            drs             1, Delete-1
    11       Punctuation  69|76      indepth         in-depth        2, Delete-2
    12751    ToSplit      153|161    bonafide        bona fide       3, Delete-3
    13014    Misspelling  227|241    physio-therapy  physiotherapy   4, Delete-4
    13787    Misspelling  619|626    on-line         online          5, Delete-5
    14       Misspelling  440|448    develope        develop         6, Delete-6
    2        Misspelling  107|116    year-long       yearlong        7, Delete-7

  • Real-Words (18) - (errOnNonWordTags.data.src.OK_RW):
    File ID  ErrorType    Position   Original Term   Corrected Term  Notes
    11186    Misspelling  294|299    repot           report          8, Change-1
    11456    Misspelling  248|249    m               am              9, Change-2
    12085    Misspelling  255|259    dies            does            10, Change-3
    12374    Misspelling  304|307    vs,             vs.             11, Change-4
    15410    Misspelling  0|9        williams'       williams        12, Change-5
    15410    Misspelling  65|74      william's       williams        13, Change-6
    16247    Misspelling  269|275    lanais          lantus          14, Change-7
    17281    Misspelling  94|99      doner           donor           15, Change-8
    17991    ToMerge      173|179    3/4 rd          3/4rd           16, Change-9
    19174    Misspelling  15|18      mam             madam           17, Change-10
    19268    Misspelling  72|81      vertebras       vertebrae       18, Change-11
    19883    Misspelling  572|575    amd             and             19, Change-12
    34       Misspelling  197|205    anderson        andersen        20, Change-13
    48       Misspelling  405|415    stereotypy      stereotypic     21, Change-14
    5        Misspelling  114|120    origen          origin          22, Change-15
    6        Misspelling  82|90      haberman        habermann       23, Change-16
    7        Misspelling  7|13       robins          robin           24, Change-17
    83       Misspelling  85|93      noonan's        noonan          25, Change-18

  • Abbreviations/Acronyms (24) - (errOnNonWordTags.data.src.OK_AA):
    File ID  ErrorType    Position   Original Term   Corrected Term  Notes
    11804    Misspelling  27|29      ti              to              26, Change-19
    12374    Misspelling  791|794    ajd             and             27, Change-20
    12624    Misspelling  231|233    nt              not             28, Change-21
    12784    Misspelling  5|7        cn              can             29, Change-22
    12788    Misspelling  167|168    d               the             30, Change-23
    12800    Misspelling  1875|1877  BB              BP              31, Change-24
    13140    Misspelling  2|4        av              have            32, Change-25
    13140    Misspelling  27|29      av              have            33, Change-26
    13140    Misspelling  30|31      d               the             34, Change-27
    13165    ToSplit      59|62      iam             i am            35, Change-28
    13347    Misspelling  159|161    nd              and             36, Change-29
    13500    Misspelling  53|57      aisa            asia            37, Change-30
    13783    Misspelling  71|74      dnt             don't           38, Change-31
    14263    Misspelling  156|159    ito             to              39, Change-32
    14276    Misspelling  53|58      prego           pregnancy       40, Change-33
    16282    Misspelling  548|552    rply            reply           41, Change-34
    17170    Misspelling  19|22      gud             good            42, Change-35
    17672    Misspelling  265|267    mt              my              43, Change-36
    17756    Misspelling  2|5        hav             have            44, Change-37
    17942    Misspelling  717|720    abd             abdominal       45, Change-38
    17948    Punctuation  342|344    ie              (i.e.           46, Delete-8
    18668    Misspelling  20|23      hav             have            47, Change-39
    18669    ToSplit      11|14      iam             i am            48, Change-40
    94       Punctuation  397|399    eg              e.g.            49, Delete-9
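
The Position values in the lists above appear to be start|end character offsets: in each row, end minus start equals the length of the original term. That reading is an inference from the data, not something this page states. Assuming it holds, a manually edited row can be sanity-checked with a small helper such as the hypothetical `span_matches` below; the sample rows are copied from the three lists.

```python
def span_matches(position: str, original_term: str) -> bool:
    """Return True when the 'start|end' span length equals the term length.

    Assumes Position is written as 'start|end' with an exclusive end offset.
    """
    start_text, end_text = position.split("|")
    return int(end_text) - int(start_text) == len(original_term)


# Sample rows from the Revision Log: (file ID, position, original term).
sample_rows = [
    ("10714", "117|121", "dr.s"),
    ("11456", "248|249", "m"),
    ("17991", "173|179", "3/4 rd"),
    ("12800", "1875|1877", "BB"),
    ("13165", "59|62", "iam"),
    ("94", "397|399", "eg"),
]

for file_id, position, term in sample_rows:
    verdict = "consistent" if span_matches(position, term) else "check row"
    print(f"{file_id:>6} {position:<10} {term!r}: {verdict}")
```

A row flagged as "check row" would suggest that the position or the original term was mistyped during the manual update.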