The SPECIALIST Lexicon

Exclusive Filter: A Term contains a measurement

  • Description:
    If a term contains a measurement, it is not a valid. A measurement includes a number and an unit. Three patterns and examples are described in the following table:

  • Examples:
    IDPatternExample
    1float + unit
    • 0.5 mg
    • at -5 degrees
    • 10 cigarettes per day
    2range + unit
    • 2-3 days
    3number + unit
    • four months
    • two patients
    4float + unit/...
    • .1 mg/kg
    • 2 micrograms/ml,
    • 50 mg/kg/day
    5range + unit/...
    • 0.1-2.3 mg/day
    • 1-2 tablets
    6number + unit/...
    • two micrograms/ml
    7month + year
    • from May 2002
    • 2003 to June 2004
    • 1 January 1991
    8age
    • 4-year-old
    • 4 year-old
    • four year-old
    • 4 yr-old
    • 4 year-olds
    • 4 hour-olds

  • Input Term: core-term
  • Filter Algorithm:
    • Logics:

      DescriptionFilterTypeNotes
      Get words from inTermFT_TBD
      if contains regular measurement
      • if a word is a float, range, number
      • if the next word is a [unit] or pattern of [unit/..]
      FT_MEASUREMENTSee example 1-6 in above table
      if date (month + number)
      • if a word is a float, range, number
      • if the previous word is a [month]
      FT_MEASUREMENTSee example-7 in above table
      if age
      • XX-[AGE_UNIT]-old
      • XX [AGE_UNIT]-old
      • XX-[AGE_UNIT]-olds
      • XX [AGE_UNIT]-olds

      [AGE_UNIT]: year, yr, month, week, wk, day, d, hour, hr

      FT_MEASUREMENTSee example-8 in above table

    • source code: FilterMeasurement.java
    • FilterType: FilterType.FT_MEASUREMENT

  • Accuracy Test on Lexicon:
    • InFile:
      • ${OUT_DATA}/03.LeadEndTerm/lexWords.data
      • ${IN_DATA}/unit.data";
      • ${IN_DATA}/NRVAR"
    • Result:

      LexiconFilterSample NoPass NoTrap NoExp NoPass-Rate
      2023FT_MEASUREMENT10018671001743 124 099.9876%
      2022FT_MEASUREMENT998845998721 124 099.9876%
      2021FT_MEASUREMENT992545992421 124 099.9875%
      2020FT_MEASUREMENT983420983297 123 099.9875%
      2019FT_MEASUREMENT972721972604 117 099.9880%
      2018FT_MEASUREMENT955564955456 108 099.9887%
      2017FT_MEASUREMENT935276935193 83 099.9911%
      2016FT_MEASUREMENT915583915510 73 099.9920%
      2015FT_MEASUREMENT896213896175 38 099.9958%
      2014FT_MEASUREMENT875090875061 29 099.9967%