PUBLICATIONS

Abstract

De-identification of Address, Date, and Alphanumeric Identifiers in Narrative Clinical Reports.


Kayaalp M, Browne AC, Dodd Z, Sagan P, McDonald CJ

AMIA Annu Symp Proc. 2014 Nov 14;2014:767-76. eCollection 2014.

INTRODUCTION: The Privacy Rule of the Health Insurance Portability and Accountability Act requires that clinical documents be stripped of personally identifying information before they can be released to researchers and others. We have been developing a software application, NLM Scrubber, to de-identify narrative clinical reports.

METHODS: We compared NLM Scrubber with MIT's and MITRE's de-identification systems on 3,093 clinical reports about 1,636 patients. The performance of each system was analyzed separately on address, date, and alphanumeric identifier recognition. Their overall performance on de-identification and on conservation of the remaining clinical text was analyzed as well.

RESULTS: NLM Scrubber's sensitivity in de-identifying these identifiers was 99%. Its specificity in conserving the text with no personal identifiers was 99% as well.

CONCLUSION: The current version of the system recognizes and redacts patient names, alphanumeric identifiers, addresses, and dates. We plan to make the system available prior to the AMIA Annual Symposium in 2014.


