You are here

De-identification of Address, Date, and Alphanumeric Identifiers in Narrative Clinical Reports.

Printer-friendly versionPrinter-friendly version
Kayaalp M, Browne AC, Dodd Z, Sagan P, McDonald CJ
AMIA Annu Symp Proc. 2014 Nov 14;2014:767-76. eCollection 2014.
Abstract: 

INTRODUCTION

The Privacy Rule of Health Insurance Portability and Accountability Act requires that clinical documents be stripped of personally identifying information before they can be released to researchers and others. We have been developing a software application, NLM Scrubber, to de-identify narrative clinical reports.

METHODS

We compared NLM Scrubber with MIT's and MITRE's de-identification systems on 3,093 clinical reports about 1,636 patients. The performance of each system was analyzed on address, date, and alphanumeric identifier recognition separately. Their overall performance on de-identification and on conservation of the remaining clinical text was analyzed as well.

RESULTS

NLM Scrubber's sensitivity on de-identifying these identifiers was 99%. It's specificity on conserving the text with no personal identifiers was 99% as well.

CONCLUSION

The current version of the system recognizes and redacts patient names, alphanumeric identifiers, addresses and dates. We plan to make the system available prior to the AMIA Annual Symposium in 2014.

Kayaalp M, Browne AC, Dodd Z, Sagan P, McDonald CJ. De-identification of Address, Date, and Alphanumeric Identifiers in Narrative Clinical Reports. AMIA Annu Symp Proc. 2014 Nov 14;2014:767-76. eCollection 2014.