Staff Scientist
Phone Number: (
Expertise and Research Interests: 

Dr. Misra is the lead developer on the System for Preservation of Electronic Resources (SPER) project. SPER has archived, and made accessible to the general public, a collection of FDA-released monthly publications held by NLM. These publications contain synopses of 65,000 Notices of Judgment (NJs) against food and drug adulteration and misbranding by various agencies. SPER automatically extracts metadata for these NJs from their scanned pages by plugging in AME tools developed at CEB. It uses the open-source DSpace archiving system to archive and index the contents of the NJs, which can then be searched and accessed publicly through a Web browser.

Publications/Tools by Dharitri Misra: 
Misra D, Thoma GR. Use of descriptive metadata as a knowledgebase for analyzing data in large textual collections. Proc. IS&T Archiving 2013. Washington D.C. pg 193-199.
Misra D, Hall RH, Payne SM, Thoma GR. Digital preservation and knowledge discovery based on documents from an international health science program. Proc. 12th ACM/IEEE-CS JCDL, pg 23-26 (2012). doi: 10.1145/2232817.2232823.
Chen S, Misra D, Thoma GR. Efficient Automatic OCR Word Validation Using Word Partial Format Derivation and Language Model. Document Recognition and Retrieval XVII. Proceedings of the SPIE. San Jose, CA. January 2010;7534:75340O-75340O-8.
Misra D, Seamans J, Thoma GR. Testing the Scalability of a DSpace-based Archive. Proc. IS&T Archiving 2008. Bern, Switzerland. June 2008:36-40.
Misra D, Mao S, Rees J, Thoma GR. Archiving a Historic Medico-legal Collection: Automation and Workflow Customization. Proc. IS&T Archiving 2007. Arlington, Virginia, May 2007; 157-61.
Thoma GR, Mao S, Misra D, Rees J. Design of a Digital Library for Early 20th Century Medico-legal Documents. Proc. ECDL 2006. Eds: Gonzalo J et al. Berlin: Springer-Verlag; LNCS 4172: 147-57.
Mao S, Misra D, Seamans J, Thoma GR. Design Strategies for a Prototype Electronic Preservation System for Biomedical Documents. IS&T Archiving 2005 Conference, April 2005; 48-53.
Thoma GR, Mao S, Misra D. Automated Metadata Extraction to Preserve the Digital Contents of Biomedical Collections. Proc. VIIP 2005. September 2005. Benidorm, Spain; 214-19.