Electronics Engineer
Phone Number: (
Expertise and Research Interests: 

Daniel X. Le received the BS degree summa cum laude in Electrical and Computer Engineering from California State Polytechnic University, Pomona, in June 1986 and the MS and PhD degrees in Computer Science from George Mason University, Fairfax, Virginia, in January 1993 and February 1997, respectively.

From June 1986 to April 1989, he was a software engineer at the Jet Propulsion Laboratory, Pasadena, California. From April 1989 to September 1990, he was a system engineer at Science Applications International Corporation, McLean, Virginia. Since September 1990, he has been an electronics engineer here at the Lister Hill National Center for Biomedical Communications, the research and development arm of the National Library of Medicine. 

Dr. Le's research interests are in document analysis and understanding, neural networks, optical character recognition, image quality and image processing. Dr. Le holds one US patent on automated portrait/landscape orientation detection in binary document images.

Publications/Tools by Daniel Le: 
Kim J, Le DX, Thoma GR. Naive Bayes and SVM Classifiers For Classifying Databank Accession Number Sentences From Online Biomedical Articles. IS&T/SPIE's 22nd Annual Symposium on Electronic Imaging. San Jose, CA. January 2010;7534:75340U-1 - 8
Zhang X, Zou J, Le DX, Thoma GR. A Stacked Sequential Learning Method For Investigator Name Recognition From Web-based Medical Articles. 17th Document Recognition and Retrieval Conference (SPIE-DR&R). San Jose, CA. January 2010;7534:753404-7
Kim J, Le DX, Thoma GR. Inferring Grant Support Types From Online Biomedical Articles. 22nd IEEE ISCBMS. Albuquerque, NM. August 2009
Zhang X, Zou J, Le DX, Thoma GR. A Semi-supervised Learning Method to Classify Grant Support Zone in Web-based Medical Articles. Proc SPIE Electronic Imaging Science and Technology, Document Recognition and Retrieval. January 2009;7247:7247 OW(1-8)
Kim J, Le DX, Thoma GR. Naive Bayes Classifier for Extracting Bibliographic Information From Biomedical Online Articles. Proc 2008 International Conference on Data Mining. Las Vegas, Nevada, USA. July 2008;II:373-8
Thoma GR, Le DX, Kim I, Kim JW, Moon C, Tran L, Zou J. Automation to Accelerate the Production of MEDLINE. April 2008 Technical Report to the LHNCBC Board of Scientific Counselors.
Kim IC, Le DX, Thoma GR. Hybrid approach combining contextual and statistical information for identifying MEDLINE citation terms. Proc. SPIE-IS/T Electronic Imaging. San Jose, CA. January 2008;6815:68150P(1-9)
Zou J, Le DX, Thoma GR. Extracting a Sparsely-Located Named Entity from Online HTML Medical Articles Using Support Vector Machine. Proc SPIE-IS/T Electronic Imaging. San Jose, CA. January 2008;6815:6815OP(1-10)
Zou J, Le DX, Thoma GR. Online Medical Journal Article Layout Analysis. Proc SPIE-IS&T Electronic Imaging 2007, SPIE Vol. 6500: 65000V (1-12)
Zou J, Le DX, Thoma GR. Structure and Content Analysis for HTML Medical Articles: A Hidden Markov Model Approach. Proc August 2007 ACM Symposium on Document Engineering. pp. 199-201