
Consumer Health Question Answering

Project information

The consumer health question answering project was launched to support NLM customer services, which receive about 90,000 requests a year from a worldwide pool of customers. Customer support staff categorize the requests and either answer them using about 300 stock answers (with or without modifications) or research and answer them manually. Responding to a customer with a stock reply takes approximately 4 minutes; answering with a personalized stock reply takes about 10 minutes. To reduce the time and cost of customer services, NLM launched the Consumer Health Information and Question Answering (CHIQA) project. The CHIQA project conducts research in both the automatic classification of customers’ requests and the automatic answering of consumer health questions.

Analysis of the requests identified subsets of reference questions that could be answered automatically. LHC researchers have developed a customer service support system that categorizes incoming requests and prepares answers for review by the staff who respond to them. The system combines statistical methods with knowledge-based natural language processing techniques. The pilot system was integrated into the customer services workflow in May 2014. As the system matures, it could provide answers immediately to customers while they are visiting NLM Web pages.
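The workflow described above (categorize an incoming request, then prepare a draft answer for staff review) can be illustrated with a minimal sketch. This is not the CHIQA system: it replaces the statistical and knowledge-based methods with plain keyword matching, and the category names, keyword sets, stock answer texts, and function names are all hypothetical, chosen only for illustration.

```python
import re
from collections import Counter

# Hypothetical categories and trigger words; NOT taken from the NLM system.
CATEGORY_KEYWORDS = {
    "drug_information": {"drug", "dose", "medication", "pill", "side", "effects"},
    "disease_information": {"symptoms", "disease", "treatment", "diagnosis"},
    "pubmed_help": {"pubmed", "citation", "search", "article", "journal"},
}

# Hypothetical stock answers, one per category.
STOCK_ANSWERS = {
    "drug_information": "Thank you for your question about a medication...",
    "disease_information": "Thank you for your question about a health condition...",
    "pubmed_help": "Thank you for your question about searching PubMed...",
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def categorize(request):
    """Return the category whose keywords occur most often, or None if none occur."""
    counts = Counter(tokenize(request))
    scores = {
        category: sum(counts[word] for word in keywords)
        for category, keywords in CATEGORY_KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def draft_reply(request):
    """Prepare a draft for staff review; unmatched requests go to manual research."""
    category = categorize(request)
    if category is None:
        return {"category": None, "draft": None, "needs_manual_research": True}
    return {
        "category": category,
        "draft": STOCK_ANSWERS[category],
        "needs_manual_research": False,
    }


if __name__ == "__main__":
    print(draft_reply("What are the side effects of this medication?"))
    print(draft_reply("Hello there"))
```

In this sketch every draft is returned to a reviewer rather than sent directly, mirroring the staff review step in the pilot; a request that matches no category is flagged for manual research.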

Datasets

Question Decomposition Data

Question Type Data

CHQA Named Entity Dataset

Consumer Health Spelling Error Dataset

Publications/Tools: 
Mrabet Y, Kilicoglu H, Roberts K, Demner-Fushman D. Combining Open-domain and Biomedical Knowledge for Topic Recognition in Consumer Health Questions. AMIA Annu Symp Proc. 2017 Feb 10;2016:914-923. eCollection 2016.
Ben Abacha A, Demner-Fushman D. NLM_NIH at SemEval-2017 Task 3: from Question Entailment to Question Similarity for Community Question Answering. SemEval@ACL 2017: 349-352.
Mrabet Y, Kilicoglu H, Roberts K, Demner-Fushman D. Combining Open-domain and Biomedical Knowledge for Topic Recognition in Consumer Health Questions. AMIA 2016 Annual Symposium, Chicago, IL, November 12-16, 2016.
Demner-Fushman D, Elhadad N. Aspiring to Unintended Consequences of Natural Language Processing: A Review of Recent Developments in Clinical and Consumer-Generated Text Processing. IMIA Yearbook of Medical Informatics 2016.
Roberts K, Demner-Fushman D. Interactive use of online health resources: a comparison of consumer and professional questions. J Am Med Inform Assoc. 2016 Jul;23(4):802-11. doi: 10.1093/jamia/ocw024. Epub 2016 May 4.
Kilicoglu H, Ben Abacha A, Mrabet Y, Roberts K, Rodriguez L, Shooshan SE, Demner-Fushman D. Annotating named entities in consumer health questions. LREC, 23-28 May 2016, Portorož.
Demner-Fushman D, Kilicoglu H. Dataset: CHQA Named Entity Dataset
Demner-Fushman D, Roberts K. Dataset: Question Decomposition Data
Demner-Fushman D, Roberts K. Dataset: Question Type Data
Roberts K, Shooshan SE, Rodriguez L, Abhyankar S, Kilicoglu H, Demner-Fushman D. The role of fine-grained annotations in supervised recognition of risk factors for heart disease from EHRs. J Biomed Inform. 2015 Dec;58 Suppl:S111-9. doi: 10.1016/j.jbi.2015.06.010. Epub 2015 Jun 26.
