In Proceedings of the AMIA 2019 Informatics Summit, San Francisco, CA, USA, 2019.
Despite the recent developments in commercial Question Answering (QA) systems, medical QA remains a challenging task. In this paper, we study the factors behind the complexity of consumer health questions and potential improvement tracks. In particular, we study the impact of information source quality and question conciseness through three experiments. First, an evaluation of a QA method based on a Question-Answer collection created from trusted NIH
resources, which outperformed the best results of the medical LiveQA challenge with an average score of 0.711. Then, an evaluation of the same approach using paraphrases and summaries of the test questions, which achieved an average score of 1.125. Our results provide an empirical evidence supporting the key role of summarization and reliable information sources in building efficient CHQA systems. The latter finding on restricting information sources is particularly intriguing as it contradicts the popular tendency of relying on big data for medical QA.
On the role of question summarization and information source restriction in consumer health question answering. In Proceedings of the AMIA 2019 Informatics Summit, San Francisco, CA, USA, 2019.