• 1. Institute of Medical Information, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100020, P. R. China;
  • 2. Qingdao Innovation Development Base, Harbin Engineering University, Qingdao 266000, P. R. China;
SUN Haixia, Email: sun.haixia@imicams.ac.cn
Export PDF Favorites Scan Get Citation

Objective The current medical questionnaire resources are mainly processed and organized at the document level, which hampers user access and reuse at the questionnaire item level. This study aims to propose a multi-class classification of items in medical questionnaires in low-resource scenarios, and to support fine-grained organization and provision of medical questionnaires resources. Methods We introduced a novel, BERT-based, prompt learning approach for multi-class classification of items in medical questionnaires. First, we curated a small corpus of lung cancer medical assessment items by collecting relevant clinical assessment questionnaires, extracting function and domain classifications, and manually annotating the items with "function-domain" combination labels. We then employed prompt learning by feeding the customized template into BERT. The masked positions were predicted and filled, followed by mapping the populated text to labels. This process enables the multi-class classification of item texts in medical questionnaires. Results The constructed corpus comprised 347 clinical assessment items for lung cancer, across nine "function-domain" labels. The experimental results indicated that the proposed method achieved an average accuracy of 93% on our self-constructed dataset, outperforming the runner-up GAN-BERT by approximately 6%. Conclusion The proposed method can maintain robust performance while minimizing the cost of building medical questionnaire item corpora, illustrating its promotion value of research and practice in medical questionnaire classification.

Citation: HAO Jie, PENG Qinglong, CONG Shan, LI Jiao, SUN Haixia. A study on multi-class classification of medical questionnaire item texts based on prompt learning. Chinese Journal of Evidence-Based Medicine, 2024, 24(1): 76-82. doi: 10.7507/1672-2531.202307139 Copy

  • Previous Article

    Research on quantitative evaluation of physical-medical integration policy based on PMC index model
  • Next Article

    The application of Bayesian quantile regression in analysis of clinical medicine data and the R Studio practice