Natural Language-Assisted Multi-modal Medication Recommendation

Jie Tan1, Yu Rong*2, Kangfei Zhao*3, Tian Bian1,
Tingyang Xu2, Junzhou Huang4, Hong Cheng1, Helen Meng1
CIKM'24, October 21–25, 2024, Boise, ID, USA
*Co-corresponding authors

1The Chinese University of Hong Kong

2DAMO Academy, Alibaba Group

3Beijing Institute of Technology

4University of Texas at Arlington

Abstract

Combinatorial medication recommendation (CMR) is a fundamental task of healthcare, which offers opportunities for clinical physicians to provide more precise prescriptions for patients with intricate health conditions, particularly in the scenarios of long-term medical care. Previous research efforts have sought to extract meaningful information from electronic health records (EHRs) to facilitate combinatorial medication recommendations. Existing learning-based approaches further consider the chemical structures of medications, but ignore the textual medication descriptions in which the functionalities are clearly described. Furthermore, the textual knowledge derived from the EHRs of patients remains largely underutilized. To address these issues, we introduce the Natural Language-Assisted Multi-modal Medication Recommendation (NLAMMR), a multi-modal alignment framework designed to learn knowledge from the patient view and medication view jointly. Specifically, NLAMMR formulates CMR as an alignment problem from patient and medication modalities. In this vein, we employ pretrained language models (PLMs) to extract in-domain knowledge regarding patients and medications, serving as the foundational representation for both modalities. In the medication modality, we exploit both chemical structures and textual descriptions to create medication representations. In the patient modality, we generate the patient representations based on textual descriptions of diagnosis, procedure, and symptom. Extensive experiments conducted on three publicly accessible datasets demonstrate that NLAMMR achieves new state-of-the-art performance, with a notable average improvement of 4.72% in Jaccard score. Our source code is publicly available on https://github.com/jtan1102/NLA-MMR_CIKM_2024.

Problem Statement

Framework Image

Illustration of combinatorial medication recommendation with help of knowledge extraction from PLMs.

Model Overview

Framework Image

The architecture of NLA-MMR. NLA-MMR is composed of three modules: (a) Patient Representation Module employs PLMs as the base encoder to construct the representation of patient modality from the diagnosis, procedure, and symptom. (b) Medication Representation Module incorporates the embedding derived from textual medication descriptions and chemical structures to build the representation of medication modality. (c) Cross-Modal Alignment Module takes the representation from patient and medication modality as input and aligns them in the same latent space. We further consider historical medication usage information to model the patient’s clinical history, which can enhance the representation of patient modality.

BibTeX

@inproceedings{DBLP:conf/cikm/TanRZBXH0M24,
        author       = {Jie Tan and
                        Yu Rong and
                        Kangfei Zhao and
                        Tian Bian and
                        Tingyang Xu and
                        Junzhou Huang and
                        Hong Cheng and
                        Helen Meng},
        editor       = {Edoardo Serra and
                        Francesca Spezzano},
        title        = {Natural Language-Assisted Multi-modal Medication Recommendation},
        booktitle    = {Proceedings of the 33rd {ACM} International Conference on Information
                        and Knowledge Management, {CIKM} 2024, Boise, ID, USA, October 21-25,
                        2024},
        pages        = {2200--2209},
        publisher    = {{ACM}},
        year         = {2024},
        url          = {https://doi.org/10.1145/3627673.3679529},
        doi          = {10.1145/3627673.3679529},
        timestamp    = {Sat, 30 Nov 2024 21:10:26 +0100},
        biburl       = {https://dblp.org/rec/conf/cikm/TanRZBXH0M24.bib},
        bibsource    = {dblp computer science bibliography, https://dblp.org}
      }