Analysis of free text in electronic health records for identification of cancer patient trajectories

Kasper Jensen, Cristina Soguero-Ruiz, Karl Oyvind Mikalsen, Rolv-Ole Lindsetmo, Eirini Kouskoumvekaki, Mark A. Girolami, Stein Olav Skrovseth, Knut Magne Augestad

    Research output: Contribution to journalJournal articleResearchpeer-review

    448 Downloads (Pure)


    With an aging patient population and increasing complexity in patient disease trajectories, physicians are often met with complex patient histories from which clinical decisions must be made. Due to the increasing rate of adverse events and hospitals facing financial penalties for readmission, there has never been a greater need to enforce evidence-led medical decision-making using available health care data. In the present work, we studied a cohort of 7,741 patients, of whom 4,080 were diagnosed with cancer, surgically treated at a University Hospital in the years 2004-2012. We have developed a methodology that allows disease trajectories of the cancer patients to be estimated from free text in electronic health records (EHRs). By using these disease trajectories, we predict 80% of patient events ahead in time. By control of confounders from 8326 quantified events, we identified 557 events that constitute high subsequent risks (risk > 20%), including six events for cancer and seven events for metastasis. We believe that the presented methodology and findings could be used to improve clinical decision support and personalize trajectories, thereby decreasing adverse events and optimizing cancer treatment.
    Original languageEnglish
    Article number46226
    JournalScientific Reports
    Number of pages1
    Publication statusPublished - 2017


    Dive into the research topics of 'Analysis of free text in electronic health records for identification of cancer patient trajectories'. Together they form a unique fingerprint.

    Cite this