Advanced natural language processing and temporal mining for clinical discovery

dc.contributor.advisorJones, Josette F.
dc.contributor.authorMehrabi, Saeed
dc.contributor.otherPalakal, Mathew J.
dc.contributor.otherChien, Stanley Yung-Ping
dc.contributor.otherLiu, Xiaowen
dc.contributor.otherSchmidt, C. Max
dc.date.accessioned2016-03-17T17:02:55Z
dc.date.available2016-03-17T17:02:55Z
dc.date.issued2015-08-17
dc.degree.date2016
dc.degree.disciplineSchool of Informatics & Computing
dc.degree.grantorIndiana University
dc.degree.levelPh.D.
dc.descriptionIndiana University-Purdue University Indianapolis (IUPUI)en_US
dc.description.abstractThere has been vast and growing amount of healthcare data especially with the rapid adoption of electronic health records (EHRs) as a result of the HITECH act of 2009. It is estimated that around 80% of the clinical information resides in the unstructured narrative of an EHR. Recently, natural language processing (NLP) techniques have offered opportunities to extract information from unstructured clinical texts needed for various clinical applications. A popular method for enabling secondary uses of EHRs is information or concept extraction, a subtask of NLP that seeks to locate and classify elements within text based on the context. Extraction of clinical concepts without considering the context has many complications, including inaccurate diagnosis of patients and contamination of study cohorts. Identifying the negation status and whether a clinical concept belongs to patients or his family members are two of the challenges faced in context detection. A negation algorithm called Dependency Parser Negation (DEEPEN) has been developed in this research study by taking into account the dependency relationship between negation words and concepts within a sentence using the Stanford Dependency Parser. The study results demonstrate that DEEPEN, can reduce the number of incorrect negation assignment for patients with positive findings, and therefore improve the identification of patients with the target clinical findings in EHRs. Additionally, an NLP system consisting of section segmentation and relation discovery was developed to identify patients' family history. To assess the generalizability of the negation and family history algorithm, data from a different clinical institution was used in both algorithm evaluations.en_US
dc.identifier.doi10.7912/C2DW2W
dc.identifier.urihttps://hdl.handle.net/1805/8895
dc.identifier.urihttp://dx.doi.org/10.7912/C2/953
dc.language.isoen_USen_US
dc.subjectDeep learningen_US
dc.subjectFamily historyen_US
dc.subjectNatural language processingen_US
dc.subjectNegationen_US
dc.subjectPancreatic canceren_US
dc.subjectTemporal pattern discoveryen_US
dc.subject.lcshMedical records -- Data processing
dc.subject.lcshForms management
dc.subject.lcshElectronic records -- Access control
dc.subject.lcshInformation storage and retrieval systems
dc.subject.lcshNatural language processing (Computer science)
dc.subject.lcshComputational linguistics
dc.subject.lcshData mining
dc.titleAdvanced natural language processing and temporal mining for clinical discoveryen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Mehrabi_iupui_0104D_10068.pdf
Size:
7.74 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.88 KB
Format:
Item-specific license agreed upon to submission
Description: