Advanced natural language processing and temporal mining for clinical discovery

Mehrabi, Saeed

Advanced natural language processing and temporal mining for clinical discovery

dc.contributor.advisor	Jones, Josette F.
dc.contributor.author	Mehrabi, Saeed
dc.contributor.other	Palakal, Mathew J.
dc.contributor.other	Chien, Stanley Yung-Ping
dc.contributor.other	Liu, Xiaowen
dc.contributor.other	Schmidt, C. Max
dc.date.accessioned	2016-03-17T17:02:55Z
dc.date.available	2016-03-17T17:02:55Z
dc.date.issued	2015-08-17
dc.degree.date	2016
dc.degree.discipline	School of Informatics & Computing
dc.degree.grantor	Indiana University
dc.degree.level	Ph.D.
dc.description	Indiana University-Purdue University Indianapolis (IUPUI)	en_US
dc.description.abstract	There has been vast and growing amount of healthcare data especially with the rapid adoption of electronic health records (EHRs) as a result of the HITECH act of 2009. It is estimated that around 80% of the clinical information resides in the unstructured narrative of an EHR. Recently, natural language processing (NLP) techniques have offered opportunities to extract information from unstructured clinical texts needed for various clinical applications. A popular method for enabling secondary uses of EHRs is information or concept extraction, a subtask of NLP that seeks to locate and classify elements within text based on the context. Extraction of clinical concepts without considering the context has many complications, including inaccurate diagnosis of patients and contamination of study cohorts. Identifying the negation status and whether a clinical concept belongs to patients or his family members are two of the challenges faced in context detection. A negation algorithm called Dependency Parser Negation (DEEPEN) has been developed in this research study by taking into account the dependency relationship between negation words and concepts within a sentence using the Stanford Dependency Parser. The study results demonstrate that DEEPEN, can reduce the number of incorrect negation assignment for patients with positive findings, and therefore improve the identification of patients with the target clinical findings in EHRs. Additionally, an NLP system consisting of section segmentation and relation discovery was developed to identify patients' family history. To assess the generalizability of the negation and family history algorithm, data from a different clinical institution was used in both algorithm evaluations.	en_US
dc.identifier.doi	10.7912/C2DW2W
dc.identifier.uri	https://hdl.handle.net/1805/8895
dc.identifier.uri	http://dx.doi.org/10.7912/C2/953
dc.language.iso	en_US	en_US
dc.subject	Deep learning	en_US
dc.subject	Family history	en_US
dc.subject	Natural language processing	en_US
dc.subject	Negation	en_US
dc.subject	Pancreatic cancer	en_US
dc.subject	Temporal pattern discovery	en_US
dc.subject.lcsh	Medical records -- Data processing
dc.subject.lcsh	Forms management
dc.subject.lcsh	Electronic records -- Access control
dc.subject.lcsh	Information storage and retrieval systems
dc.subject.lcsh	Natural language processing (Computer science)
dc.subject.lcsh	Computational linguistics
dc.subject.lcsh	Data mining
dc.title	Advanced natural language processing and temporal mining for clinical discovery	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Mehrabi_iupui_0104D_10068.pdf
Size:: 7.74 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.88 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Informatics School Theses and Dissertations
Informatics Graduate Theses and PhD Dissertations