Skip to Content
SBMI水平徽标

通过EHR和信息学推进癌症药物epidemiology研究beplay苹果手机能用吗

介绍

The data-intensive nature of EHRs creates challenges for adaptation to clinical research including cancer pharmacoepidemiology studies. Much of the detailed information such as drug exposure, cancer characteristics (e.g., diagnoses and treatment outcomes), and confounding factors isembedded in narrative documents and thus not easily accessible for analysis. In addition, differing EHR platforms and standards of clinical data make cross-site algorithm implementation and data aggregation more challenging. Finally, the uneven nature of clinical documentation and quality of EHR data bring additional problems for data analysis, such as selection bias and missing data issues.

在该项目(通过EHR和Informatics进行癌症的药物学研究,授予NO:U24CA194215),我们建议beplay苹果手机能用吗整合和扩展已建立的工具,以建立EHR数据提取,协调,管理和分析的信息学基础设施,以推进癌症药物药物药物学研究。这些项目团队开发了这些用于次要使用EHR数据的信息学方法和软件。它包括自然语言处理(NLP)系统,例如MEDEX,KMCI和CTAKES。EHR数据归一化工具,例如SharpN数据归一化管道和临床数据管理软件,例如REDCAP。

数据集

Datasets being generated in the project will be listed and made available for download once pertinent papers have been published.

软件

To facilitate the adoption of NLP in cancer research, we have developed a number of modules for extracting cancer-related information from pathology reports, such as tumor size, margin, biomarkers etc., within the existing CLAMP toolkit. The system provides high-performance modules with a user-friendly interface through which users can build customized NLP solutions for their specific needs. This system is available for free download to academic researchers at http://clamp.uth.edu/cancer.php#download. In addition, we are now developing a new web-based system that provides user-friendly interfaces for building customized NLP pipelines for cancer information extraction, with the goal to facilitate cancer researchers to adopt NLP technologies for their research.

我们已经开发并发布了大型化学疗法药物和方案的本体,主要基于网站hemonc.org(MPI Warner is Deputy Editor). Beginning in mid-2017, we began the process of deriving a JSON formatted OWL ontology fromhemonc.org内容。具体而言,我们试图建立一个独立的信息模型,以建立抗肿瘤和支持药物,方案及其使用的环境之间的关系。截至2018年5月26日,派生的本体论包含172,490个公理,1400级和25,299个实体。例如,方案实体“ R-Chop”具有4个别名,16个类(例如,未经处理的弥漫性大型B细胞淋巴瘤的诱导疗法),20个成分实体(包括支持药物,中枢神经系统预防症,以及与先前或后续的链接某些R-CHOP协议变体声明的处理)和42个参考实体。

演讲

2018年,北美中央癌症登记协会(NAACCR)年会,华纳J,血液学/肿瘤治疗方案的综合本体。

2018, National Cancer Policy Forum of the National Academies of Medicine workshop on Improving Cancer Diagnosis and Care, Warner J, Genomic standards and knowledge bases for decision support.

2018, ITCR Annual Meeting, Warmer J, Advancing cancer pharmacoepidemiology research through EHRs and informatics.

2018, ITCR Annual Meeting, Wang L, Information Extraction for Populating Lung Cancer Clinical Research Data.

2018年,CI4CC,春季研讨会和研讨会,精密肿瘤学知识网络,Xu H,建造定制的NLP管道,用于使用夹具进行癌症研究beplay苹果手机能用吗

2017, CDC/NCI/FDA/VA Clinical Natural Language Processing Workshop, Xu H, Supporting cancer registries through automated extraction of pathology and chemotherapy regimen information.

2017, AMIA Joint Summits on Translational Science: Xu H, Panel

2017, CI4CC: Warner J, Natural Language Processing, and Visual Analytics to Improve the Efficiency of Registry Operations.

2017年,ITCR PI会议:Xu Hua

2017, AMIA NLP Working Group invited seminar: Xu H, NLP tools for pathology reports processing: CLAMP and MetaMap

CONFERENCE PAPERS

AMIA Annual Symposium 2018. San Francisco, CA: Information Extraction for Populating Lung Cancer Clinical Research Data.

AMIA年度研讨会2018年。加利福尼亚州旧金山:将事实医学知识和分布式单词表示结合起来,以改善临床名称的实体识别。

ICIBM, 2018, Los Angeles, CA: Integrating Shortest Dependency Path and Sentence Sequence into a Deep Learning Framework for Relation Extraction in Clinical Text.

ICIBM,2018年,加利福尼亚州洛杉矶:解析临床文本:多么好 are最先进的基于深度学习的解析器。

2018, ASCO Annual Meeting, Chicago, IL: Genetic differences between primary and metastatic tumors from cross-institutional data.

2017年,加利福尼亚州旧金山AMIA 2017联合峰会:识别转移的肺癌患者病理报告中相关信息

2017年,华盛顿特区AMIA年度研讨会:利用现有的Corpora使用领域适应(已提交)去识别精神病票据(已提交)

出版物列表

Lee H, Zhang Y, Jiang M, Xu J, Tao C, et al. Identifying direct temporal relations between time and events from clinical notes. BMC medical informatics and decision making (Accepted)

Gregg JR, Lang M, Wang LL, Resnick MJ, Jain SK, Warner JL*, Barocas DA*. Automating the determination of prostate cancer risk strata from electronic medical records. JCO Clinical Cancer Informatics. 2017 Jun 8;1:1-8. http://ascopubs.org/doi/full/10.1200/CCI.16.00045. PMCID: PMC5847303.

Malty AM, Jain SK, Yang PC, Harvey K, Warner JL. 计算机化创建血液学/肿瘤学方案的系统本体论的方法。JCO临床癌信息学。2018年5月11日。http://ascopubs.org/doi/full/10.1200/cci.17.00142[PMCID待定 - NIHMS ID 974428]

Soysal E, Wang J, Jiang M, Wu Y, Pakhomov S, Liu H, and Xu H. CLAMP – a toolkit for efficiently building customized clinical natural language processing pipelines. J Am Med Inform Assoc, 2018, 25(3), 331–336.

Huang J, Duan R, Hubbard RA, Wu Y, Moore JH, Xu H, and Chen Y. PIE: A prior knowledge guided integrated likelihood estimation method for bias reduction in association studies using electronic health records data. J Am Med Inform Assoc, 2018, 25(3), 345–350.

Lee HJ, Wu Y, Zhang Y, Xu J, Xu H, Roberts K. A hybrid approach to automatic de-identification of psychiatric notes. J Biomed Inform. 2017 Nov;75S :S19-S27. doi: 10.1016 / j.jbi.2017.06.006。Epub 2017年6月7日。PubMed PMID: 28602904; PubMed Central PMCID: PMC5705430.

Soysal E,Warner JL,Wang J,Jiang,M,Harvey K,Jain SK,Dong X,Song Hy,Siddhanamatha H.,Wang L,Dai Q,Chen Q,Chen Q,Du X,Tao C,Yang P,Denny JC,Denny JC,Liu H,Xu H. Clamp -Cancer-用于开发用于病理报告的定制癌症信息提取系统的软件。癌症研究。beplay苹果手机能用吗2017年(提交)

Baidu