copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

The aluminum standard: using generative Artificial Intelligence tools to synthesize and annotate non-structured patient data

J. Diaz Ochoa, F. Mustafa, F. Weil, Y. Wang, K. Kama, and M. Knott. BMC Medical Informatics and Decision Making, 24 (1): 409-- (2024)
DOI: 10.1186/s12911-024-02825-4

Abstract

Medical narratives are fundamental to the correct identification of a patient’s health condition. This is not only because it describes the patient’s situation. It also contains relevant information about the patient’s context and health state evolution. Narratives are usually vague and cannot be categorized easily. On the other hand, once the patient’s situation is correctly identified based on a narrative, it is then possible to map the patient’s situation into precise classification schemas and ontologies that are machine-readable. To this end, language models can be trained to read and extract elements from these narratives. However, the main problem is the lack of data for model identification and model training in languages other than English. First, gold standard annotations are usually not available due to the high level of data protection for patient data. Second, gold standard annotations (if available) are difficult to access. Alternative available data, like MIMIC (Sci Data 3:1, 2016) is written in English and for specific patient conditions like intensive care. Thus, when model training is required for other types of patients, like oncology (and not intensive care), this could lead to bias. To facilitate clinical narrative model training, a method for creating high-quality synthetic narratives is needed.

@joy's tags highlighted

Cite this publication

@article{diazochoa2024aluminum, abstract = {Medical narratives are fundamental to the correct identification of a patient’s health condition. This is not only because it describes the patient’s situation. It also contains relevant information about the patient’s context and health state evolution. Narratives are usually vague and cannot be categorized easily. On the other hand, once the patient’s situation is correctly identified based on a narrative, it is then possible to map the patient’s situation into precise classification schemas and ontologies that are machine-readable. To this end, language models can be trained to read and extract elements from these narratives. However, the main problem is the lack of data for model identification and model training in languages other than English. First, gold standard annotations are usually not available due to the high level of data protection for patient data. Second, gold standard annotations (if available) are difficult to access. Alternative available data, like MIMIC (Sci Data 3:1, 2016) is written in English and for specific patient conditions like intensive care. Thus, when model training is required for other types of patients, like oncology (and not intensive care), this could lead to bias. To facilitate clinical narrative model training, a method for creating high-quality synthetic narratives is needed.}, added-at = {2025-02-19T17:00:10.000+0100}, author = {Diaz Ochoa, Juan G. and Mustafa, Faizan E. and Weil, Felix and Wang, Yi and Kama, Kudret and Knott, Markus}, biburl = {https://puma.ub.uni-stuttgart.de/bibtex/27b4851821087f0aa95f4ba6297a9d4e6/joy}, doi = {10.1186/s12911-024-02825-4}, interhash = {327deee07a985244f4944d3f7d8491f5}, intrahash = {7b4851821087f0aa95f4ba6297a9d4e6}, issn = {14726947}, journal = {BMC Medical Informatics and Decision Making}, keywords = {ki}, number = 1, pages = {409--}, refid = {Diaz Ochoa2024}, timestamp = {2025-02-19T17:00:10.000+0100}, title = {The aluminum standard: using generative Artificial Intelligence tools to synthesize and annotate non-structured patient data}, url = {https://doi.org/10.1186/s12911-024-02825-4}, volume = 24, year = 2024 }

PUMA

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

The aluminum standard: using generative Artificial Intelligence tools to synthesize and annotate non-structured patient data

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

PUMA

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML The aluminum standard: using generative Artificial Intelligence tools to synthesize and annotate non-structured patient data

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

The aluminum standard: using generative Artificial Intelligence tools to synthesize and annotate non-structured patient data

Comments and Reviews
(0)