Medical Concept Embeddings via Labeled Background Corpora

This entry contains the resources used in and resulting from: Eneldo Loza Mencía, Gerard de Melo and Jinseok Nam, Medical Concept Embeddings via Labeled Background Corpora, in: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), 2016.

In recent years, interest in low-dimensional vector representations of words has grown steadily. Among other things, these representations facilitate computing word similarity and relatedness scores. The best-known algorithms for producing such representations are the word2vec approaches. In this paper, we investigate a new model to induce such vector spaces for medical concepts, based on a joint objective that exploits not only word co-occurrences but also manually labeled documents, as available from sources such as PubMed. Our extensive experimental analysis shows that our embeddings lead to significantly higher correlations with human similarity and relatedness assessments than previous work. Given the simplicity and versatility of vector representations, these findings suggest that our resource can easily be used as a drop-in replacement in any system that relies on medical concept similarity measures.
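As an illustration of how embeddings of this kind are commonly used to score concept similarity, the following minimal sketch computes cosine similarity between concept vectors. It is not the authors' code: the three-dimensional vectors, the names `toy_embeddings`, `cosine_similarity` and `most_similar`, and the choice of cosine as the similarity measure are assumptions made for illustration, and the file format of the resources in this entry is not described here.

```python
import math


def cosine_similarity(u, v):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same dimensionality")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_u * norm_v)


# Hypothetical toy vectors keyed by concept name. These values are made up
# for illustration and are NOT taken from the published embeddings.
toy_embeddings = {
    "Hypertension": [0.9, 0.1, 0.3],
    "Blood Pressure": [0.8, 0.2, 0.4],
    "Fractures, Bone": [0.1, 0.9, 0.2],
}


def most_similar(concept, embeddings):
    """Rank all other concepts by cosine similarity to `concept`, best first."""
    query = embeddings[concept]
    scores = [
        (other, cosine_similarity(query, vector))
        for other, vector in embeddings.items()
        if other != concept
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for name, score in most_similar("Hypertension", toy_embeddings):
        print(f"{name}: {score:.3f}")
```

A system that already ranks concepts with some other similarity function could, under these assumptions, substitute `cosine_similarity` over the embedding vectors without changing the rest of its pipeline.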

Keywords

Embeddings, Medical Concepts, Semantic Similarity, MeSH

Identifier

https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/2936

Related Resources

Is Supplement To

ISBN/978-2-9517408-9-1

Is Version Of

https://www.ke.tu-darmstadt.de/resources/medsim

DFG Classification

1.14-03 Applied Linguistics, Computational Linguistics
4.43-04 Artificial Intelligence and Machine Learning Methods
4.43-05 Image and Language Processing, Computer Graphics and Visualisation, Human Computer Interaction, Ubiquitous and Wearable Computing

License

Except where otherwise noted, this item's license is described as In Copyright
