TMTA tool to guide users in finding information on clinical texts

  1. Aparicio Galisteo, Fernando
  2. Buenaga Rodríguez, Manuel de
  3. Rubio, Margarita
  4. Hernando Jerez, María Asunción
  5. Gachet, Diego
  6. Puertas Sanz, Enrique
  7. Giráldez, Ignacio
Journal:
Procesamiento del lenguaje natural

ISSN: 1135-5948

Year of publication: 2011

Issue: 46

Pages: 27-34

Type: Article

More publications in: Procesamiento del lenguaje natural

Abstract

The large amount of medical information available through the Internet, in both structure and text formats, makes that different types of users will encounter different problems when they have to carry out an effective search. On the one hand, medical students, health staff and researchers in the field of biomedicine have a variety of sources and tools of different characteristics which require a learning period sometimes insurmountable. On the other hand, patients, family members and people outside of the medical profession, face the added problem of not being sufficiently familiarized with medical terminology. In this paper we present a tool that can extract relevant medical concepts present in a clinical text, using techniques for named entity recognition, applied on lists of concepts, and annotation techniques from ontologies. To propose these concepts, our tool makes use of a non formal knowledge source, such as Freebase, and formal resources such as MedlinePlus and PubMed. We argue that the combination of these resources, with information less formal and more plain language (like Freebase), with formal information and more plain language (like Medlineplus) or with formal information and more technical language (such as the Pubmed scientific literature), optimize the process of discover medical information on a complex clinical case to users with different profiles and needs, such as are patients, doctors or researchers. Our ultimate goal is to build a platform to accommodate different techniques facilitating the practice of translational medicine.

Bibliographic References

  • Auer, S., C. Bizer, G. Kobilarov, J. Lehmann, R. Cyganiak y Z. Ives. 2007. DBpedia: A Nucleus for a Web of Open Data. En The Semantic Web, Volume 4825 of Lecture Notes in Computer Science, Chapter 52, 722-735-735. Berlin (Heidelberg).
  • Bellazzi R., C. Larizza, M. Gabetta, G. Milani, A. Nuzzo, V. Favalli y E. Arbustini. 2010. Translational Bioinformatics: Challenges and Opportunities for Case-Based Reasoning and Decision Support. LNCS: Case-Based Reasoning, Vol. 6176, pp. 1-11.
  • Bizer, C., J. Lehmann, G. Kobilarov, S. Auer, C. Becker, R. Cyganiak y S. Hellmann. 2009. DBpedia - A crystallization point for the Web of Data. Web Semantics: Science, Services and Agents on the World Wide Web 7(3):154-165.
  • Bollacker, K., C. Evans, P. Paritosh, T. Sturge y J. Taylor. 2008. Freebase: a collaboratively created graph database for structuring human knowledge. En Proceedings of the international conference on Management of data, ACM SIGMOD, pp. 1247-1250. Vancouver (Canada).
  • Brenes, D. J., D. G. Avello y K. P. González. 2009. Survey and evaluation of query intent detection methods. En Proceedings of the workshop on Web Search Click Data, pp. 1- 7. Barcelona (Spain).
  • Clifford, G., D. J. Scott y M. Villarroel. 2010. User Guide and Documentation for the MIMIC II Database, Rev: 259. Cambridge, MA, USA.
  • Dowell, K. G., M. S. McAndrews-Hill, D. P. Hill, H. J. Drabkin y J. A. Blake. 2009. Integrating text mining into the MGI biocuration workflow. Database 2009(0):bap019+.
  • Han, X. y J. Zhao. 2009. CASIANED: Web Personal Name Disambiguation Based on Professional Categorization. En Proceedings of 2nd Web People Search Evaluation Workshop, WePS2, Madrid (Spain).
  • Hearst M. A. 2008. UIs for Faceted Navigation: Recent Advances and Remaining Open Problems. En Proceedings of the Second Workshop on Human-Computer Interaction and Information Retrieval, pp. 13-17, Redmond
  • Jonquet, C., N. Shah y M. Musen. 2009. The Open Biomedical Annotator. En AMIA Summit on Translational Bioinformatics, pp. 56-60, San Francisco (CA).
  • Jonquet, C., M. A. Musen y N. H. Shah. 2010. Building a biomedical ontology recommender web service. Biomedical semantics, 1(1):S1.
  • Luo, G. y C. Tang. 2008. On iterative intelligent medical search. En Proceedings of the 31st annual international conference on Research and development in information retrieval, ACM SIGIR, pp. 3-10, Singapore.
  • Meyer S., J. Degener, J. Giannandrea y B. Michener. 2010. Optimizing schema-last tuple-store queries in graphd. En Proceedings of the International Conference on Management of Data, ACM SIGMOD, pp. 1047-1056, Indianapolis (Indiana).
  • Mirhaji, P., M. Zhu, M. Vagnoni, E. Bernstam, J. Zhang y J. Smith. 2009. Ontology driven integration platform for clinical and translational research. BMC Bioinformatics 10(Suppl 2):S2+.
  • Musen, M., N. Shah, N. Noy, B. Dai, M. Dorf, N. Griffith, J. D. Buntrock, C. Jonquet, M. J. Montegut y D. L. Rubin. 2008. BioPortal: Ontologies and Data Resources with the Click of a Mouse. En Annual Symposium proceedings, AMIA, pp. 1223-1224.
  • Noy, N. F., N. H. Shah, P. L. Whetzel, B. Dai, M. Dorf, N. Griffith, C. Jonquet, D. L. Rubin, M. A. Storey, C. G. Chute y M. A. Musen. 2009. BioPortal: ontologies and integrated data resources at the click of a mouse. Nucleic acids research 37 (Web Server issue): W170-173.
  • Ruiz-Rico, F., J. Vicedo y M. Rubio-Sánchez. 2008. Multilingual assistant for medical diagnosing and drug prescription based on category ranking. En 22nd International Conference on Computational Linguistics: Demonstration Papers, ACL COLING, pp. 169-172, Morristown (NJ).
  • Ruttenberg, A., J. A. Rees, M. Samwald y M. S. Marshall. 2009. Life sciences on the Semantic Web: the Neurocommons and beyond. Briefings in bioinformatics 10(2):193-204.
  • Sarkar, I. 2010. Biomedical informatics and translational medicine. Translational medicine 8(1):22+.
  • Shariff, S., M. Cuerden, R. B. Haynes, K. A. McKibbon, N. Wilczynski, A. Iansavichus, M. Speechley, A. Thind y A. Garg. 2010. Evaluating the impact of MEDLINE filters on evidence retrieval: study protocol. Implementation Science 5(1):58+.
  • Trivedi, M. 2009. A study of search engines for health sciences. Library and Information Science, 1(5): 069-073
  • Twigger, S., J. Geiger y J. Smith. 2009. Using the NCBO Web Services for Concept Recognition and Ontology Annotation of Expression Datasets. En Proceedings of the Workshop on Semantic Web Applications and Tools for Life Sciences, SWAT4LS, Vol- 559.
  • Watson, M., A. Smith y S. Watter. 2005. Leximancer concept mapping of patient case studies. En LNCS: Part III. Knowledgebased intelligent information and engineering systems 9th international conference, pp. 1232-1238, Berlin (Heidelberg).
  • Weichselbraun, A. 2009. A Utility Centered Approach for Evaluating and Optimizing Geo-Tagging. En First International Conference on Knowledge Discovery and Information Retrieval, KDIR, pp. 134-139, Madeira (Portugal).
  • Woolf, S. H. 2008. The Meaning of Translational Research and Why It Matters. JAMA 299(2):211-213.