Text-based Disease Classification of Medical Literature

Wednesday, February 11, 2015

4:00 pm - 5:00 pm

Hock Plaza Auditorium

Angela Zoss

Informatics Research Seminar

ABSTRACT: This project explored ways to use natural language terms and phrases to detect broad disease categories in the titles of articles from the PubMed database. An early attempt, the project classifies four million papers written in five different languages over the last 50 years into nine broad disease categories, visualizing the results as flows and streams to explore the changing focus of medical research over time. The visualization received an award as a top student submission to the ACM Web Science 2014 Conference data visualization challenge. Future work will include refining disease detection with more sophisticated text and data mining, as well as developing new visual interfaces to the results.

Contact: Evelyn (Rene) Hart

MORE EVENT INFORMATION

Text-based Disease Classification of Medical Literature

Event Calendar