Text mining techniques can be applied to various data sources (e.g., newspaper articles, emails, online discussion posts, etc.) to efficiently extract useful data for different research purposes. For example, health science researchers may be interested in investigating a frequency of a particular disease name mentioned in a large set of newspaper articles. Educational researchers, on the other side, may wish to extract and categorize students' opinions from discussion forum in a high enrollment course. R offers a comprehensive set of functionalities for text mining. In this workshop, you will learn how to implement basic methods for preprocessing textual data, metadata management, a creation of term-document matrices over the collection of textual documents, sentiment analysis, text tokenization, word relationship extraction and text visualization.

Requirements:

  • Participants will need to have R and RStudio installed on their device prior to attending the workshop
  • Familiarity with R and the RStudio environment including an understanding of basic functionality such as object assignment, data structures, and running scripts 

Upcoming workshops

No upcoming workshops available.