This publication contains three major utilities: (1) an example web scraper used to pull position descriptions from websites; (2) TextCleanupTools, a set of Java programs written against the Stanford CoreNLP library to analyze and parse unstructured text from position descriptions and course curricula; and (3) a set of R scripts used to plot data points extracted from the analyzed text.
Cite this work
Researchers should cite this work as follows:
- Seliger, C. S. (2018). Text Mining and Plotting Tools for KSA / DS / HEI Research Study. Purdue University Research Repository. doi:10.4231/R7MK6B49