By the end of this project, you will understand the concept of document similarity in text analysis with R. You will know how to load and pre-process a data set of text documents by converting it into a corpus and a document-feature matrix. You will also know how to calculate the cosine similarity between documents, and how to explore and plot the results of that calculation. This project is aimed at beginners who have a basic familiarity with the statistical programming language R and the RStudio environment, and at people with a small amount of experience who would like to learn how to calculate the similarity between documents in text analysis.