This contains a curated list of R tutorials and packages for Data Science, NLP and Machine Learning. This also serves as a reference guide for several common data analysis tasks.
Package ‘tidyr’: tidyr is an evolution of reshape2. It’s design specifically for data tidying (not general reshaping or aggregating) and works well with dplyr data pipelines.
Package ‘stringr’: stringr is a set of simple wrappers that make R’s string functions more consistent, simpler and easier to use.
Package ‘stringdist’: Implements an approximate string matching version of R’s native ‘match’ function. Can calculate various string distances based on edits (damerau-levenshtein, hamming, levenshtein, optimal sting alignment), qgrams or heuristic metrics
Package ‘FSelector’: This package provides functions for selecting attributes from a given dataset