Big Data Platforms
Below is a list of big data platforms and their interfaces with R.
Hadoop (or YARN) - a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models
H2O - an open source in-memory prediction engine for big data science
Algorithms provided in H2O:
PCA, GBM, deep learning, random forest, BigData RF, GLM, k-means, Naive Bayes, anomaly detection