RDataMining.com: R and Data Mining

    Big Data Resources

    Spark

    • Getting Started with Apache Spark, a free ebook
    • SparkR: Scaling R Programs with Spark, a paper published at SIGMOD 2016
    • Mastering Advanced Analytics with Apache Spark

    Pig

    • Pig vs. MapReduce: When, Why, and How

    RHadoop

    • RHadoop Installation Guide for Red Hat Enterprise Linux

    Graph and Network Analysis

    • Presentations and talks on Apache Giraph, an iterative graph processing system built for high scalability
    • Serious network analysis using Hadoop and Neo4j
    • I Mapreduced a Neo store: Creating large Neo4j Databases with Hadoop


    ©2011-2020 Yanchang Zhao.       Contact: yanchang(at)rdatamining.com
