sparklyr + Spark 2.0 + AWS

Background: The new ‘sparklyr’ package provides a ‘dplyr’ backend for interacting with Spark. It also exposes Spark's ML library, which in effect expands the functionality previously available to R via SparkR. I wanted to see what it would take to get a functional Spark 2.0 cluster to interact with this package. AWS/EC2 Setup: Go…

Set Up a Spark 2.0 Cluster + R in AWS

Background: I have been compiling step-by-step documentation, using help guides, blog posts and insights from previous exercises. Now that Spark 2.0 is out, I figured it was a good opportunity to update my 1.6 documentation and make it available to others. The plan is to leverage a feature in AWS that allows you to…

RStudio Shiny Server in AWS

AWS/EC2 Setup: Step 1 – Amazon Machine Image: Ubuntu. Step 2 – Instance Type: m4.large. Step 3 – No changes. Step 4 – Storage: 30 GiB. Step 5 – No changes. Step 6 – Security Group Name: Shiny. Click Add Rule, select Type: All Traffic | Source: My IP. Optional: If you want to open your Shiny…