Mini-hw-1

Due date: Nov 27, 2018.

Objectives: Mini-hw-1 for GraphX on Spark.

Assignment tools: EMR (spark and GraphX) on Amazon Web Services

What to turn in: You will turn command and results of a query using GraphX on Spark. Submit everything as a single pdf or docx file.

How to submit the assignment: In your gitlab repository, you should see a directory called mini-hw1. Put your report in that directory. Remember to git add, git commit, and git push. You can add your report early and keep updating it and pushing it as you do more work. We will collect the final version after the deadline passes. If you need extra time on an assignment, let us know. This is a graduate course, so we are reasonably flexible with deadlines but please do not overuse this flexibility. Use extra time only when you truly need it.

Assignment Details

In this Assignment you will be required to deploy a EMR cluster with Spark and ingest the flights dataset as specified in the section on GraphX.