Note that there are separate sets of assignments for CS 451/651 and CS 431. Make sure you work on the correct assignments!
This assignment requires you to compute statistics over tokens in the text of Shakespeare's plays, as you did for Assignment 1. However, instead of performing the analysis purely in Python, you will use Spark (with a Python driver program).
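As a rough illustration of what "Spark with a Python driver program" can look like, here is a minimal sketch of counting tokens with PySpark. It is not the assignment's solution. The tokenizer (`tokenize`), the helper names (`token_counts`, `top_tokens`), and the `path` argument are all hypothetical, and the tokenizer you used for Assignment 1 may behave differently.

```python
import re
from operator import add

# Hypothetical tokenizer (an assumption, not the course's own):
# lowercase words, optionally with one internal apostrophe.
TOKEN_RE = re.compile(r"[a-z]+(?:'[a-z]+)?")


def tokenize(line):
    """Lowercase a line of text and return its word-like tokens."""
    return TOKEN_RE.findall(line.lower())


def token_counts(lines_rdd):
    """Turn an RDD of text lines into an RDD of (token, count) pairs."""
    return (lines_rdd
            .flatMap(tokenize)            # one record per token
            .map(lambda tok: (tok, 1))    # pair each token with a count of 1
            .reduceByKey(add))            # sum the counts per token


def top_tokens(path, n=10):
    """Return the n most frequent tokens in the text file at `path`.

    `path` is a placeholder for wherever the input text lives.
    """
    # Imported here so that tokenize() can be tried out without Spark installed.
    from pyspark.sql import SparkSession

    spark = (SparkSession.builder
             .master("local[*]")          # assumption: local mode, e.g. in Colab
             .appName("token-stats")
             .getOrCreate())
    try:
        lines = spark.sparkContext.textFile(path)
        # Order by descending count and bring only n pairs back to the driver.
        return token_counts(lines).takeOrdered(n, key=lambda kv: -kv[1])
    finally:
        spark.stop()
```

The counting itself happens inside Spark through `flatMap`, `map`, and `reduceByKey`; only the final `n` pairs are returned to the Python driver, which is the main difference from looping over the text in plain Python.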
You will need to upload this file to Google Colab:
When you have finished, submit A2 using the same procedure you used for A1:
That's it! Note that your A2.ipynb should be committed and pushed to origin before the deadline.
Hints for Q5 + Q6: