$24
For this assignment, we will focus on analyzing big text with scalable tools and building an end-to-end system. There are some more micro-directed tasks at the beginning of the assignment and then it gradually becomes more free. You get to decide what is important and try out your hypotheses.
As always, you should first open Jupyter and a Terminal, then clone the Homework 6 Bitbucket repository:
git clone https://upenn-cis@bitbucket.org/pennbigdataanalytics/hw56.git
Then go into the hw56 directory in Jupyter and complete the Homework5-6.ipynb notebook.
4.0 Submitting Homework 5-6
Please sanity-check that your Jupyter notebook runs completely and contains all of your work. Add the notebook files to hw56.zip using the zip command at the Terminal, much as you did for HW0 and HW1.
You only need to submit one file inside the zip.
Homework5-6.ipynb
Next, go to the submission site, and if necessary click on the Google icon and log in using your Google@SEAS or GMail account. At this point the system should know you are in the appropriate course. Select CIS 545 Homework 5-6 and upload hw56.zip from your Jupyter folder, typically found under /Users/{myid}.
If you check on the submission site after a few minutes, you should see whether your submission passed validation. You may resubmit as necessary.
For those of you going for the extra credit, we are offering some limited capabilities to check the standing of your system. This will have to be done manually through the TAs and is not required in any way!