September 9, 2021
In this assignment, we will look at some of the algorithms we covered in the first two weeks of lectures: n-gram language modeling and constituency parsing with the CYK algorithm.
All questions and related instructions can be found at this Google Colab link.
Submission instructions are in the notebook. You will turn in a PDF version of the completed notebook on Gradescope. Remember to:
1. Paste the viewable link in the box provided in the notebook.
2. Select pages for the respective questions on Gradescope.
For this assignment, you are required to work independently; you may not collaborate with other students. You may discuss general concepts and ideas related to the assignment, but you must not discuss actual solutions. The assignment is worth 110 points, plus 10 extra-credit points.
Enroll in the Gradescope class (entry code provided on Piazza) and submit the PDF before 11:59 PM (EDT) on Friday, September 24, 2021. Late submissions are not accepted for this assignment. If you run into technical or other difficulties, please contact the TA/CAs on Piazza.