Commit c960cccb by Sanjay Krishnan
parents e8fc68ed f9a96d00
......@@ -19,11 +19,11 @@ Git is installed on all of the CSIL computers, and to install git on your machin
[https://git-scm.com/book/en/v2/Getting-Started-Installing-Git]
Every student in the class has a git repository (a place where you can store completed assignments). This git repository can be accessed from:
[https://mit.cs.uchicago.edu/cmsc13600-spr-19/<your cnetid>.git]
[https://mit.cs.uchicago.edu/cmsc13600-spr-20/<your cnetid>.git]
The first thing to do is to open your terminal application, and ``clone`` this repository (NOTE skr is ME, replace it with your CNET id!!!):
```
$ git clone https://mit.cs.uchicago.edu/cmsc13600-spr-19/skr.git cmsc13600-submit
$ git clone https://mit.cs.uchicago.edu/cmsc13600-spr-20/skr.git cmsc13600-submit
```
Your username and password are your CNET id and CNET password. This will create a new, empty folder titled cmsc13600-submit. There is similarly a course repository where all of the homework materials will be stored. You should clone this repository as well:
```
......
# Homework 1. Introduction to Python and File I/O
This homework assignment is meant to be an introduction to Python programming and introduces some basic concepts of encoding and decoding.
Due Date: *Friday April 15, 2020 11:59 pm*
Due Date: *Friday April 17, 2020 11:59 pm*
## Initial Setup
These initial setup instructions assume you've done ``hw0``. Before you start an assignment you should sync your cloned repository with the online one:
......
# Homework 2. Bloom Filter
This homework assignment introduces an advanced use of hashing called a Bloom filter.
Due Date: *Friday April 24, 2020 11:59 pm*
Due Date: *Friday May 1st, 2020 11:59 pm*
## Initial Setup
These initial setup instructions assume you've done ``hw0``. Before you start an assignment you should sync your cloned repository with the online one:
......
......@@ -13,7 +13,7 @@ class Bloom(object):
independent hash functions.
'''
self.array = array.array('B', [0] * 10000)
self.array = array.array('B', [0] * m)
self.hashes = self.generate_hashes(m,k,seed)
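
Sizing the array by `m` instead of a fixed 10000 matters because every hash position is taken modulo the array length. The sketch below is only an illustration of that idea, not the assignment's `Bloom` class: `SketchBloom`, `_positions`, `put`, and `contains` are hypothetical names, and the salted SHA-256 hashing is an assumed stand-in for whatever `generate_hashes` actually returns.

```python
import array
import hashlib


class SketchBloom(object):
    """Illustrative Bloom filter; not the assignment's Bloom class."""

    def __init__(self, m, k, seed=0):
        # One byte per slot, sized by m rather than a hard-coded constant.
        self.m = m
        self.k = k
        self.seed = seed
        self.array = array.array('B', [0] * m)

    def _positions(self, item):
        # Hypothetical stand-in for generate_hashes: derive k slot indices
        # by salting SHA-256 with the seed and the hash index.
        for i in range(self.k):
            data = '{}:{}:{}'.format(self.seed, i, item).encode('utf-8')
            digest = hashlib.sha256(data).digest()
            yield int.from_bytes(digest[:8], 'big') % self.m

    def put(self, item):
        # Set every slot that the k hashes select for this item.
        for pos in self._positions(item):
            self.array[pos] = 1

    def contains(self, item):
        # May report a false positive, never a false negative.
        return all(self.array[pos] == 1 for pos in self._positions(item))
```

With this layout, a filter built with `m = 100` has exactly 100 slots, so a smaller `m` trades memory for a higher false-positive rate.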
......
# Extract-Transform-Load
*Due 5/18/20 11:59 PM*
*Due Friday 5/22/20 11:59 PM*
Extract, transform, load (ETL) is the general procedure of copying data from one or more sources into a destination system which represents the data differently from the source(s). In this project, you will write some of the core primitives in an ETL system.
## Getting Started
......
# Out-of-Core Group By Aggregate
*Due 6/1/19 11:59 PM*
*Due 6/5/19 11:59 PM*
In this assignment, you will implement an out-of-core
version of the group by aggregate (aggregation by key)
seen in lecture. You will have a set memory limit and
you will have to count the number of times a string shows
up in an iterator. Your program should work for any limit
> 20.
less than 20.
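
One possible shape for such a counter is sketched below. It is not the assignment's reference solution. It assumes the memory limit means the maximum number of distinct keys held in memory at once, and that keys contain no newline characters. The names `out_of_core_count`, `_spill`, and `_read_run` are hypothetical. Partial counts are spilled to sorted run files and then combined with a streaming merge.

```python
import heapq
import itertools
import os
import tempfile


def _spill(counts, run_dir, run_id):
    # Write the in-memory partial counts to disk, sorted by key.
    path = os.path.join(run_dir, 'run{}.txt'.format(run_id))
    with open(path, 'w') as f:
        for key in sorted(counts):
            f.write('{}\t{}\n'.format(key, counts[key]))
    return path


def _read_run(path):
    # Stream (key, count) pairs back from one sorted run file.
    with open(path) as f:
        for line in f:
            key, count = line.rstrip('\n').rsplit('\t', 1)
            yield key, int(count)


def out_of_core_count(items, limit):
    """Yield (key, count) pairs, holding at most `limit` keys in memory."""
    with tempfile.TemporaryDirectory() as run_dir:
        counts, runs = {}, []
        for item in items:
            # Spill before a new key would exceed the limit.
            if item not in counts and len(counts) >= limit:
                runs.append(_spill(counts, run_dir, len(runs)))
                counts = {}
            counts[item] = counts.get(item, 0) + 1
        if counts:
            runs.append(_spill(counts, run_dir, len(runs)))
        # heapq.merge keeps only one pending entry per run in memory.
        merged = heapq.merge(*[_read_run(p) for p in runs])
        for key, group in itertools.groupby(merged, key=lambda kv: kv[0]):
            yield key, sum(c for _, c in group)
```

For example, `dict(out_of_core_count(iter(['a', 'b', 'a', 'c']), 2))` combines the spilled runs into a single count per key. The merge step opens one file per run, so a very small limit on a large input would need a multi-pass merge that this sketch omits.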
## Getting Started
First, pull the most recent changes from the cmsc13600-public repository:
......