Commit c960cccb by Sanjay Krishnan
parents e8fc68ed f9a96d00
@@ -19,11 +19,11 @@ Git is installed on all of the CSIL computers, and to install git on your machin
 [https://git-scm.com/book/en/v2/Getting-Started-Installing-Git]
 Every student in the class has a git repository (a place where you can store completed assignments). This git repository can be accessed from:
-[https://mit.cs.uchicago.edu/cmsc13600-spr-19/<your cnetid>.git]
+[https://mit.cs.uchicago.edu/cmsc13600-spr-20/<your cnetid>.git]
 The first thing to do is to open your terminal application, and ``clone`` this repository (NOTE skr is ME, replace it with your CNET id!!!):
 ```
-$ git clone https://mit.cs.uchicago.edu/cmsc13600-spr-19/skr.git cmsc13600-submit
+$ git clone https://mit.cs.uchicago.edu/cmsc13600-spr-20/skr.git cmsc13600-submit
 ```
 Your username and id is your CNET id and CNET password. This will create a new folder that is empty titled cmsc13600-submit. There is similarly a course repository where all of the homework materials will stored. Youshould clone this repository as well:
``` ```
......
 # Homework 1. Introduction to Python and File I/O
 This homework assignment is meant to be an introduction to Python programming and introduces some basic concepts of encoding and decoding.
-Due Date: *Friday April 15, 2020 11:59 pm*
+Due Date: *Friday April 17, 2020 11:59 pm*
 ## Initial Setup
 These initial setup instructions assume you've done ``hw0``. Before you start an assingment you should sync your cloned repository with the online one:
......
 # Homework 2. Bloom Filter
 This homework assignment introduces an advanced use of hashing called a Bloom filter.
-Due Date: *Friday April 24, 2020 11:59 pm*
+Due Date: *Friday May 1st, 2020 11:59 pm*
 ## Initial Setup
 These initial setup instructions assume you've done ``hw0``. Before you start an assingment you should sync your cloned repository with the online one:
......
@@ -13,7 +13,7 @@ class Bloom(object):
 independent hash functions.
 '''
-self.array = array.array('B', [0] * 10000)
+self.array = array.array('B', [0] * m)
 self.hashes = self.generate_hashes(m,k,seed)
......
 # Extract-Transform-Load
-*Due 5/18/20 11:59 PM*
+*Due Friday 5/22/20 11:59 PM*
 Extract, transform, load (ETL) is the general procedure of copying data from one or more sources into a destination system which represents the data differently from the source(s). In this project, you will write some of the core primitives in an ETL system.
 ## Getting Started
......
 # Out-of-Core Group By Aggregate
-*Due 6/1/19 11:59 PM*
+*Due 6/5/19 11:59 PM*
 In this assignment, you will implement an out-of-core
 version of the group by aggregate (aggregation by key)
 seen in lecture. You will have a set memory limit and
 you will have to count the number of times a string shows
 up in an iterator. Your program should work for any limit
-> 20.
+less than 20.
 ## Getting Started
 First, pull the most recent changes from the cmsc13600-public repository:
......