CSEP551 -- Programming Assignment #2

Out: Tuesday, February 7th, 2012
Due: Monday, February 27th, 2012, by 11:45pm

[ overview | server requirements | code requirements | threaded server | event-driven server | what to turn in ]

Overview

In this assignment, you will implement two versions of a simple web server, using C or C++ as your programming langauge. In the first version, you will use threads (via the pthreads interface) to handle requests concurrently. In the second version, you will use a single thread and event-driven asynchronous I/O to handle requests concurrently. The goal of the assignment is to let you explore these two different styles of concurrent programmiung, and to give you some exposure to the nuts and bolts of network programming and the web.

Server Requirements

Obviously, implementing a fully-featured web server that is capable of handling all aspects of modern Web protocols would be completely unrealistic for a two-week assignment. Instead, we want you to focus on a relatively small subset of features. Your server should support all of the following, for both the threaded and event-driven versions.

Concurrent requests: most importantly, your server should be able to handle and process HTTP requests concurrently. In other words, it should be possible for multiple clients (or even a single client) to open several HTTP requests to your server simultaneously, and your server should make progress on reading the requests from the network, generating responses, and writing the responses back to the clients concurrently. Said differently, the server should not serialize requests: if one request progresses slowly (e.g., the client is on a slow network connection), the other requests should not block behind that slow request.

Simple HTTP/1.1 support: the HTTP specification is complex and large. You only need to handle a very simple subset of it. Specifically, you need to be able to handle GET requests. You do not need to handle persistent connections, pipelined requests, chunked transfers, or other advanced HTTP/1.1 features. You will need to write a simple parser to parse HTTP requests and extract the requested URL from them, but that should be pretty easy to do. You will also need to figure out how to generate a well-formatted HTTP response. So, to do these two things, you should read up on the HTTP protocol enough to determine how to do this. I'd recommend you experiment with real browsers and real servers: figure out how to use a program like netcat (nc on linux) to open a listening socket and direct a browser to it, in order to see what real browser requests look like. Also figure out how to use telnet to connect to a real web server (e.g., www.cs.washington.edu) to figure out what a real server response looks like. Then, mimic that.

Static, in-memory files: instead of reading requested content from files in the file-system, you should instead have a set of static HTML pages hard-coded into your program. When a client requests one of them, you should return that static content. If a client requests some other page, you should return an HTTP error indicating the page could not be found. The specific page URLs and content you should support are:

Command-line specifiable port: typically a web server runs on port 80, but there is nothing requiring this. Your web server should take a single command-line argument, which is the port number on which your server listens. Make sure this port is listening on the external IP address of the machine, rather than the loop-back IP address. (i.e., when we run your server, it should be possible to connect to the server from a browser running different machine.)

Code Requirements

For this assignment, you will implement the two servers largely from scratch in either C or C++. Here are some guidelines and requirements about your code:

Threaded Server

For the threaded server, you should implement your code to have a main thread that first creates the listening socket for your server, and then permanently enters a loop, where it:

Each dispatched thread should read in a request from the new network connection, generate a response, write the response, and then close the network connection to the client. The thread should also handle the case where the request is poorly formatted, or the connection to the client drops unexpectedly. Be sure not to leak memory or file descriptors.

Event driven server

Your event-driven server should have a single thread that processes events in an event loop. The kinds of events this thread must detect and handle include:

You will need to learn how to set a socket to be non-blocking, so that your accept, read, write, and close system calls don't block your single thread. As well, you should learn about the poll system call, and use that to register interest in events and receive those events.

Writing an event-driven server can be confusing. You essentially need to invert the control flow that you're used to with a threaded server. Instead of having a thread sequentially read, generate, and write to handle a connection, your single thread needs to juggle all of the connections simultaneously, processing data when possible on each as events arrive. You should probably think about creating a simple state machine for each connection to keep track of it, and then transition from state to state (READING, WRITING) when handling events.

What to turn in

You should use the following catalyst dropbox to submit your assignment. You should submit to the "assignment #2" box a single .tar.gz archive that contains the following:

As previously mentioned, before you submit your code, please copy it to attu1.cs.washington.edu and make sure that you're able to compile, run, and successfully exercise both of your servers on that machine, since we'll be doing our testing and grading on it.