Description

So far, you have analyzed data from a variety of sources to solve realistic problems from science, engineering, and business. Now it’s your turn to choose and analyze a real-world problem. This is good practice for how you will use Python for the remainder of your career. You are highly encouraged to work with a partner or group of two others on this project!

  • Dataset Exploration
    • Due: Monday 07/07 at 11:59 pm
    • Synopsis Find at least 2 datasets that you may want to explore for the final project. Answer some questions about each dataset's context and why each dataset is interesting to you.
  • Proposal
    • Due: Monday 07/21 at 11:59 pm
    • Synopsis Come up with research questions and commit to using a particular dataset(s). Describe motivation and any background material, describe the dataset, describe your analysis methodology or the algorithm you will use. This requires submitting a work plan that outlines how you will work on the project for the remainder of the quarter.
  • Exploratory Data Analysis
    • Due: Monday 08/04 at 11:59 pm
    • Synopsis Use code to explore your dataset(s) and give summaries of the data you are working with. This is a preliminary version of the final project deliverables.
  • Deliverables - Report/Code
    • Due: Thursday 08/14 at 11:59 pm
    • Synopsis The main project submission where you implement the rest of the code for your project and write up a report.
  • Project Presentations
    • Due: Sunday 08/17 at 11:59 pm
    • Synopsis You will create a 3-minute video presenting the highlights of your project to your peers and TAs. Make sure to discuss your process at a high level, as well as any results you have.
  • Peer Feedback
    • Due: Friday 08/22 at 11:59 pm
    • Synopsis Reflect on the project and give feedback to peers.

For some example reports and slides, please refer to our past project gallery.

This project is structured a lot like the project in CSE 160 with some specific requirements changed. You may NOT submit the same project that you used in CSE 160 for this class. You may use the same dataset, but we would expect that you have a much more novel and interesting set of research questions if you use the same dataset as you did in a previous quarter.

Project Submission

We will use Gradescope to submit the various parts of the project since it does a really good job with group projects. For each part, only one group member should submit on Gradescope. You should use the Group Members functionality to add the appropriate partner(s) if you have them. If you want to learn about how to add Group Members on Gradescope, please see instructions here.

More information will be announced with the project parts.