
Class Project Details

  1. Project Background
  2. The Modeling Problem
  3. Data Download
  4. Data Description

Project Background

Public schools in the United States face large disparities in funding, and teachers and staff members often fill the gaps by purchasing classroom supplies out of their own pockets. DonorsChoose is an online crowdfunding platform that tries to ease this financial burden by letting teachers seek funding for projects and resources from the community. Projects can include classroom basics like books and markers, larger items like lab equipment or musical instruments, or specific experiences like field trips or guest speakers.

Projects on DonorsChoose expire after 4 months, and if the target funding level isn't reached, the project receives no funding. Since its launch in 2000, the platform has helped fund over 2 million projects at schools across the US, but about 1/3 of the projects that are posted nevertheless fail to meet their goal and go unfunded.

The Modeling Problem

For the purposes of the class project, DonorsChoose has hired a digital content expert who will review projects and help teachers improve their postings and increase their chances of reaching their funding threshold. Because this individualized review is a labor-intensive process, the digital content expert has time to review and support only 10% of the projects posted to the platform on a given day.

You are working with DonorsChoose, and your task is to help this content expert focus their limited resources on projects that most need the help. As such, you want to build a model to identify projects that are least likely to be fully funded before they expire and pass them off to the digital content expert for review.
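To make the 10% constraint concrete, here is a minimal sketch of how model scores might be turned into a daily review list. It assumes a hypothetical model has already assigned each project a risk score (the estimated probability of not being fully funded before expiring). The function name, the tuple layout, and the choice to round the daily capacity down are illustrative assumptions, not part of the project specification.

```python
import math
from collections import defaultdict


def select_for_review(scored_projects, review_fraction=0.10):
    """Pick, for each posting day, the projects with the highest predicted risk.

    scored_projects: iterable of (project_id, date_posted, risk_score) tuples.
    risk_score is assumed to come from a hypothetical model and to mean
    "estimated probability that the project is NOT fully funded in time".
    """
    by_day = defaultdict(list)
    for project_id, date_posted, risk_score in scored_projects:
        by_day[date_posted].append((risk_score, project_id))

    selected = {}
    for day, items in by_day.items():
        # Daily capacity: 10% of that day's postings, rounded down (an assumption).
        k = math.floor(len(items) * review_fraction)
        # Highest risk first; ties are broken by project id so results are repeatable.
        ranked = sorted(items, key=lambda item: (-item[0], item[1]))
        selected[day] = [project_id for _, project_id in ranked[:k]]
    return selected
```

Because the expert's capacity is fixed, a natural way to compare models under this setup is the fraction of selected projects that really did go unfunded (precision among the top 10%), rather than overall accuracy.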

The data is in four different tables/csv files that you need to download. You can put it into a database (good life practice) or live in csv world (at your own risk).
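If you go the database route, one low-effort option is SQLite via Python's standard library. The sketch below is only a starting point: it assumes each CSV file has a header row, stores every column as TEXT (you would cast types later), and uses a hypothetical helper name, `load_csv_into_sqlite`. The table name is passed in by you and is interpolated into the SQL, so only use trusted names.

```python
import csv
import sqlite3


def load_csv_into_sqlite(conn, csv_path, table_name):
    """Load one CSV file (with a header row) into a SQLite table of TEXT columns.

    Returns the number of rows inserted.
    """
    with open(csv_path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader)  # first row is assumed to hold the column names
        columns = ", ".join('"%s" TEXT' % name for name in header)
        placeholders = ", ".join("?" for _ in header)
        conn.execute('CREATE TABLE IF NOT EXISTS "%s" (%s)' % (table_name, columns))
        # executemany consumes the remaining rows straight from the CSV reader.
        cursor = conn.executemany(
            'INSERT INTO "%s" VALUES (%s)' % (table_name, placeholders), reader
        )
    conn.commit()
    return cursor.rowcount
```

Usage would look something like `conn = sqlite3.connect("donorschoose.db")` followed by one call per downloaded file; the database filename here is just an example.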

Dataset Download Link

Dataset Description

For this project, you have access to data from DonorsChoose about projects, teachers, schools, and donations. In the dataset, you'll find four tables: projects.csv, donations.csv, essays.csv, and resources.csv.

A very rough data dictionary of the specific fields in each table is provided below:

Data fields

Below is a brief explanation of the provided data fields. Descriptions of self-explanatory names are omitted.

projects.csv

donations.csv

essays.csv

resources.csv