Final Project
Summary of deliverables
Deadline for all deliverables: May 2, Friday, 11:59pm
Deliverables (everything should be in your GitHub repository):
- 5 minute video presentation (in your GitHub repository)
- GitHub repo with all data and code
- A README.md file explaining what you did for your project (2 paragraphs)
You are to submit your link to your GitHub repository to Canvas.
GitHub
- Create a GitHub account:
- Navigate to https://github.com/.
- Click Sign up.
- Follow the prompts to create your personal account.
- Create a new GitHub repository:
- Click “New repository”: Look for the “+ New repository” button in the upper-right corner of the page and click it.
- Name your repository: Enter a short, memorable name for your repository.
- Add a description (optional): You can add a brief description to explain what the repository is about.
- Choose repository visibility: Decide whether your repository should be public (visible to everyone) or private (only accessible to you and collaborators). If you decide on a private repository, make sure you share it with me (my GitHub handle is
picoral
) - Click “Create repository”: Once you’ve filled in the details, click the “Create repository” button to finalize the process.
- Add your files to your repository:
- Click to open your repository
- Click on
Add File
(top right of your repo file listing), then chooseUpload Files
- Drag the files you want to add to your repo
- Scroll down and click on
Commit changes
If I am unable to access your repository to see your files, your grade will be zero.
Data (data_wrangling.py
)
You will choose your your data source. Here are some options where you can find data:
- https://data.gov/
- https://data.europa.eu/en
- https://data.un.org
- https://data.worldbank.org
- https://www.who.int/data
You should have all of your python files in your GitHub repo, including data wrangling/cleaning scripts.
Visualization (plots.ipynb
)
Create a python notebook with at least two plots to visualize your raw data.
Modeling (modeling.py
)
Create a python script with your modeling code.
Use comments to explain what you did
README (README.md
)
Include in your README file information about your data and variables. What modeling approach you used and why. State your performance metrics, what you used, and what results you got.