Final Project
Summary of deliverables
Deadline for all deliverables: May 2, Friday, 11:59pm
Deliverables (everything should be in your GitHub repository):
- 5 minute video presentation (in your GitHub repository)
- GitHub repo with all data and code (preferably in .qmd format)
- Rendered html files (also in the GitHub repo)
You are to submit your link to your GitHub repository to Canvas.
GitHub
- Create a GitHub account:
- Navigate to https://github.com/.
- Click Sign up.
- Follow the prompts to create your personal account.
- Create a new GitHub repository:
- Click “New repository”: Look for the “+ New repository” button in the upper-right corner of the page and click it.
- Name your repository: Enter a short, memorable name for your repository.
- Add a description (optional): You can add a brief description to explain what the repository is about.
- Choose repository visibility: Decide whether your repository should be public (visible to everyone) or private (only accessible to you and collaborators). If you decide on a private repository, make sure you share it with me (my GitHub handle is
picoral
) - Click “Create repository”: Once you’ve filled in the details, click the “Create repository” button to finalize the process.
- Add your files to your repository:
- Click to open your repository
- Click on
Add File
(top right of your repo file listing), then chooseUpload Files
- Drag the files you want to add to your repo
- Scroll down and click on
Commit changes
If I am unable to access your repository to see your files, your grade will be zero.
Data
You will choose your your data source. Here are some options where you can find data:
- https://data.gov/
- https://data.europa.eu/en
- https://data.un.org
- https://data.worldbank.org
- https://www.who.int/data
You should create a .qmd file for your final project with your code and your data description. Make sure to describe all variables in our data.
Question
You should answer at least two (2) questions. Formulate those questions and write them out in your .qmd file.
Descriptive Analysis
- Create at least one histogram of one of your numeric variables.
- Create at least one boxplot of one of your numeric variables (add a categorical variable to it)
- Optionally, if you have two numeric variables, create a scatterplot plot of both variables
- Summarize your data – calculate mean, standard deviation, minimum and maximum of at least one numeric variable by a group
Inferential Statistics
- Run two sample t-test, ANOVA, correlation, or linear regression to answer your questions
- Visualize, interpret and write out results
Conclusion
- Summarize what you did for your analysis and your findings in two paragraphs