EDS 213: Lab 5
Documenting Your Work
This Week’s Goal
You’ve found some datasets, cleaned your data, built a database, written queries, and started analyzing your data.Now make sure someone else (or future you) can understand and reproduce it.
Comment Your Code
Every script and notebook should be self-explanatory. Add comments that explain why, not just what:
A future reader should understand every non-obvious decision without asking you.
Make It Reproducible
Someone should be able to clone your repo and recreate your results (with the exception of needing to download data, but instructions should this should be extremely clear!).That means:
.sql file , cleaning script, and visualization script (both .ipynb or .qmd) are presentThe Dependencies File
Include a .txt file in your repo that lists your environment requirements.
For Python, export from conda or pip:
For R, document your package versions:
Name it requirements.txt or environment.yml.
Repository README
The README is the first thing anyone sees when they visit your repository. It should answer: “What is this, and how do I use it?”
Required sections:
README: What Each Section Should Cover
| Section | What to include |
|---|---|
| Title | Short and descriptive |
| Purpose | Brief explanation of the repo’s goal (paragraphs or bullets) |
| Repository structure | File organization — what’s in each folder/file |
| Data access | Where data lives, how to access it to run the code |
| References | Course, datasets, and any other sources — consistent format with links |
No hidden files (especially no .DS_Store) other than .gitignore.
This Week’s Task
Complete the documentation for your project.
In your code:
In your repository: