📊 Day 5: Comprehensive Exercises
Welcome to the fifth day of the course! Today, we apply all the course topics in a set of comprehensive exercises.
💂️ Exercises
- 🎁 Predicting Santa’s Gift Delivery
- 🍬 Toys production
- ✨ Elf Socks and Sizes
- 🎄 Fulmine and Rudolph’s Favorite Food
- 💵 Scrooge Limited Bias Investigation
- 🎅🏻 Santa’s Belly Size vs Fireplace Size
🎥 Presentation
Explore today’s concepts with the presentation below. You can navigate through the slides at your own pace:
📂 Material and Instructions
🎁 Predicting Santa’s Gift Delivery
Santa wants to predict how many gifts he can deliver based on the hours of travel time available on Christmas Eve. Over the past 100 years, he collected data on:
- Travel Hours (continuous)
- Gifts Delivered (integer, in millions)
As the population increases each year, Santa assumes there are 2 billion children.
Tasks:
- Determine the test to use to predict the number of gifts based on travel time.
- Analyze the data.
- Briefly report the results.
File: Santa_gift.csv
🍬 Toys production
The elves are analyzing the efficiency of their toy production. They want to predict the total production of toys (continuous) based on two factors:
- Number of elves working in the workshop (continuous)
- Average number of candy canes consumed per elf (continuous)
Tasks:
- Determine the test to evaluate the combined effects of these variables on toy production.
- Analyze the data.
- Briefly report the results.
File: Toys_production.csv
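If you decide that a multiple regression with two predictors fits this exercise, the coefficients can be estimated as in the sketch below. It solves the normal equations by hand in plain Python to show what happens under the hood; in practice a statistics package would do this for you and also report standard errors and p-values. The argument names `elves`, `candy`, and `toys` are illustrative and are not the file's column names.

```python
def fit_two_predictors(x1, x2, y):
    """Ordinary least squares for y = b0 + b1*x1 + b2*x2 via the normal equations."""
    rows = [[1.0, a, b] for a, b in zip(x1, x2)]
    # Build X'X (3x3) with X'y appended as a fourth column (augmented matrix).
    aug = [
        [sum(r[i] * r[j] for r in rows) for j in range(3)]
        + [sum(r[i] * t for r, t in zip(rows, y))]
        for i in range(3)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda k: abs(aug[k][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for k in range(3):
            if k != col:
                factor = aug[k][col] / aug[col][col]
                aug[k] = [a - factor * b for a, b in zip(aug[k], aug[col])]
    return [aug[i][3] / aug[i][i] for i in range(3)]


def r_squared(coefs, x1, x2, y):
    """Share of the variance in y explained by the fitted model."""
    mean_y = sum(y) / len(y)
    fitted = [coefs[0] + coefs[1] * a + coefs[2] * b for a, b in zip(x1, x2)]
    ss_res = sum((t - f) ** 2 for t, f in zip(y, fitted))
    ss_tot = sum((t - mean_y) ** 2 for t in y)
    return 1 - ss_res / ss_tot


# Hypothetical usage, once the three columns are loaded as lists of floats:
# b0, b_elves, b_candy = fit_two_predictors(elves, candy, toys)
```

Each slope is read as the expected change in toy production for a one-unit increase in that predictor while the other is held constant.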
✨ Elf Socks and Sizes
Santa wants to ensure all his elves are comfortable and stylish, so he’s analyzing their preferences for sock colors and their sizes. The dataset includes the entire elf population.
Dataset Information:
- Elf_Name: ID of each elf (names anonymized for ethical reasons).
- Sock_Color_Preference: The elf’s preferred sock color (“Stripe,” “Plain Red,” or “Plain White”).
- Elf_Size_cm: The height of the elf in centimeters.
Tasks:
- Analyze Sock Color Preferences:
  - Count the number of elves who prefer each sock color.
  - Visualize the preferences using a bar chart.
- Analyze Elf Sizes:
  - Calculate the average height of elves.
  - Find the tallest and shortest elves.
  - Create a histogram of elf sizes.
- Relate Sock Preferences to Sizes:
  - Compare the average height of elves who prefer each sock color.
File: Socks_Color_Preferences.csv
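The descriptive tasks above could be sketched as follows with the Python standard library. The column names are the ones listed in the dataset description. The text bar chart is only a stand-in for a proper plot, and the histogram is left out; a plotting library would be the usual choice for both.

```python
import csv
from collections import Counter, defaultdict
from statistics import mean


def load_rows(path):
    """Read the CSV into a list of dicts, one per elf."""
    with open(path, newline="") as f:
        return list(csv.DictReader(f))


def summarize_elves(rows):
    """Counts per sock color, overall height summary, and mean height per color."""
    def height(row):
        return float(row["Elf_Size_cm"])

    by_color = defaultdict(list)
    for row in rows:
        by_color[row["Sock_Color_Preference"]].append(height(row))
    return {
        "counts": dict(Counter(r["Sock_Color_Preference"] for r in rows)),
        "mean_height": mean(height(r) for r in rows),
        "tallest": max(rows, key=height)["Elf_Name"],
        "shortest": min(rows, key=height)["Elf_Name"],
        "mean_height_by_color": {c: mean(h) for c, h in by_color.items()},
    }


def text_bar_chart(counts, width=30):
    """Return one line per sock color, with bars scaled to the largest count."""
    largest = max(counts.values())
    return [
        f"{color:<12} {'#' * round(width * n / largest)} {n}"
        for color, n in sorted(counts.items(), key=lambda item: -item[1])
    ]


# Hypothetical usage:
# summary = summarize_elves(load_rows("Socks_Color_Preferences.csv"))
# print("\n".join(text_bar_chart(summary["counts"])))
```

Because the dataset covers the entire elf population, these numbers describe the population directly rather than estimating it from a sample.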
🎄 Fulmine and Rudolph’s Favorite Food
Fulmine and Rudolph are two of Santa’s trusted reindeer, but they seem to have different preferences when it comes to food.
Dataset Information:
- Reindeer: The name of the reindeer (Fulmine or Rudolph).
- Food_Preference: The reindeer’s preferred food:
  - “Carrots”
  - “Apples”
  - “Hay”
Tasks:
- Determine the test to use to analyze whether food preference is independent of the reindeer.
- Analyze the data.
- Briefly report the results.
File: Fulmine_Rudolf_Food.csv
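Assuming you choose a chi-square test of independence and that the file holds one observation per row (both are assumptions, not statements from the course material), the statistic can be computed by hand as in this sketch. With 2 reindeer and 3 foods there are 2 degrees of freedom, and for that case the p-value has the closed form exp(-statistic / 2).

```python
from collections import Counter
from math import exp


def chi_square_independence(pairs):
    """pairs: iterable of (reindeer, food) observations, one per row."""
    pairs = list(pairs)
    observed = Counter(pairs)
    row_totals = Counter(reindeer for reindeer, _ in pairs)
    col_totals = Counter(food for _, food in pairs)
    n = len(pairs)
    stat = 0.0
    for reindeer in row_totals:
        for food in col_totals:
            # Expected count if reindeer and food were independent.
            expected = row_totals[reindeer] * col_totals[food] / n
            stat += (observed[(reindeer, food)] - expected) ** 2 / expected
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, df


def p_value_df2(stat):
    """Chi-square upper-tail probability; this formula is valid only for df = 2."""
    return exp(-stat / 2)


# Hypothetical usage, with rows read via csv.DictReader:
# stat, df = chi_square_independence(
#     (r["Reindeer"], r["Food_Preference"]) for r in rows
# )
```

A small p-value suggests that food preference depends on the reindeer. The test is usually considered reliable only when all expected counts are reasonably large (a common rule of thumb is at least 5).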
💵 Scrooge Limited Bias Investigation
Scrooge Limited suspects that Santa may be biased in distributing gifts across the continents. They want to analyze whether Santa’s distribution matches the expected proportion of children in each continent.
Dataset Information:
- Continent: The continent where the gifts were delivered (Asia, Africa, Europe, North America, South America, Oceania).
- Gifts_Delivered: The count of gifts delivered to children on that continent.
Expected Proportions:
- Asia: 40%
- Africa: 20%
- Europe: 15%
- North America: 10%
- South America: 10%
- Oceania: 5%
Tasks:
- Determine the test to use to evaluate if the distribution aligns with the expected proportions.
- Analyze the data.
- Briefly report the results.
File: Scrooge_Company_Bias.csv
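If a chi-square goodness-of-fit test is the route you take, the expected counts follow directly from the proportions listed above: multiply each proportion by the total number of gifts. A minimal sketch follows. The loader assumes the file may contain several rows per continent and sums them, which is a guess about the file layout.

```python
import csv

# Expected proportions as given in the exercise.
EXPECTED = {
    "Asia": 0.40,
    "Africa": 0.20,
    "Europe": 0.15,
    "North America": 0.10,
    "South America": 0.10,
    "Oceania": 0.05,
}

# Chi-square critical value for 5 degrees of freedom at the 5% level.
CRITICAL_DF5_ALPHA05 = 11.070


def load_counts(path):
    """Sum Gifts_Delivered per continent (assumes possibly repeated continents)."""
    totals = {}
    with open(path, newline="") as f:
        for row in csv.DictReader(f):
            name = row["Continent"]
            totals[name] = totals.get(name, 0.0) + float(row["Gifts_Delivered"])
    return totals


def goodness_of_fit(observed):
    """Chi-square statistic and degrees of freedom against EXPECTED."""
    total = sum(observed.values())
    stat = sum(
        (observed[c] - total * p) ** 2 / (total * p) for c, p in EXPECTED.items()
    )
    return stat, len(EXPECTED) - 1


# Hypothetical usage:
# stat, df = goodness_of_fit(load_counts("Scrooge_Company_Bias.csv"))
# print(stat > CRITICAL_DF5_ALPHA05)
```

A statistic above the critical value would indicate that the observed distribution differs from the expected proportions at the 5% level.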
🎅🏻 Santa’s Belly Size vs Fireplace Size
Santa is concerned that his growing belly might make it harder to fit through chimneys. To prepare for Christmas Eve, he wants to compare the sizes of fireplaces in sampled households with his current belly size (130 cm).
Dataset Information:
- Household: The name or ID of the household.
- Fireplace_Size_cm: The size of the fireplace opening in centimeters.
- Santa_Belly_Size_cm: Santa’s belly size at the moment of the chimney measurement.
Tasks:
- Determine the test to use to analyze whether Santa’s current belly size is significantly larger than the average fireplace size.
- Analyze the data.
- Briefly report the results.
File: Fireplace_vs_Belly.csv
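Since Santa’s belly size is a single fixed value (130 cm), one reasonable reading of this exercise is a one-sample t-test comparing the mean fireplace size against that reference. The sketch below computes the t statistic and degrees of freedom by hand; the p-value must then come from a t table or a statistics package, because the standard library has no t distribution.

```python
import csv
from math import sqrt
from statistics import mean, stdev


def load_sizes(path):
    """Read the Fireplace_Size_cm column as a list of floats."""
    with open(path, newline="") as f:
        return [float(row["Fireplace_Size_cm"]) for row in csv.DictReader(f)]


def one_sample_t(sizes, reference):
    """t statistic and degrees of freedom for H0: mean(sizes) == reference."""
    n = len(sizes)
    standard_error = stdev(sizes) / sqrt(n)  # stdev uses the n - 1 denominator
    t = (mean(sizes) - reference) / standard_error
    return t, n - 1


# Hypothetical usage:
# t, df = one_sample_t(load_sizes("Fireplace_vs_Belly.csv"), reference=130)
```

Because the question is whether the belly is larger than the average fireplace, the alternative hypothesis is one-sided (mean fireplace size below 130 cm), so a strongly negative t supports Santa’s concern.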
Play with ChatGPT:
🎓 I’m Learning Statistics – Want to Play a Game? 🎓
I’ve been diving into the world of basic statistics and have learned some cool stuff! Here’s what I can do so far:
✅ Perform a Chi-square test, t-tests, correlation, and simple/multiple regression.
✅ Understand when to use ANOVA if there are more than two levels of a factor.
✅ Apply ANCOVA when mixing categorical and continuous independent variables.
Now, here’s the fun part: Let’s play a game! 🎮 You provide an example scenario, and I’ll try to guess the correct statistical test or analysis to apply.
💡 Think of something fun, challenging, or tricky – I’m ready to put my skills to the test!
Who’s in? Drop your examples below! 👇