Titanic Activity

GOAL: Prepare for our visit to the Titanic Museum

Complete the following in groups of 2-3 people. Complete in a group Google Doc and have one student upload the final version as a pdf to Moodle. Please include all group member’s names.

You will use Minitab online to complete the analysis part of this activity. You can get to the online app by clicking the link: https://app.minitab.com/. The titanic data (csv) can be found HERE.You will need to download the file and then upload it into Minitab.

We use this data in multiple statistics classes at Cornell. You are not expected to know the data well before this but that is why it is of interest to students. How often do you get to visit a museum, with database terminals, on the data you have completed analysis on?

Data Description

The Titanic was a British luxury oceanliner that sank famously in the icy North Atlantic Ocean on its maiden voyage in April 1912. Of the approximately 2200 passengers on board, 1500 died. The high death rate was blamed largely on the inadequate supply of lifeboats, a result of the manufacturer’s claim that the ship was “unsinkable.” A partial dataset of the passenger list was compiled by Philip Hinde in his Encyclopedia Titanica and is given in the datafile Titanic. (Note: the data has miss-labeled gender as sex).

Two questions of interest are the relationship between survival and age and the relationship between survival and sex. The following variables will be useful for your work on the following questions:

Variable Description
Age which gives the passenger’s age in years
Sex which gives the passenger’s sex (male or female)
Survived a binary variable, where 1 indicates the passenger survived and 0 indicates death
SexCode which numerically codes male as 0 and female as 1
Name passenger name
PClass passenger class

Questions

  1. Conduct a univariate EDA (description statistics on each variable individually) of each variable (1 plot or table each).

Summarize your EDA in under 200 words.

  1. Find the following numbers:
  1. Number saved
  2. Number lost/died
  3. Total number on board
  4. Percentage of 3rd class passengers saved
  5. Percentage of 1st class passengers saved

Bonus: Survival rate of the Goodwin family.

Bring these with you on the day of our visit.

  1. Investigate the relationship between survival and age using a visualization.

Summarize your findings in 2-3 sentences that reference your visualizations.

  1. Investigate the relationship between survival and sex using a visualization.

Summarize your findings in 2-3 sentences that reference your visualizations.

Photo source: Personal photo by Tyler George of ticket at Titanic Belfast museum.