Encyclopedia Titanica

Surviving the Titanic tragedy: A sociological study using machine learning models

Suma De Negocios

   Join Us and Hide Ads

Sociological transactions play an important role in human behaviour and social standing. The Titanic was the perfect example as the passengers belonged to high-income, middle-income, and low-income groups. It is interesting to see how social factors influenced who was going to survive. Machine learning algorithms were applied after carrying out an exploratory and visual analysis. The hypothesis that women and children were saved was tested by a random forest algorithm as well as the hypothesis that family density played a major role in survival. The results showed that title and sex were the most important factors influencing if the passenger was to survive.

by Kshitiz Gupta, Dr. Prayas Sharma and Dr. Carlos N. Bouza Herreras
Key Points

Introduction

  • The Titanic was a symbol of British supremacy and was considered unsinkable.
  • It sank on April 15, 1912, after hitting an iceberg, resulting in the deaths of two-thirds of its passengers and crew.

Data and Methodology

  • Data included 890 passenger samples for training and 418 for testing.
  • Machine learning algorithms, specifically logistic regression and random forest, were used to analyze the data.
  • Key variables included title, sex, fare, class, and family density.

Hypotheses

  1. Economic Determinants: First-class passengers had a higher chance of survival due to better access to lifeboats and information.
  2. Natural Determinants: Younger people had a higher chance of survival.
  3. Social Determinants: Women and children had a higher chance of survival due to social norms.

Results

  • Title and Sex: These were the most significant factors in determining survival.
  • Class and Fare: First-class passengers had a higher survival rate.
  • Family Size: Smaller families had a higher chance of survival.
  • Age: Younger passengers had a better chance of survival.

Statistical Analysis

  • Logistic regression and random forest models were used to predict survival.
  • The models showed that title, sex, and fare were the top three variables influencing survival.

Conclusions

  • The lack of lifeboats was a major reason for the high death toll.
  • Social norms played a significant role in survival, with women and children being prioritized.
  • Economic status and proximity to lifeboats also influenced survival rates.

Find it on www.academia.edu

Encyclopedia Titanica is not responsible for the content of external sites, and the availability of links may change.

About Research References on Encyclopedia Titanica
This item is not available to read on Encyclopedia Titanica, but we have included it as a reference, provided a brief summary of the key points, and linked to the original source to help readers interested in the finer details of the Titanic story.

Find Related Items

Titanic Survivors Statistics Data Analysis Titanic Datasets Machine Learning Quantitative Data

Contribute

  Get in touch