Short Analysis of Airplane Crashes

Gursev Pirge
7 min readOct 9, 2020

--

This article is a visual analysis of airplane crashes between 1969 and 2009. data.world keeps a detailed database about the airplane crashes and gives the opportunity to make an in-depth analysis for anyone interested in the subject. Although the data started from 1908, I preferred to analyze the modern era of flight and decided to take the above-mentioned time domain. There will be no zeppelins to analyze, but the results will be closer to reflect the effectiveness of the modern-day aerospace safety standards.

Wikipedia’s definition for aviation safety is: “Aviation safety means the state of an aviation system or organization in which risks associated with aviation activities, related to, or in direct support of the operation of aircraft, are reduced and controlled to an acceptable level.”

In order to increase the importance of the aviation safety, I tried to concentrate on the following issues for the selected time frame:

  • How many planes crashed?
  • How many people were on board?
  • How many people survived?
  • How many people died?
  • Crash numbers grouped by the make/model of aircraft (separately).

Visual exploratory analysis of the results using Pandas and Seaborn give a better picture about the incidents.

Data Cleaning

The first step is to import and clean the data using pandas before exploring the cause and effect relationship in the air disasters — which is not an easy task.

As shown in the above figure, there are 5268 rows and 13 columns (attributes) of data. Some of the rows define accidents before 1969 and also we will have to get rid of some rows, which have NaNs.

The results for the NaN value count of all the columns tell us to delete some columns. Also, for the current analysis, some columns will not be needed, and they will be deleted.

Next step will be to make a new column, by extracting the year value from the date column.

How Many Planes Crashed?

First, dtype of the new column “year” is converted to integer, then rows containing the data before 1969 were deleted, and we are ready to proceed to the analysis with 2994 rows of data about the air accidents. Much less from the starting value, but there will be no zeppelins, and the results will be closer to reflect the effectiveness of the modern-day aerospace safety standards.

According to the figure, globally, 2994 airplane crashed between the years 1969 and 2009.

The graph below shows the total number of accidents for the same time period. 1972 was an outlier with 104 accidents and there is data for half of 2009 with 30 accidents, still the trend for the number of accidents is downwards.

How Many People Were on Board?

Between 1969 and 2009, we have data for 2988 accidents and total number of passengers and flight crew members onboard during the accident is 98677.

Maximum number was 644, which belongs to a disaster in Tenerife, Spain, where two jumbo jets crashed on the runway.

The average number of people in the aircraft at the time of the accident is 33. Both large and small aircraft were listed, which is shown by the standard deviation (std).

How Many People Died?

Between 1969 and 2009, we have data for 2993 fatalities information for accidents involving airplanes and total number of victims involving passengers, flight crew members onboard and people on the ground, is 68714.

Maximum number was 583, which again belongs to the disaster in Tenerife, Spain, which was mentioned above.

The average number of fatalities is ~ 23. Both large and small aircraft were listed, which is shown by the standard deviation (std).

The graph below gives the annual total fatalities as a result of an aircraft accident. Although there are jumps in the values, the trend is downwards, pointing to the importance of safety precautions.

How Many People Survived?

Between 1969 and 2009, we have data for 2988 survival information for accidents involving airplanes and total number of survivors involving passengers and flight crew members onboard is 29988.

Maximum number of survivors was 516, which again belongs to a hijack incident of a jumbo jet in Japan. Only the captain of the aircraft was killed by the hijacker.

The average number of survivors is ~ 10. Both large and small aircraft were listed, which is shown by the standard deviation (std).

Crash numbers grouped by the make/model of aircraft.

Finally, I want to compare the statistics of the two major airplane manufacturing companies, namely Boeing and Airbus. I also included Fokker and Embraer to the comparison; as these two companies are leaders in the production of regional aircraft.

I do not want to make a mistake here. Both Airbus and Boeing produce very high quality and extremely safe aircraft. I tried to reach some company-originated data, but both companies tend to avoid safety comparisons when selling their aircraft to airlines or comparisons on product quality. Most aircraft dominating the companies’ current sales have good safety records. Older model aircraft, which were first flown during the 1960s, 1970s, and 1980s, have had higher rates of fatal accidents (Wikipedia link). You can also reach a report from this link.

In our case, older model aircraft are defined as Airbus 300 and 310, and Boeing 707 and 727. Accident statistics also prove this case.

I made another data frame using and cleaning the data from the dataset. Basically, I removed much older Boeing models’ accidents (only 7xx series planes were considered). The plot below shows the number of accidents of the companies, which manufacture bigger airplanes (capable of carrying more than 50 passengers) between 1969 and 2009.

Also, details about the accident numbers of the Boeing and Airbus aircraft are shown in the plots below. Remember that, Boeing 737 is in service since 1968, the number built is more than 10,580 and there are many different types including military versions. The highest accident number may be misleading due to being the most popular aircraft in the commercial flight.

Conclusion

In this article, I analyzed the fatal and non-fatal airplane accident and incidents between 1969 and 2009, using the public data provided by data.world. The analysis provides evidence that:

  • Total number of accidents were 2994. 1972 was the year with maximum number of crashes (104) and 2009 had the minimum number (30, only half of the year is covered).
  • Total number of people onboard during the accidents were 98677 and total number of victims was 68714 (including passengers, flight crew and people on the ground).
  • The highest number of people killed in a single accident is 583; which is a crash involving two jumbo jets.
  • Totally, 29988 people survived the accidents.

· There is downward slope in annual total fatalities as a result of an aircraft accident, pointing to the results of increased aviation safety precautions.

  • Finally, crash numbers were grouped by the make/model of aircraft. Four major producers were compared to get an idea about the safety of them. Older model aircraft, which were first flown during the 1960s, 1970s, and 1980s, have had higher rates of fatal accidents, but the newer models experience much less accidents.

Thank you for reading. I am planning to publish a more detailed follow-up article on this subject soon.

You can access to this article and similar ones here.

--

--