Marcin Polkowski 2022-12-30 @ 02:58:27
· Category: Flight Data
Note: If you have questions or comments about the dataset used here, do not hesitate to contact me. You will find my email on my personal website.
The so-called “Southwest meltdown” happened this week as severe winter weather hit most of the US. Thousands of flights got canceled, making tens or even hundreds of thousands of people unhappy. Southwest operates primarily as a low-cost carrier, offering a no-frills experience with a focus on low fares. This implies a different way of operating connections. There is no central hub, and airplanes constantly fly to different locations. It turns out that this model is vulnerable when multiple airports get disrupted. There is a ton of resources and articles on this topic. I want to focus on how the “meltdown” is visible in my data.
Individual airlines
Let’s look at the number of airplanes (distinct machines) [blue line] and the number of flights (the same aircraft serves multiple connections every day) by day [green line]. Let’s look at the past four years and two months.
Thin, red vertical lines on the right-hand side plots show Thanksgiving and two Christmas Days.
Southwest Airlines
Interestingly, the 2022 “meltdown” brought the number of connections below the minimum numbers from the “2020 Covid Lockdown”.
American Airlines
United Airlines
Bonus: Lufthansa
This post is focused on the USA, but Lufthansa is the airline that I use the most (yeah, you can call it my favorite one). What is interesting here is the two significant dips in the number of connections in the second half of 2022. On these two days, Lufthansa personnel was on strike.
Major North America airlines – comparison
Let’s look at ten major airlines in North America. We are looking (below) at the number of daily connections/flights normalized to the average for the period between October 15th and December 15th. You can see that across all airlines, Thanksgiving weekend and Christmas are slower than “usual”, but after this Christmas (while all airlines returned to regular operation), Southwest plunged to around 30% of its normal operation.
Summery
Data is fun, let me know if you like it too!
permalink |
Marcin Polkowski 2022-12-21 @ 03:17:03
· Category: Flight Data
Note: If you have questions or comments about the dataset used here, do not hesitate to contact me. You will find my email on my personal website.
As I mentioned a a few months ago I’ve been collecting flight statistics for over 4 years. It happened that these 4 years were the most interesting in the industry ever. 2022 is the year when the travel demand is back to normal, but the airlines struggle with supply, because of the cuts they made in 2020/2021 to survive.
Depending on the country and region, people were living in lockdowns and countries were banning local and international travel. I think I don’t need to explain that to anyone.
On top of the Pandemic, we were witnessing events influencing individual airplane models. One of the examples of such events was the retirement of Airbuses A380 because they were too big. Some airlines now work hard to bring them back (ex. Lufthansa announced to bring back four A380s in 2023).
In the meantime, we had two catastrophes of Boeing’s 737 MAX-8 that grounded all of them for many months.
A few weeks ago Boeing produced the last 747 – one of the most recognizable planes in history. I created most of the content for this article while onboard 747 – my symbolic tribute to this great plane. I had the pleasure of traveling on both the lower and upper decks of this plane, however, I still hope that one day I will be able to get the First Class ticket that allows sitting “in front of the pilot” – a pretty unique thing.
Ok, let’s get to the data! I used my huge dataset to calculate the number of airplanes (unique machines), by model and date. Let me show you the most interesting results:
Boeing 737 MAX-8
On October 29, 2018, Lion Air Flight 610, a 737 MAX 8, plunged into the Java Sea 13 minutes after takeoff from Soekarno–Hatta International Airport, Jakarta, Indonesia. On March 10, 2019, Ethiopian Airlines Flight 302, a 737 MAX 8, crashed approximately six minutes after takeoff from Addis Ababa, Ethiopia. This second crash (marked as a red vertical dashed line) caused all MAX-8s to be grounded. It took almost 2 years before these planes were allowed back in the air. This model is constantly produced and sold, hence the growing number of planes in time.
Boeing, Airbus, Embraer, and Cessna VS the Pandemic
If we look at results consolidated by plane manufacturers we can see how differently the Pandemic affected commercial airliners (Boeing, Airbus, Embraer) and private planes (Cessna). I do realize that I generalize here the distinction between commercial and private.
Most popular models vs the Pandemic
Let’s look at the selection of popular planes and their counts in the air during the Pandemic. This plot is normalized: for each plane model, 1 on the y-axis is corresponding to the average number of planes a day before January 1st, 2022. We can see that the different models were “hit” differently by the Pandemic, but all of them still struggle to get back to “pre Pandemic” counts.
Individual airplanes
In case you are interested in other examples, I’m sharing plots for most airplane models. I encourage you to take a look and let me know if you found anything interesting.
permalink |
Marcin Polkowski 2022-10-01 @ 12:00:00
· Category: Flight Data
I’m very enthusiastic about both data science and plane travel. I found a way of combining these two passions. It stared from my frequently visiting flight tracking websites like adsbexchange.com or flightradar24.com. The next step for me was to build my own ADS-B receiver using Raspberry Pi. I stared with small antenna by the windows and quickly changed my setup to use big antenna on the top of my house. My Raspberry collects and shares data for over 8 years now (that reminds me, that I’ve never updated it). It receives signals from planes as far as 250 miles from my house. I’ve done some interesting data science using this data, but quickly realized that I’m hungry for more data.
Over 4 years ago I had an idea to start collecting data about all planes around the globe and storing it in the database for future science. I spend time searching for data sources, designing the database and all related processes and finally started collecting data. I started collecting data in November 2018.
Today, almost 4 years later I have a decent database to play with. The database is over 16 billion rows (1.6e10) and occupies almost 3 TB of disk space. I’m using Google BigQuery to store the data ans Docker to run the whole related infrastructure.
I’m going to share some of the most interesting analysis I was able to do with this data. I hope you will find it interesting and maybe even useful. Stay tuned!
permalink |