Data for Airlines Decision Model Under Competition
Readme file for data
The following are the data files included in this set; further details are given after the list.
- final_data_presence_**** – has the data on airport presence of United Airlines (UA) and Delta (DL) calculated using BTS T-100 data.
- airports_list – identifies the airports included in our network
- unionset – indicates the set of routes that existed in the given years
- final_data**** - has the data processed by appling filters to the BTS dataset. Two separate folders contains files with name ‘final_data****’. These are Output Data UA 2005-14 and Output Data DL 2005-14 which corresponds to filtered data for UA and DL respectively.
Details of the data files
This data file contains information on airport presence of the airlines UA and DL. ‘****’ indicator in the name indicates the year for which this data file is created. This included matrix that had the following columns:
ALL_POSSIBLE_ROUTES - This is the set of 8646 possible routes. They are identified by a ten digit code, the first five digits of which are the Airport ID of the origin airport as assigned in the BTS dataset, and the last five digits are the same for the destination airport.
AP_PRESENCE_DELTA - Airport presence of DL for that year
AP_PRESENCE_UNITED - Airport presence of UA for that year
The airport presence was calculated using the data recorded in BTS T-100 database.
This file has three columns. The first column is the airport code, the second column is the five digit airport ID given in the BTS dataset, and the third column identifies (using indicator '1') if the airport is a hub.
This file identifies the set of routes that existed in the years under consideration. Each of the columns are for a year and correspond to the 'ROUTES_CURRENT' column in the 'final_dataXXXX' file as described above.
This data file includes the filtered data from BTS that we used for our analysis. The '****' indicator in the name indicates the year for which this data file is created. The included matrix has the following columns:
possRoutes - This is the set of 8646 possible routes, which, if all of them exist, would make for a fully connected network. They are identified by a ten digit code, the first five digits of which are the Airport ID of the origin airport as assigned in the BTS dataset, and the last five digits are the same for the destination airport.
DISTANCE - This is the distance between the airport pairs.
ROUTES - A column that identifies the routes that exist in the given year. '1' indicates the route exists while '0' indicates that it does not.
ADD - These are the routes that were added in the current year; they are identified by '1'.
DEL - These are the routes that were deleted in the current year; they are identified by '1'.
HUBS - These identify the 'hub-level' of the route. '0' indicates both the terminal airports on this route are non-hubs, '1' indicates that one of the two airports is a hub, and '2' indicates that both airports are hubs.
DIRECT_COST - This is the direct operating cost on the route. It is calculated as a weighted by number of operations sum using data in the BTS Schedule P-5.2.
SEG_PAX - This is the segment demand the route as reported in the BTS T-100 dataset.
MKT_PAX - This is the market demand the route as reported in the BTS T-100 dataset.
These were data for individual airlines considered in our analysis. These data are provided for two airlines in the US ATS: UA and DL. Data for UA is in the folder ‘Output Data UA 2005-14’ and data for DL is in the folder ‘Output Data DL’.
Cite this work
Researchers should cite this work as follows:
- Thekinen, J. D., Moolchandani, K., Panchal, J. H., Delaurentis, D. A. (2018). Data For Airlines Decision Model Under Competition. Purdue University Research Repository. doi:10.4231/R7C53J4B