Lab 7

Due by 11:59 PM on Friday, March 19, 2021

We’ll work more with the nycflights13 dataset here.

Here’s a plot of how the data is related to each other:

Load the nycflights13 and all five data frames into R.
How many different plane “brands” (i.e. manufacturers) fly from “LGA”?
Calculate the average age of each plane for each flight; How does the average age of each plane differ by origin airport?
How many airports are in the airports data frame that are not a direct destination from any of the NYC airports (i.e. the flights dataframe)?
Calculate the number of tailnumbers (observations) per carrier. Arrange in alphabetical order by the full carrier name from the airlines dataframe.
How many flights were made to an “America/Los_Angeles” timezone from “JFK”? Which destination within that timezone received the most flights from “JFK”?
For each flight, find if it departed with decent weather. Drop observations that we don’t have weather information or dep_delay information. See if decent weather is associated with the delay of the plane.
- Define “fair weather” to mean no precipitation, wind-speeds under 20 mph, and visibility of at least 10 miles.

flights %>%
  mutate(hour = dep_time %/% 100) %>%
  YOUDECIDE_join() %>%
  mutate() ... and so on

Last updated on March 18, 2021