Should read We don't need all of these columns - just one column that we can join to our dataframe on, and another for the LSOA. Subset the dataframe to just contain the columns 'query' and 'result_codes_lsoa'.