You have some datasets that have % values strings e.g. '95%',''82%' etc. It would be great if this could be automatically dealt with. On Pandas dataframe this can be done with df = df.replace('%','',regex=True).astype('float')