问题描述
整个错误读取-
ValueError:基于位置的索引只能有 [标签(必须在 索引),标签切片(包括两个端点!可以是切片 如果索引是整数,则为整数),标签列表,布尔值] 类型
要重现错误,只需粘贴并运行此文件(36MB 文件,可能需要几秒钟)-
import pandas as pd
df = pd.read_csv('https://raw.githubusercontent.com/vyaduvanshi/helper-files/master/error_df.csv') #36MB file
temp_total_tests = []
temp_total_recovered = []
temp_total_cases = []
temp_total_vaccinations = []
temp_total_deaths = []
temp_variables = [temp_total_tests,temp_total_recovered,temp_total_cases,temp_total_vaccinations,temp_total_deaths]
for country in df['country'].unique():
temp_df = df[df['country'] == country]
[(temp_variable.append(temp_df.loc[:,temp_variable].ffill())) for temp_variable in temp_variables]
df['total_tests'] = pd.concat(temp_variables[0],ignore_index=True)
df['total_recovered'] = pd.concat(temp_variables[1],ignore_index=True)
df['total_cases'] = pd.concat(temp_variables[2],ignore_index=True)
df['total_vaccinations'] = pd.concat(temp_variables[3],ignore_index=True)
df['total_deaths'] = pd.concat(temp_variables[4],ignore_index=True)
我想要做的事情可以用这段代码(下面)来实现,但是这里有很多重复,所以我尝试了上面的列表理解。
for country in df['country'].unique():
temp_df = df[df['country'] == country]
temp_df.loc[:,'total_tests'] = temp_df.loc[:,'total_tests'].ffill()
temp_total_tests.append(temp_df['total_tests'])
temp_df.loc[:,'total_recovered'] = temp_df.loc[:,'total_recovered'].ffill()
temp_total_recovered.append(temp_df['total_recovered'])
temp_df.loc[:,'total_cases'] = temp_df.loc[:,'total_cases'].ffill()
temp_total_cases.append(temp_df['total_cases'])
temp_df.loc[:,'total_vaccinations'] = temp_df.loc[:,'total_vaccinations'].ffill()
temp_total_vaccinations.append(temp_df['total_vaccinations'])
temp_df.loc[:,'total_deaths'] = temp_df.loc[:,'total_deaths'].ffill()
temp_total_deaths.append(temp_df['total_deaths'])
df['total_tests'] = pd.concat(temp_total_tests,ignore_index=True)
df['total_recovered'] = pd.concat(temp_total_recovered,ignore_index=True)
df['total_cases'] = pd.concat(temp_total_cases,ignore_index=True)
df['total_vaccinations'] = pd.concat(temp_total_vaccinations,ignore_index=True)
df['total_deaths'] = pd.concat(temp_total_deaths,ignore_index=True)
解决方法
暂无找到可以解决该程序问题的有效方法,小编努力寻找整理中!
如果你已经找到好的解决方法,欢迎将解决方案带上本链接一起发送给小编。
小编邮箱:dio#foxmail.com (将#修改为@)