Apr-24-2020, 10:08 AM
Good day fellow coders,
I am a little stuck with an index error issue. Below is the description:
Table name: Sub
ID, Pred
2014_1107_1110, 0.5
2014_1107_1112, 0.5
2014_1107_1113, 0.5
2014_1107_1124, 0.5
Season
Season, Seed, TeamID
1998, W01, 3330
1998, W02, 3163
1998, W03, 3112
1998, W04, 3301
Method to simplfy the dataset
[b]Output from simplification method
Season TeamID seed_int
0 1998 3330 1
1 1998 3163 2
2 1998 3112 3
3 1998 3301 4
4 1998 3272 5
Pandas encaps SeedsTable
Season Seed TeamID
0 1998 W01 3330
1 1998 W02 3163
2 1998 W03 3112
3 1998 W04 3301
4 1998 W05 3272
Code that generates index Error
---------------------------------------------------------------------------
I am a little stuck with an index error issue. Below is the description:
Table name: Sub
ID, Pred
2014_1107_1110, 0.5
2014_1107_1112, 0.5
2014_1107_1113, 0.5
2014_1107_1124, 0.5
Season
Season, Seed, TeamID
1998, W01, 3330
1998, W02, 3163
1998, W03, 3112
1998, W04, 3301
Method to simplfy the dataset
def seed_to_int(seed):
s_int = int(seed[1:3])
return s_int
df_seeds['seed_int'] = df_seeds.Seed.apply(seed_to_int)
df_seeds.drop(labels = ['Seed'], inplace = True, axis = 1)
df_seeds.head()[b]Output from simplification method
Season TeamID seed_int
0 1998 3330 1
1 1998 3163 2
2 1998 3112 3
3 1998 3301 4
4 1998 3272 5
Pandas encaps SeedsTable
df_seeds = pd.read_csv(data_directory + '/Seeds.csv')output
Season Seed TeamID
0 1998 W01 3330
1 1998 W02 3163
2 1998 W03 3112
3 1998 W04 3301
4 1998 W05 3272
Code that generates index Error
X_test = np.zeros(shape = (n_test_games, 1))
for ii, row in df_sample_sub.iterrows():
year, t1, t2 = get_year_t1_t2(row.ID)
t1_seed = df_seeds[(df_seeds.TeamID == t1) & (df_seeds.Season == year)].seed_int.values[0]
t2_seed = df_seeds[(df_seeds.TeamID == t2) & (df_seeds.Season == year)].seed_int.values[0]
diff_seed = t1_seed - t2_seed
X_test[ii,0] = diff_seedActual index Error output---------------------------------------------------------------------------
IndexError Traceback (most recent call last)
<ipython-input-37-0215f157ee60> in <module>
3 for ii, row in df_sample_sub.iterrows():
4 year, t1, t2 = get_year_t1_t2(row.ID)
----> 5 t1_seed = df_seeds[(df_seeds.TeamID == t1) & (df_seeds.Season == year)].seed_int.values[0]
6 t2_seed = df_seeds[(df_seeds.TeamID == t2) & (df_seeds.Season == year)].seed_int.values[0]
7 diff_seed = t1_seed - t2_seed
IndexError: index 0 is out of bounds for axis 0 with size 0As you can see, the prolem is pointing to this line:t1_seed = df_seeds[(df_seeds.TeamID == t1) & (df_seeds.Season == year)].seed_int.values[0]Please can anyone proffer a solution to this?
