Input contains NaN, infinity or a value too large for dtype('float32')

Question

How do I fix this error message: "ValueError: Input contains NaN, infinity or a value too large for dtype('float32')"?

# Importing the libraries
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

# Loading the dataset
data = pd.read_csv(r'C:\Users\sam.jones\Desktop\Fixed Income project\Data Pull\Data\Fixed Income_Data dump_2018.csv',error_bad_lines=False,encoding = "ISO-8859-2")
X = np.array([data.iloc[:,158].values])
Y = data.iloc[:,92].values


#Fitting Random Forest Regression to the dataset
from sklearn.ensemble import RandomForestRegressor
regressor = RandomForestRegressor(n_estimators = 10, random_state = 0)
regressor.fit(X,Y)

Answer 1

The input might contain NaNs, so use np.nan_to_num(X) to replace them with zeros first.
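
As a minimal sketch of that suggestion, the snippet below uses a made-up array rather than the asker's data. One caveat: by default np.nan_to_num maps infinities to the largest finite value of the array's dtype, which for float64 input is still too large for float32. The sketch therefore passes explicit replacements through the nan, posinf and neginf keywords (available in NumPy 1.17 and later). Replacing infinities with 0.0 is only an illustrative choice.

```python
import numpy as np

# Hypothetical feature matrix with a NaN and an infinity (illustrative data only).
X = np.array([[1.0, np.nan], [np.inf, 4.0], [5.0, 6.0]])

# Report how many problematic entries exist before changing anything.
print("NaN count:", np.isnan(X).sum())
print("inf count:", np.isinf(X).sum())

# Replace NaN with 0 and infinities with explicit finite values.
# Without posinf/neginf, infinities become the largest float64 values,
# which are still too large for float32.
X_clean = np.nan_to_num(X, nan=0.0, posinf=0.0, neginf=0.0)

# Every entry should now be finite.
print("all finite:", np.isfinite(X_clean).all())
```

Filling with zeros changes the data, so it may be worth checking whether zero is a sensible stand-in for the missing values in the column being modelled.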

Answer 2

Try declaring a variable:

x = x.fillna(test.mean())
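
The line above refers to a name, test, that is not defined in the question. A hedged reading is that missing values should be filled with column means and the result assigned back to a variable. The sketch below assumes the means come from the same DataFrame and uses made-up data. It also converts infinities to NaN first, because fillna does not touch infinite values.

```python
import numpy as np
import pandas as pd

# Hypothetical data frame standing in for the asker's data (illustrative values).
x = pd.DataFrame({"a": [1.0, np.nan, 3.0], "b": [np.inf, 2.0, 4.0]})

# Treat infinities as missing so they are filled too; fillna alone ignores them.
x = x.replace([np.inf, -np.inf], np.nan)

# Fill each column's missing values with that column's mean (NaN is skipped
# when computing the mean). fillna returns a new object by default, so the
# result has to be assigned back to a variable.
x = x.fillna(x.mean())

print(x)
```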

Answer 3

In my case the error was caused by very large numbers, in particular values written in scientific notation such as 3.63E+08 and 1.25E+09. The solution is to make those numbers smaller: you can simply divide them by 1000 or, better, use a function to scale or normalise the data. After that, you can train your model.
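
Both options can be sketched with NumPy alone. The array below is made up, and standardisation (zero mean, unit variance per column) is just one common choice of scaling, not necessarily the one this answer had in mind. The sketch also prints the largest finite float32 value so the magnitudes can be compared against it.

```python
import numpy as np

# Hypothetical feature column with large magnitudes (illustrative values only).
X = np.array([[3.63e8], [1.25e9], [2.0e8]])

# For reference: the largest finite value float32 can represent.
print("float32 max:", np.finfo(np.float32).max)

# Option 1: divide by a constant, as the answer suggests.
X_divided = X / 1000

# Option 2: standardise each column to zero mean and unit variance.
# This assumes no column is constant; a zero standard deviation would
# cause a division by zero.
mean = X.mean(axis=0)
std = X.std(axis=0)
X_scaled = (X - mean) / std

print(X_scaled.ravel())
```

If a model is later applied to new data, the same mean and std computed from the training data would normally be reused rather than recomputed.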

3 answers

solution1
2 2019-02-27 19:27:22

solution2
0 2020-03-05 06:53:38

solution3
0 2021-01-09 10:49:41
