简体   繁体   中英

R with ff and FSelector package

I have a 1360x92735 csv dataset and I have to reduce dimensionality using FSelector package for R (information.gain()), but it requests a lot of ram.

My question is, can I use the ff package in combination with FSelector? If yes, how?

ps I have 8GB of ram and 8GB of swap on linux.

Thanks.

[EDIT]

I've try to use ff and FSelector package with iris dataset. It seems to work well, but now I've a problem with ff.

My csv dataset is 1303x92735 and when I try to use an ff object to convert a dataframe with as.ffdf(), or to directly load dataset with read.csv.ffdf(), R crash with "write error".

Here someone has same problem, but I don't understand if reachs a solution or not.

Thanks.

The error is likely due to the fact that ff opens a file for each column in the ff data frame. You have 92,735 columns which is likely to be many more than your system configuration for the max number of open files. I've answered this on SO here .

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM