简体   繁体   中英

cPickle EOFError when loading pickled data

EDIT: I'm stupid

Sorry everyone. The reason my loop wasn't working is a profoundly stupid one; I had the arguments in the wrong order for pickle.dump().

OP

First time pickling objects, sorry if this is a nub question; have looked at various other EOFError posts but none either work or seem to apply.

Additionally, though I close the .pkl file, when the script has run it always has a zero-byte file size. I don't understand why.

I'm on OS X (Mavericks); here's my code (slightly reduced for the purposes of readability; the to_file_inspections() def is a class method and a couple of var names have been made more useful to a general audience, otherwise is verbatim. And FileInspection class is verbatim!

def to_file_inspections(data_store)
    jar = open("jar.pkl","wb")
    for f in data_store:
        f_ob = FileInspection(f) #custom class, not very elaborate
        cPickle.dump(jar,f_ob, -1)
    jar.close()
    #now just to test it...
    data = cPickle.load(open("jar.pkl", "rb"))
    print repr(data)

Results in:

    Traceback (most recent call last):
      File "main.py", line 400, in <module>
        to_file_inspections()
      File "main.py", line 355, in to_file_inspections
        data = cPickle.load(open("jar.pkl", "rb"))
    EOFError

In case it matters, this is my FileInspection class:

class FileInspection:
    def __init__(self, path):
        self._path = path.rstrip('\n')
        self._hash = self.hash_file()
        self._inspectable = True
        stats = os.stat(self._path)
        self._size = stats.st_size
        self._last_mod = stats.st_mtime

    def hash_file(self):
        read_size = 1024 # You can make this bigger
        checksum = hashlib.md5()
        with open(self._path) as f:
            data = f.read(read_size)
            while data:
                checksum.update(data)
                data = f.read(read_size)
        checksum = checksum.hexdigest()

        return checksum

    def toString(self):
        return '{0} = {1}"checked":false, "path":"{2}", "hash":{3} "inspectable":{4}, "filesize":{5}, "lastmod":"{6}"{7}'.format(
            self._hash, "{", self._path, self._inspectable, self._hash, self._last_mod, "}\n")

    def toJSON(self):
        return '\t"{0}":{1}\n\t\t"checked":false,\n\t\t"path":"{2}",\n\t\t"hash":"{3}",\n\t\t"inspectable":"{4}",\n\t\t"filesize":"{5}",\n\t\t"last_mod":"{6}"\n\t{7}'.format(
            self._hash, "{", self._path, self._hash, self._inspectable, self._size, self._last_mod, "}")

    @property
    def path(self):
        return self._path

    @property
    def hash(self):
        return self._hash

    @property
    def inspectable(self):
        return self._inspectable

    @property
    def size(self):
        return self._size

    @property
    def last_mod(self):
        return self._last_mod

Sorry everyone. The reason my loop wasn't working is a profoundly stupid one; I have the arguments in the wrong order for pickle.dump().

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM