分解python進程

Question

我想嘗試從使用python從apriori下載的apriori算法中分解一些行代碼，因為當我嘗試計算大數據集時它會給我的內存錯誤。 這是我剛發現的一些問題。

def joinSet(itemSet, length):
        """Join a set with itself and returns the n-element itemsets"""
        return set([i.union(j) for i in itemSet for j in itemSet if len(i.union(j)) == length])

我想知道錯誤觸發時返回var的實際大小，所以我嘗試將這些代碼細分為此。

def joinSet(itemSet, length):
    """Join a set with itself and returns the n-element itemsets"""
    ret = []
    for i in itemSet:
        for j in itemSet:
            if len(i.union(j)) == length:
                ret.append(i.union(j))
    return ret

所以我可以監控每一步，但我的故障代碼並沒有給出與原始代碼相同的結果。

我還有什么想念？ 如果我的方法出錯，如果你能給我實際的解決方案，我將非常感激。 謝謝。

Answer 1

我想原文可能會返回一個集合，並且您將返回一個列表。

def joinSet(itemSet, length):
    """Join a set with itself and returns the n-element itemsets"""
    ret = []
    for i in itemSet:
        for j in itemSet:
            if len(i.union(j)) == length:
                ret.append(i.union(j))
    return set(ret)

此外，我認為您可以通過以下原始編輯節省大量內存：

def joinSet(itemSet, length):
        """Join a set with itself and returns the n-element itemsets"""
        return {i.union(j) for i in itemSet for j in itemSet if len(i.union(j)) == length}

這是一個集合理解，需要python> = 2.7

分解python進程

問題描述

1 個解決方案

解決方案1
0 已采納 2016-04-08 15:40:16

分解python進程

問題描述

1 個解決方案

解決方案1 0 已采納 2016-04-08 15:40:16

解決方案1
0 已采納 2016-04-08 15:40:16