如何提高我的選擇過程的速度，python

Question

編輯：由於我的代碼中的錯誤，我用我最老但工作的代碼更新

我從數據庫中獲得了一個速度記錄列表，我想找到該列表中的最大速度。 聽起來很容易，但我對任何最大速度都有一些要求：

如果最大速度超過一定水平，則必須將超過一定數量的記錄識別為最大速度。 這種邏輯的原因是我想要在正常條件下的最大速度，而不僅僅是錯誤或一次出現。 出於同樣的原因，我還有一個約束，即速度必須超過一定的限制才能計算。

以下是速度數組的示例：

v = [8.0, 1.3, 0.7, 0.8, 0.9, 1.1, 14.9, 14.0, 14.1, 14.2, 14.3, 13.8, 13.9, 13.7, 13.6, 13.5, 13.4, 15.7, 15.8, 15.0, 15.3, 15.4, 15.5, 15.6, 15.2, 12.8, 12.7, 12.6, 8.7, 8.8, 8.6, 9.0, 8.5, 8.4, 8.3, 0.1, 0.0, 16.4, 16.5, 16.7, 16.8, 17.0, 17.1, 17.8, 17.7, 17.6, 17.4, 17.5, 17.3, 17.9, 18.2, 18.3, 18.1, 18.0, 18.4, 18.5, 18.6, 19.0, 19.1, 18.9, 19.2, 19.3, 19.9, 20.1, 19.8, 20.0, 19.7, 19.6, 19.5, 20.2, 20.3, 18.7, 18.8, 17.2, 16.9, 11.5, 11.2, 11.3, 11.4, 7.1, 12.9, 14.4, 13.1, 13.2, 12.5, 12.1, 12.2, 13.0, 0.2, 3.6, 7.4, 4.6, 4.5, 4.3, 4.0, 9.4, 9.6, 9.7, 5.8, 5.7, 7.3, 2.1, 0.4, 0.3, 16.1, 11.9, 12.0, 11.7, 11.8, 10.0, 10.1, 9.8, 15.1, 14.7, 14.8, 10.2, 10.3, 1.2, 9.9, 1.9, 3.4, 14.6, 0.6, 5.1, 5.2, 7.5, 19.4, 10.7, 10.8, 10.9, 0.5, 16.3, 16.2, 16.0, 16.6, 12.4, 11.0, 1.7, 1.6, 2.4, 11.6, 3.9, 3.8, 14.5, 11.1]

這是我的代碼，以找到我定義為真正的最大速度：

from collections import Counter
while max(speeds)>30:
    speeds.remove(max(speeds))
nwsp = []
for s in speeds:
    nwsp.append(np.floor(s))
count = Counter(nwsp)
while speeds and max(speeds)>14 and count[np.floor(max(speeds))]<10:
    speeds.remove(max(speeds))
while speeds and max(speeds)<5:
    speeds.remove(max(speeds))
if speeds:
    print max(speeds)
    return max(speeds)
else:
    return False

結果如圖所示：19.9

我做nwsp的原因是，如果f.ex 19.6只被發現9次對我來說無關緊要 - 如果在同一個整數內有任何數字，f.ex 19.7也被找到3次，那么19.6將是有效。

如何重寫/優化此代碼以便選擇過程更快？ 我已經刪除了max（速度），而是對列表進行了排序，並使用velocity [-1]引用了最大的元素。

很抱歉沒有為我的速度添加任何單位。

Answer 1

您的代碼速度很慢，因為您調用max並一遍又一遍地remove ，並且每個調用都需要與列表長度成比例的時間。 任何合理的解決方案都會快得多。

如果您知道False不會發生，那么這就足夠了：

speeds = [8.0, 1.3, 0.7, 0.8, 0.9, 1.1, 14.9, 14.0, 14.1, 14.2, 14.3, 13.8, 13.9, 13.7, 13.6, 13.5, 13.4, 15.7, 15.8, 15.0, 15.3, 15.4, 15.5, 15.6, 15.2, 12.8, 12.7, 12.6, 8.7, 8.8, 8.6, 9.0, 8.5, 8.4, 8.3, 0.1, 0.0, 16.4, 16.5, 16.7, 16.8, 17.0, 17.1, 17.8, 17.7, 17.6, 17.4, 17.5, 17.3, 17.9, 18.2, 18.3, 18.1, 18.0, 18.4, 18.5, 18.6, 19.0, 19.1, 18.9, 19.2, 19.3, 19.9, 20.1, 19.8, 20.0, 19.7, 19.6, 19.5, 20.2, 20.3, 18.7, 18.8, 17.2, 16.9, 11.5, 11.2, 11.3, 11.4, 7.1, 12.9, 14.4, 13.1, 13.2, 12.5, 12.1, 12.2, 13.0, 0.2, 3.6, 7.4, 4.6, 4.5, 4.3, 4.0, 9.4, 9.6, 9.7, 5.8, 5.7, 7.3, 2.1, 0.4, 0.3, 16.1, 11.9, 12.0, 11.7, 11.8, 10.0, 10.1, 9.8, 15.1, 14.7, 14.8, 10.2, 10.3, 1.2, 9.9, 1.9, 3.4, 14.6, 0.6, 5.1, 5.2, 7.5, 19.4, 10.7, 10.8, 10.9, 0.5, 16.3, 16.2, 16.0, 16.6, 12.4, 11.0, 1.7, 1.6, 2.4, 11.6, 3.9, 3.8, 14.5, 11.1]

from collections import Counter
count = Counter(map(int, speeds))
print max(s for s in speeds
          if 5 <= s <= 30 and (s <= 14 or count[int(s)] >= 10))

如果可能發生False情況，這將是一種方式：

speeds = [8.0, 1.3, 0.7, 0.8, 0.9, 1.1, 14.9, 14.0, 14.1, 14.2, 14.3, 13.8, 13.9, 13.7, 13.6, 13.5, 13.4, 15.7, 15.8, 15.0, 15.3, 15.4, 15.5, 15.6, 15.2, 12.8, 12.7, 12.6, 8.7, 8.8, 8.6, 9.0, 8.5, 8.4, 8.3, 0.1, 0.0, 16.4, 16.5, 16.7, 16.8, 17.0, 17.1, 17.8, 17.7, 17.6, 17.4, 17.5, 17.3, 17.9, 18.2, 18.3, 18.1, 18.0, 18.4, 18.5, 18.6, 19.0, 19.1, 18.9, 19.2, 19.3, 19.9, 20.1, 19.8, 20.0, 19.7, 19.6, 19.5, 20.2, 20.3, 18.7, 18.8, 17.2, 16.9, 11.5, 11.2, 11.3, 11.4, 7.1, 12.9, 14.4, 13.1, 13.2, 12.5, 12.1, 12.2, 13.0, 0.2, 3.6, 7.4, 4.6, 4.5, 4.3, 4.0, 9.4, 9.6, 9.7, 5.8, 5.7, 7.3, 2.1, 0.4, 0.3, 16.1, 11.9, 12.0, 11.7, 11.8, 10.0, 10.1, 9.8, 15.1, 14.7, 14.8, 10.2, 10.3, 1.2, 9.9, 1.9, 3.4, 14.6, 0.6, 5.1, 5.2, 7.5, 19.4, 10.7, 10.8, 10.9, 0.5, 16.3, 16.2, 16.0, 16.6, 12.4, 11.0, 1.7, 1.6, 2.4, 11.6, 3.9, 3.8, 14.5, 11.1]

from collections import Counter
count = Counter(map(int, speeds))
valids = [s for s in speeds
         if 5 <= s <= 30 and (s <= 14 or count[int(s)] >= 10)]
print max(valids) if valids else False

或者排序並使用next ，默認情況下可以使用False ：

speeds = [8.0, 1.3, 0.7, 0.8, 0.9, 1.1, 14.9, 14.0, 14.1, 14.2, 14.3, 13.8, 13.9, 13.7, 13.6, 13.5, 13.4, 15.7, 15.8, 15.0, 15.3, 15.4, 15.5, 15.6, 15.2, 12.8, 12.7, 12.6, 8.7, 8.8, 8.6, 9.0, 8.5, 8.4, 8.3, 0.1, 0.0, 16.4, 16.5, 16.7, 16.8, 17.0, 17.1, 17.8, 17.7, 17.6, 17.4, 17.5, 17.3, 17.9, 18.2, 18.3, 18.1, 18.0, 18.4, 18.5, 18.6, 19.0, 19.1, 18.9, 19.2, 19.3, 19.9, 20.1, 19.8, 20.0, 19.7, 19.6, 19.5, 20.2, 20.3, 18.7, 18.8, 17.2, 16.9, 11.5, 11.2, 11.3, 11.4, 7.1, 12.9, 14.4, 13.1, 13.2, 12.5, 12.1, 12.2, 13.0, 0.2, 3.6, 7.4, 4.6, 4.5, 4.3, 4.0, 9.4, 9.6, 9.7, 5.8, 5.7, 7.3, 2.1, 0.4, 0.3, 16.1, 11.9, 12.0, 11.7, 11.8, 10.0, 10.1, 9.8, 15.1, 14.7, 14.8, 10.2, 10.3, 1.2, 9.9, 1.9, 3.4, 14.6, 0.6, 5.1, 5.2, 7.5, 19.4, 10.7, 10.8, 10.9, 0.5, 16.3, 16.2, 16.0, 16.6, 12.4, 11.0, 1.7, 1.6, 2.4, 11.6, 3.9, 3.8, 14.5, 11.1]

count = Counter(map(int, speeds))
print next((s for s in reversed(sorted(speeds))
            if 5 <= s <= 30 and (s <= 14 or count[int(s)] >= 10)),
           False)

而不是Counter ，你也可以使用groupby ：

speeds = [8.0, 1.3, 0.7, 0.8, 0.9, 1.1, 14.9, 14.0, 14.1, 14.2, 14.3, 13.8, 13.9, 13.7, 13.6, 13.5, 13.4, 15.7, 15.8, 15.0, 15.3, 15.4, 15.5, 15.6, 15.2, 12.8, 12.7, 12.6, 8.7, 8.8, 8.6, 9.0, 8.5, 8.4, 8.3, 0.1, 0.0, 16.4, 16.5, 16.7, 16.8, 17.0, 17.1, 17.8, 17.7, 17.6, 17.4, 17.5, 17.3, 17.9, 18.2, 18.3, 18.1, 18.0, 18.4, 18.5, 18.6, 19.0, 19.1, 18.9, 19.2, 19.3, 19.9, 20.1, 19.8, 20.0, 19.7, 19.6, 19.5, 20.2, 20.3, 18.7, 18.8, 17.2, 16.9, 11.5, 11.2, 11.3, 11.4, 7.1, 12.9, 14.4, 13.1, 13.2, 12.5, 12.1, 12.2, 13.0, 0.2, 3.6, 7.4, 4.6, 4.5, 4.3, 4.0, 9.4, 9.6, 9.7, 5.8, 5.7, 7.3, 2.1, 0.4, 0.3, 16.1, 11.9, 12.0, 11.7, 11.8, 10.0, 10.1, 9.8, 15.1, 14.7, 14.8, 10.2, 10.3, 1.2, 9.9, 1.9, 3.4, 14.6, 0.6, 5.1, 5.2, 7.5, 19.4, 10.7, 10.8, 10.9, 0.5, 16.3, 16.2, 16.0, 16.6, 12.4, 11.0, 1.7, 1.6, 2.4, 11.6, 3.9, 3.8, 14.5, 11.1]

from itertools import *
groups = (list(group) for _, group in groupby(reversed(sorted(speeds)), int))
print next((s[0] for s in groups
            if 5 <= s[0] <= 30 and (s[0] <= 14 or len(s) >= 10)),
           False)

以防所有這些看起來很奇怪，這里有一個接近你原來的。 只需查看從最快到最慢的速度並返回符合要求的第一個：

def f(speeds):
    count = Counter(map(int, speeds))
    for speed in reversed(sorted(speeds)):
        if 5 <= speed <= 30 and (speed <= 14 or count[int(speed)] >= 10):
            return speed
    return False

順便說一下，你對“真正的最大速度”的定義對我來說似乎很奇怪。 只看某個百分點怎么樣？ 也許是這樣的：

print sorted(speeds)[len(speeds) * 9 // 10]

Answer 2

我不確定這是否更快，但它更短，我認為它可以滿足您的要求。 它使用Counter 。

from collections import Counter
import math

def valid(item):
  speed,count = item
  return speed <= 30 and (speed <= 13 or count >= 10)

speeds = [4,3,1,3,4,5,6,7,14,16,18,19,20,34,5,4,3,2,12,58,14,14,14]

speeds = map(math.floor,speeds)
counts = Counter(speeds)
max_valid_speed = max(filter(valid,counts.items()))

結果： max_valid_speed == (12,1)

Answer 3

使用您的排序想法，我們可以從列表末尾開始，小於30的數字，返回匹配條件的第一個數字或返回False：

from collections import Counter

def f(speeds):
    # get speeds that satisfy the range
    rev = [speed for speed in speeds if 5 <= speed < 30]
    rev.sort(reverse=True)
    c = Counter((int(v) for v in rev))
    for speed in rev:
        # will hit highest numbers first
        # so return first that matches
        if speed > 14 and c[int(speed)] > 9 or speed < 15:
            return speed
    # we did not find any speed that matched our requirement
    return False

列表輸出v：

In [70]: f(v)
Out[70]: 19.9

如果沒有排序，您可以使用dict，根據您的數據將決定哪個最好，它將適用於所有情況，包括空列表：

 def f_dict(speeds):
    d = defaultdict(lambda: defaultdict(lambda: 0, {}))
    for speed in speeds:
        key = int(speed)
        d[key]["count"] += 1
        if speed > d[key]["speed"]:
            d[key]["speed"] = speed
    filt = max(filter(lambda x: (15 <= x[0] < 30 and
                                 x[1]["count"] > 9 or x[0] < 15), d.items()), default=False)
    return filt[1]["speed"] if filt else False

輸出：

In [95]: f_dict(v)
Out[95]: 19.9

如何提高我的選擇過程的速度，python

問題描述

3 個解決方案

解決方案1
2 已采納 2015-04-28 21:59:05

解決方案2
1 2015-04-28 18:53:49

解決方案3
1 2015-04-28 19:27:11

如何提高我的選擇過程的速度，python

問題描述

3 個解決方案

解決方案1 2 已采納 2015-04-28 21:59:05

解決方案2 1 2015-04-28 18:53:49

解決方案3 1 2015-04-28 19:27:11

解決方案1
2 已采納 2015-04-28 21:59:05

解決方案2
1 2015-04-28 18:53:49

解決方案3
1 2015-04-28 19:27:11