有效地查找数量最小的索引，该索引大于大型排序列表中的某个值

Question

If I have a long list of sorted numbers, and I want to find the index of the smallest element larger than some value, is there a way to implement it more efficiently than using binary search on the entire list? 如果我有一个很长的排序数字列表，并且我想找到大于某个值的最小元素的索引，是否有比在整个列表上使用二进制搜索更有效地实现它的方法？

For example: 例如：

import random
c = 0
x = [0 for x in range(50000)]

for n in range(50000):
    c += random.randint(1,100)
    x[n] = c

What would be the most efficient way of finding the location of the largest element in x smaller than some number, z 在x中找到小于某个数字z的最大元素的位置的最有效方法是什么

I know that you can already do: 我知道您已经可以：

import bisect
idx = bisect.bisect(x, z)

But assuming that this would be performed many times, would there be an even more efficient way than binary search? 但是，假设此操作可以执行多次，那么还有比二进制搜索更有效的方法吗？ Since the range of the list is large, creating a dict of all possible integers uses too much memory. 由于列表的范围很大，因此创建所有可能整数的字典将占用过多内存。 Would it be possible to create a smaller list of say every 5000 numbers and use that to speed up the lookup to a specific portion of the large list? 是否可以创建一个较小的列表（例如每5000个数字），并使用该列表来加快对较大列表的特定部分的查找？

Answer 1

Can you try if this can be a solution? 您能尝试一下是否可以解决？ It takes long to generate the list, but seems fast to report the result. 生成列表需要很长时间，但是报告结果似乎很快。

Given the list: 给定列表：

import random
limit = 50 # to set the number of elements
c = 0
x = [0 for x in range(limit)]

for n in range(limit):
    c += random.randint(1,100)
    x[n] = c
print(x)

Since it is a sorted list, you could retrieve the value using a for loop: 由于它是一个有序列表，因此可以使用for循环来检索值：

z = 1600 # reference for lookup

res = ()
for i, n in enumerate(x):
  if n > z:
    res = (i, n)
    break

print (res) # this is the index and value of the elements that match the condition to break
print(x[res[0]-1]) # this is the element just before

有效地查找数量最小的索引，该索引大于大型排序列表中的某个值

问题描述

1 个解决方案

解决方案1
0 2018-12-05 09:01:25

有效地查找数量最小的索引，该索引大于大型排序列表中的某个值

问题描述

1 个解决方案

解决方案1 0 2018-12-05 09:01:25

解决方案1
0 2018-12-05 09:01:25