用另一個列表中的項目索引替換列表中所有出現的單詞

Question

我有一個清單-

A=["hi how are you","have good day","where are you going ","do you like the place"]

和另一個列表-

B=["how","good","where","going","like","place"]

列表B包含列表A中存在的某些單詞。我想用列表B中的索引替換列表A中出現的所有列表B中的單詞。如果不存在該單詞，請將其替換為0

因此，替換后的清單A應該是

["0 1 0 0","0 2 0","3 0 0 4","0 0 5 0 6"]

我嘗試使用for循環，但由於列表長度> 10000，所以效率不高。我也嘗試使用map函數，但未成功

這是我的嘗試：

for item in list_A:
    words=sorted(item.split(), key=len,reverse=True)
    for w in word:
        if w.strip() in list_B:
            item=item.replace(w,str(list_B.index(w.strip())))
        else:
            item=item.replace(w,0)

Answer 1

您可以做的是創建一個字典，將列表B中的每個單詞映射到其索引。 然后，您只需要遍歷第一個列表一次。

就像是

B = ["how","yes"]
BDict = {}
index = 0
for x in B:
    Bdict[x] = index
    index += 1

for sentence in A:
     for word in sentence:
         if word in BDict:
              #BDict[word] has the index of the current word in B
         else:
              #Word does not exist in B

這將大大減少運行時間，因為字典具有O（1）訪問時間。 但是，根據B的大小，字典可能會變得很大

編輯：您的代碼有效，之所以變慢，是因為使用列表時in和index運算符必須執行線性搜索。 因此，如果B變大，則速度可能會大大降低。 但是，字典需要一定的時間來查看字典中是否存在鍵以及獲取值。 通過使用字典，您可以將2個O（n）操作替換為O（1）操作。

Answer 2

您應該定義一個函數以返回第二個列表中的單詞索引：

def get_index_of_word(word):
    try:
        return str(B.index(word) + 1)
    except ValueError:
        return '0'

然后，您可以使用嵌套列表推導來生成結果：

[' '.join(get_index_of_word(word) for word in sentence.split()) for sentence in A]

更新

from collections import defaultdict

index = defaultdict(lambda: 0, ((word, index) for index, word in enumerate(B, 1))

[' '.join(str(index[word]) for word in sentence.split()) for sentence in A]

Answer 3

您可以嘗試以下方法：

A=["hi how are you","have good day","where are you going ","do you like the place"]
A = map(lambda x:x.split(), A)
B=["how","good","where","going","like","place"]
new = [[c if d == a else 0 for c, d in enumerate(i)] for i in A for a in B]

final = map(' '.join, map(lambda x: [str(i) for i in x], new))

print final

Answer 4

這是在Python 3.x中

A=["hi how are you","have good day","where are you going ","do you like the place"]
B=["how","good","where","going","like","place"]
list(map(' '.join, map(lambda x:[str(B.index(i)+1) if i in B else '0' for i in x], [i.split() for i in A])))

輸出：

['0 1 0 0', '0 2 0', '3 0 0 4', '0 0 5 0 6']

Answer 5

您好，您的解決方案正在進行太多查找。

這是我的：

A=["hi how are you",
   "have good day",
   "where are you going ",
   "do you like the place"]

B=["how","good","where","going","like","place"]

# I assume B contains only unique elements.

gg = { word: idx for (idx, word) in enumerate(B, start=1)}
print(gg)

lookup = lambda word: str(gg.get(word, 0)) # Buils your index and gets you efficient search with proper object types.

def translate(str_):
    return ' '.join(lookup(word) for word in str_.split())        

print(translate("hi how are you")) # check for one sentence.


translated =  [translate(sentence) for sentence in A] # yey victory.

print(translated)

# Advanced usage

class  missingdict(dict):
    def __missing__(self, key):
        return 0

miss = missingdict(gg)

def tr2(str_):
    return ' '.join(str(miss[word]) for word in str_.split())


print([tr2(sentence) for sentence in A])

當您在python中更加自信時，您可能還會使用yield關鍵字。

用另一個列表中的項目索引替換列表中所有出現的單詞

問題描述

5 個解決方案

解決方案1
1 2017-07-09 15:19:59

解決方案2
1 2017-07-09 15:26:42

解決方案3
0 2017-07-09 15:26:25

解決方案4
0 2017-07-09 15:31:03

解決方案5
0 2017-07-09 15:38:20

用另一個列表中的項目索引替換列表中所有出現的單詞

問題描述

5 個解決方案

解決方案1 1 2017-07-09 15:19:59

解決方案2 1 2017-07-09 15:26:42

解決方案3 0 2017-07-09 15:26:25

解決方案4 0 2017-07-09 15:31:03

解決方案5 0 2017-07-09 15:38:20

解決方案1
1 2017-07-09 15:19:59

解決方案2
1 2017-07-09 15:26:42

解決方案3
0 2017-07-09 15:26:25

解決方案4
0 2017-07-09 15:31:03

解決方案5
0 2017-07-09 15:38:20