I have a list of lists; how do I filter it by language?

Question


I have three lists:
    id = [1,3,4]
    text = ["hello","hola","salut"]
    date = ["20-12-2020","21-04-2018","15-04-2016"]

#I then combined it all in one list:
    new_list = zip(id, text, date)

#which looks like [(1,"hello","20-12-2020"),(3,"hola","21-04-2018"),(4,"salut","15-04-2016")]

I want to delete the whole tuple if its text is not in English. To do this, I installed langid and am using langid.classify.

I ran a loop over only the text and it works, but I am unsure how to delete the whole entry, such as (3,"hola","21-04-2018"), since "hola" is not English.

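I am trying to get a new list that contains only the tuples whose text is in English. I would then like to write the output list to an XML file. For that, I made a sample XML file and used the date as the parent key, because several texts can share the same date.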

Answer 1

Try this simple for loop:

new_list = [(1,"hello","20-12-2020"),(3,"hola","21-04-2018"),(4,"salut","15-04-2016")]

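# isEnglishWord is a placeholder for your own check that a word or sentence is English
english_only = []
for x in new_list:
    if isEnglishWord(x[1]):
        # keep the whole tuple; building a new list avoids mutating new_list while looping
        english_only.append(x)
new_list = english_only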
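The check `isEnglishWord` is left abstract above. A minimal sketch of one way to fill it in follows, with the detector passed in as a function. The names `keep_language`, `detect`, and `SAMPLE_LANGS` are made up for illustration, and the lookup table only stands in for a real detector so the example runs without extra packages. It assumes `langid.classify` returns a `(language_code, score)` tuple, as shown in the comment.

```python
from typing import Callable, List, Tuple

Row = Tuple[int, str, str]  # (id, text, date)


def keep_language(rows: List[Row], detect: Callable[[str], str], lang: str = "en") -> List[Row]:
    """Return only the rows whose text (index 1) is detected as `lang`."""
    return [row for row in rows if detect(row[1]) == lang]


# Hypothetical stand-in detector so the sketch runs without extra packages.
# With langid installed, this could instead be:
#   import langid
#   detect = lambda text: langid.classify(text)[0]
SAMPLE_LANGS = {"hello": "en", "hola": "es", "salut": "fr"}


def detect(text: str) -> str:
    return SAMPLE_LANGS.get(text, "unknown")


rows = [(1, "hello", "20-12-2020"), (3, "hola", "21-04-2018"), (4, "salut", "15-04-2016")]
english_rows = keep_language(rows, detect)
print(english_rows)  # [(1, 'hello', '20-12-2020')]
```

Note that automatic language identification tends to be unreliable on very short strings such as single words, so results on real data may need a sanity check.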

Answer 2

I'm not sure how langid.classify works or what arguments it accepts, but something like this should work:

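import langid
# iterate backwards so that popping does not shift the indices still to be visited
for i in range(len(new_list) - 1, -1, -1):
    # assumed: langid.classify returns a (language_code, score) tuple
    if langid.classify(new_list[i][1])[0] != 'en':
        new_list.pop(i)

Here I'm assuming langid.classify takes a str and returns a tuple whose first element is the language code of the text (as a str, 'en' for English).

I also iterate over the indices in reverse, so removing an item never changes the position of an item we have not checked yet. Note that new_list must be a real list here, so wrap the zip call in list() first.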
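The question also asks about writing the filtered result to XML with the date as the parent key. The asker's sample XML file is not shown, so the following is only a sketch using the standard library's `xml.etree.ElementTree`. The element names `entries`, `date`, and `text`, the helper `rows_to_xml`, and the second sample row are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def rows_to_xml(rows):
    """Group (id, text, date) rows under one <date> element per distinct date."""
    by_date = defaultdict(list)
    for row_id, text, date in rows:
        by_date[date].append((row_id, text))

    root = ET.Element("entries")  # assumed root element name
    for date, items in by_date.items():
        date_el = ET.SubElement(root, "date", value=date)
        for row_id, text in items:
            text_el = ET.SubElement(date_el, "text", id=str(row_id))
            text_el.text = text
    return root


# Hypothetical filtered rows; the second one is invented to show two texts sharing a date.
english_rows = [(1, "hello", "20-12-2020"), (7, "goodbye", "20-12-2020")]
root = rows_to_xml(english_rows)
xml_text = ET.tostring(root, encoding="unicode")
print(xml_text)
```

To save the result to disk, `ET.ElementTree(root).write("output.xml", encoding="utf-8", xml_declaration=True)` should do it, where `output.xml` is a placeholder file name.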


Answer 1
2 2020-09-02 06:01:21

Answer 2
1 2020-09-02 06:02:11
