简单的python脚本，多线程？

Question

我有一个简单的 python 脚本，它从两个 .txt 文件中读取并将输出保存在一个新的文本文件中。

这是我的代码的样子：

import os

with open("users.txt", encoding="utf-8") as f:
    users = f.readlines()
users = [x.strip() for x in users] 

with open("users1.txt", encoding="utf-8") as f:
    passes = f.readlines()
passes = [x.strip() for x in passes]

results = []

for user in users:
    h = user[user.find(':') + 1:]
    for p in passes:
        if p[:p.find(':')] == h:
            results.append((user[:user.find(':')], p[p.find(':') + 1:]))
            passes.remove(p)
            break

with open('results.txt', 'w+', encoding="utf-8") as f:
    for item in results:
        f.write(f"{item[0]}:{item[1]}\n")

user.txt 大约有 3-10M 行.. 当然这需要很长时间来处理，我说 1-3 个小时。 我认为唯一的原因是脚本在单个线程中运行。 我做了一些挖掘，我没有想法。 有没有办法让这个简单的脚本在 1 个以上的线程上运行？ 它会加快这个过程吗？

谢谢！

Answer 1

如果您首先将传递转换为某种字典，它将为您节省大量时间。 我不完全理解您的代码，但似乎每个user和每个pass都嵌入了某种 ID 或密钥，您正在寻找匹配的 ID。

将通行证转换为字典

my_dictionary = { pass[:pass.find(':')] : pass for pass in passes }

那么你的外循环是：

for user in users:
    id = user[user.find(':') + 1:]
    if id in my_dictionary:
        pass = my_dictionary[id]
        .... whatever you do with user and pass ...
        del my_dictionary[id]

单个字典查找而不是嵌套循环。

简单的python脚本，多线程？

问题描述

1 个解决方案

解决方案1
2 2020-10-23 06:39:37

简单的python脚本，多线程？

问题描述

1 个解决方案

解决方案1 2 2020-10-23 06:39:37

解决方案1
2 2020-10-23 06:39:37