简体   繁体   English

如何替换文件中的单词?

[英]How to replace words in file?

In test.txt: 在test.txt中:

rt : objective
tr350rt : objective
rtrt : objective
@username : objective
@user_1236 : objective
@254test!! : objective
@test : objective
#15 : objective

My codes: 我的代码:

import re
file3 = 'C://Users/Desktop/test.txt'
rfile3 = open(file3).read()
for altext in rfile3.split("\n"):
    saltext = altext.split("\t")
    for saltword in saltext:
        ssaltword = saltword.split(" ")
        if re.search(r'^rt$', ssaltword[0]):
        print ssaltword[0], ssaltword[2]
        testreplace = open(file3, 'w').write(rfile3.replace(ssaltword[0], ""))
        if re.search(r'^@\w', ssaltword[0]):
            print ssaltword[0], ssaltword[2]
        testreplace = open(file3, 'w').write(rfile3.replace(ssaltword[0], ""))

I got: 我有:

 : objective
tr350 : objective
 : objective
@username : objective
@user_1236 : objective
@254test!! : objective
 : objective
#15 : objective

I am trying to replace only "rt" and all @ with space 我正在尝试只用空格替换“ rt”和所有@

But from my codes all "rt" were replaced and only one @ was replaced. 但是从我的代码中,所有“ rt”都被替换了,只有一个@被替换了。

I would like to get: 我想得到:

 : objective
tr350rt : objective
rtrt : objective
 : objective
 : objective
 : objective
 : objective
#15 : objective

Any suggestion? 有什么建议吗?

I think regex is overkill here: 我认为正则表达式在这里过于矫kill过正:

with open("test.txt") as in_fp, open("test2.txt", "w") as out_fp:
    for line in in_fp:
        ls = line.split()
        if ls and (ls[0].startswith("@") or ls[0] == "rt"):
            line = line.replace(ls[0], "", 1)
        out_fp.write(line)

produces 产生

localhost-2:coding $ cat test2.txt 
 : objective
tr350rt : objective
rtrt : objective
 : objective
 : objective
 : objective
 : objective
#15 : objective

Note that I've also changed it not to overwrite the original. 请注意,我还对其进行了更改,以不覆盖原始内容。

Edit: if you really want to overwrite the original in-place, then I'd read the whole thing into memory first: 编辑:如果您真的想覆盖原位,那么我首先将整个内容读入内存:

with open("test.txt") as fp:
    lines = fp.readlines()

with open("test.txt", "w") as out_fp:
    for line in lines:
        ls = line.split()
        if ls and (ls[0].startswith("@") or ls[0] == "rt"):
            line = line.replace(ls[0], "", 1)
        out_fp.write(line)
import re
with open("test.txt") as infile:
    text = infile.read()
    newtext = re.sub(r"(?m)^(?:rt\b|@\w+)(?=\s*:)", " ", text)

Explanation: 说明:

(?m)      # Turn on multiline mode
^         # Match start of line
(?:       # Either match...
 rt\b     # rt (as a complete word
|         # or
 @\w+     # @ followed by an alphanumeric "word"
)         # End of alternation
(?=\s*:)  # Assert that a colon follows (after optional whitespace)

Try this, 尝试这个,

import os

mydict = {"@":'',"rt":''}

filepath = 'C://Users/Desktop/test.txt'
s = open(filepath).read()
for k, v in mydict.iteritems():
    s = s.replace(k, v)
f = open(filepath, 'w')
f.write(s)
f.close()

Not even necessary to use regex here: 甚至不需要在这里使用正则表达式:

with open("test.txt") as file:
    lines = file.readlines()
    for line in lines:
        if (line.startswith("@") and ":" in line) or line.startswith("rt :"):
            line = " :" + line.split(":", 1)[1]

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM