从每一行的文本文件中提取子字符串？

Question

Is there a way to extract substrings from a textfile from each like eg Say this is the text file but with alot more lines like this:有没有办法从每个文本文件中提取子字符串，例如说这是文本文件，但有更多这样的行：

president, Donald Trump, 74, USA

Priminster, Boris Johnson, 56, UK

I would need to loop through each line and get substrings which are split by commas.我需要遍历每一行并获取用逗号分隔的子字符串。 So the that the substring would be Donald Trump, 74 and so on for the other lines.所以 substring 将是Donald Trump, 74等等其他线路。

Answer 1

Here you go:这里是 go：

with open('data.file') as f:
    for line in f:
        parts = line.split(', ')
        if len(parts) == 4:
            print(', '.join(parts[1:3]).strip())

Output: Output：

Donald Trump, 74
Boris Johnson, 56

Answer 2

You can use split, for splitting string at a specific character.您可以使用 split 来在特定字符处拆分字符串。 You will get a list, that you can join later on.您将获得一个列表，您可以稍后加入。 Reading a file is easy.读取文件很容易。

with open('filename.txt', 'r') as rf:
    lines = rf.readlines()

For this specific example you can do对于这个特定的例子，你可以做

for line in lines:
    line = line.strip()
    row  = "{}, {}".format(line.split(',')[1], line.split(',')[2])
    print(row)

Otherwise, please be more clear about what you would like to achieve.否则，请更清楚您想要实现的目标。

Answer 3

You could do it easily using simple split() and join() methods of string in python -您可以使用 python 中字符串的简单split()和join()方法轻松完成 -

Working Code -工作代码 -

# You could open your file like this
#file1 = open('myfile.txt', 'r') 

# For now I am assuming your file contains the following line of data. 
# You could uncomment above line and use.

file1 = ['president, Donald Trump, 74, USA','president, Donald Trump, 74, USA']
for line in file1: 
    print("".join(line.split(',')[1:3]))

Output: Output：

Donald Trump, 74
Donald Trump, 74

Explanation解释

Basically you are just splitting the string ( each line in file ) at comma and converting the string into array.基本上你只是用逗号分割字符串（文件中的每一行）并将字符串转换为数组。 So line.split(',') will give -所以line.split(',')会给 -
```
 ['president', ' Donald Trump', ' 74', ' USA']
```
Now, we are just joining the 2nd and the 3rd element of the list obtained in the above step.现在，我们只是加入在上述步骤中获得的列表的第二个和第三个元素。 This is done by ",".join() which will join each elements of list with ',' .这是由",".join()完成的，它将用','连接列表的每个元素。
Also, note that we have used [1:3] which will select only the 1st and the 2nd element from the list.另外，请注意，我们使用了[1:3] ，它将 select 仅是列表中的第一个和第二个元素。 So they will give the result which is displayed above所以他们会给出上面显示的结果

Hope this helps !希望这可以帮助！

Answer 4

Open the file, read the file line by line, then use pythons string.split method with a delimiter of a comma to get a list of words you can filter through.打开文件，逐行读取文件，然后使用带有逗号分隔符的string.split方法获取可以过滤的单词列表。

with open('filename.txt', 'r') as my_file:
    line = my_file.readline()
    while line:
        word_list = line.split(',')
        print(f'{word_list[1]}, {word_list[2]}')
        line = my_file.readline()

Answer 5

Try this:尝试这个：

lst = []
with open("textfile.txt", "r") as file:
  for line in file:
    stripped_line = line.strip()
    #to save it as a list
    lst.append(stripped_line.split(",")[1:-1])
print(lst)

#to print each of the element
for i in lst:
    print(",".join(i))

从每一行的文本文件中提取子字符串？

问题描述

5 个解决方案

解决方案1
1 2020-07-06 19:29:14

解决方案2
0 2020-07-06 19:23:28

解决方案3
0 已采纳 2020-07-06 19:24:04

Working Code -工作代码 -

Explanation解释

解决方案4
0 2020-07-06 19:29:59

解决方案5
0 2020-07-06 19:45:09

从每一行的文本文件中提取子字符串？

问题描述

5 个解决方案

解决方案1 1 2020-07-06 19:29:14

解决方案2 0 2020-07-06 19:23:28

解决方案3 0 已采纳 2020-07-06 19:24:04

Working Code -工作代码 -

Explanation解释

解决方案4 0 2020-07-06 19:29:59

解决方案5 0 2020-07-06 19:45:09

解决方案1
1 2020-07-06 19:29:14

解决方案2
0 2020-07-06 19:23:28

解决方案3
0 已采纳 2020-07-06 19:24:04

解决方案4
0 2020-07-06 19:29:59

解决方案5
0 2020-07-06 19:45:09