[英]How can I manipulate a list into a column?
我有一些来自如下所示的 word 文件的输出:
Doc = docx2python('C:/Users/Sam/Data/Information.docx')
print(Doc.body[0])
[[['Event Info', '1)\tHalf (1 or 2)', '2)\tMinutes (on video)', '3)\tSeconds (on video)', '4)/tStaff, 0 = N/A)',]]]
我想知道如何将这些列表放入显示以下输出的列中:
Event
Half
Minutes
Seconds
Staff
像这样的东西?
Doc = docx2python('C:/Users/Sam/Data/Information.docx')
d=Doc.body[0]
# Putting some data into d for testing.
# Remove this for actual production.
d= [[['Event Info', '1)\tHalf (1 or 2)', '2)\tMinutes (on video)', '3)\tSeconds (on video)', '4)\tStaff, 0 = N/A)',]]]
# We'll need regular expressions.
import re
# Helper functions.
def startsWithADigit(x):
return re.match(r"^[0-9]", x)
def getStuffAfterPotentialTabCharacter(x):
return x.split("\t")[-1]
def getFirstWord(x):
return re.sub(r"([a-zA-Z]+).*", r'\1', x)
# Get rid of indented lists.
l=d[0][0]
# Get stuff after potential tab characters.
p=[getStuffAfterPotentialTabCharacter(x) for x in l]
# Get the first word in each record, as that seems to be requested.
q=[getFirstWord(x) for x in p]
# Print the result.
for x in q:
print(x)
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.