简体   繁体   English

使用python从纯文本中提取包括空值的字符串

[英]extract string including empty values from a plain text using python

I have a string 我有一个弦

a='S
LINC             SHORT LEGAL                                   TITLE NUMBER
0037 471 661     1720278;16;21                                 172 211 342

LEGAL DESCRIPTION
PLAN 1720278  
BLOCK 16  
LOT 21  
EXCEPTING THEREOUT ALL MINES AND MINERALS  

ESTATE: FEE SIMPLE  
ATS REFERENCE: 4;24;54;2;SW

MUNICIPALITY: CITY OF EDMONTON

REFERENCE NUMBER: 172 023 641 +71

---------------------------------------------------------------------------- 
----
                     REGISTERED OWNER(S)
REGISTRATION    DATE(DMY)  DOCUMENT TYPE      VALUE           CONSIDERATION
----------------------------------------------------------------------------- 
---

172 211 342    15/08/2017                      $610,000        CASH & MTGE'

Need to extract values below document type, value and consideration and output in an array like ['','$610,000','CASH & MTGE'] I tried using findall(r'(?<!\\S)(?:[$]\\S+|[^$\\d]+)\\b', a) . 需要提取低于文档类型,值和注意事项的值,并以['','$610,000','CASH & MTGE']等数组的形式输出,我尝试使用findall(r'(?<!\\S)(?:[$]\\S+|[^$\\d]+)\\b', a) But I could only get ['$610,000','CASH & MTGE'] and no value for document type since it is empty. 但是我只能得到['$610,000','CASH & MTGE'] ,文档类型没有值,因为它为空。

From what I understood, you would like to return an Array with the values $610,000 CASH & MTGE' from the string right? 据我了解,您想从字符串中返回一个值为$610,000 CASH & MTGE'的数组吗?

Assuming the required string value would remain in the end, we can leverage the splitlines function. 假设所需的字符串值将保留在最后,我们可以利用splitlines函数。 Then, use len(a)-1 to fetch the required string something like this: 然后,使用len(a)-1来获取所需的字符串,如下所示:

>>> a='''S
LINC             SHORT LEGAL                                   TITLE NUMBER
0037 471 661     1720278;16;21                                 172 211 342

LEGAL DESCRIPTION
PLAN 1720278  
BLOCK 16  
LOT 21  
EXCEPTING THEREOUT ALL MINES AND MINERALS  

ESTATE: FEE SIMPLE  
ATS REFERENCE: 4;24;54;2;SW

MUNICIPALITY: CITY OF EDMONTON

REFERENCE NUMBER: 172 023 641 +71

---------------------------------------------------------------------------- 
----
                     REGISTERED OWNER(S)
REGISTRATION    DATE(DMY)  DOCUMENT TYPE      VALUE           CONSIDERATION
----------------------------------------------------------------------------- 
---

172 211 342    15/08/2017                      $610,000        CASH & MTGE'''

>>> b=a.splitlines()
>>> req_line = b[len(b)-1]
>>> print(req_line)

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM