在Python中選擇正則表達式

Question

我有一個字符串，這是

"contributors_enabled": false, "geo_enabled": false, "created_at": "Fri Nov 11 15:38:06 +0000 2016"}, "text": "Facts On Managed Forex Trading htps:////t.co////E4cxCvvjD #forex #binaryoptions #cryptocurrency #stockmarket", "timestamp_ms": "1509073455803",.

我將使用正則表達式選擇文本：

Facts On Managed Forex Trading htps:////t.co////E4cxCvvjD #forex #binaryoptions #cryptocurrency #stockmarket

在“ text”之后：“”和“ timestamp_ms”之前：

是否可以收集這些文本？

Answer 1

可能？ 是。

def text_scrap(text, start, end):
    """This function returns the data between start and end."""
    _,_,rest = text.partition(start)
    result,_,_ = rest.partition(end)
    return result

my_text = "contributors_enabled": false, "geo_enabled": false, "created_at": "Fri Nov 11 15:38:06 +0000 2016"}, "text": "Facts On Managed Forex Trading htps:////t.co////E4cxCvvjD #forex #binaryoptions #cryptocurrency #stockmarket", "timestamp_ms": "1509073455803",.

data_scrapped = text_scrap(my_text, start=' "text": "', end="timestamp_ms") # use our new shiny function
print(data_scrapped)

好主意？ 可能不是。

您的代碼是字典，因此您可以更輕松地訪問字典的“文本”鍵。 請選中此以了解字典。

Answer 2

盡管從字符串看來，您的整個字符串似乎都可以被解析，因為它似乎是JSON。 但是，由於您正在尋找與正則表達式相關的解決方案，所以希望以下對您有用。

import re

pattern = '"text": "(.*), "timestamp_ms"'

str = """
"contributors_enabled": false, "geo_enabled": false, "created_at": "Fri Nov 11 15:38:06 +0000 2016"}, "text": "Facts On Managed Forex Trading htps:////t.co////E4cxCvvjD #forex #binaryoptions #cryptocurrency #stockmarket", "timestamp_ms": "1509073455803",.
"""

print re.findall(pattern, string=str)[0]

輸出：

Facts On Managed Forex Trading htps:////t.co////E4cxCvvjD #forex #binaryoptions #cryptocurrency #stockmarket"

在Python中選擇正則表達式

問題描述

2 個解決方案

解決方案1
0 2017-10-27 04:34:22

解決方案2
0 已采納 2017-10-27 04:47:08

在Python中選擇正則表達式

問題描述

2 個解決方案

解決方案1 0 2017-10-27 04:34:22

解決方案2 0 已采納 2017-10-27 04:47:08

解決方案1
0 2017-10-27 04:34:22

解決方案2
0 已采納 2017-10-27 04:47:08