Match only IP addresses, not other numbers

Question

I want the following regex code to output only the IP address, without returning other numbers from the source text as IPs.

Code:

import re

logdata = '146.204.224.152 - feest6811 [21/Jun/2019:15:45:24 -0700] "POST /incentivize HTTP/1.1" 302 4622'
for item in re.finditer(r"(?P<host>[\d.]+)", logdata):
    print(item.groupdict())

Desired output:

{'host': '146.204.224.152'}

Unwanted output:

{'host': '6811'}

Answer 1

I think this should do it:

(?P<host>(\d+\.){3}\d+)
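A minimal sketch of how this pattern might be dropped into the question's loop, reusing the question's log line and variable name. Note that `\d+` does not check that each octet lies in the range 0-255, so a string such as 999.999.999.999 would also match.

```python
import re

logdata = '146.204.224.152 - feest6811 [21/Jun/2019:15:45:24 -0700] "POST /incentivize HTTP/1.1" 302 4622'

# Three "digits followed by a dot" groups, then a final run of digits.
pattern = re.compile(r"(?P<host>(\d+\.){3}\d+)")

# groupdict() reports only named groups, so the inner (\d+\.) group is not listed.
hosts = [m.groupdict() for m in pattern.finditer(logdata)]
print(hosts)
```

Because the pattern requires exactly three dots between digit runs, fragments such as 6811 or 1.1 no longer qualify.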

Answer 2

Use

import re
logdata = r'146.204.224.152 - feest6811 [21/Jun/2019:15:45:24 -0700] "POST /incentivize HTTP/1.1" 302 4622'
for item in re.finditer(r"\b(?P<host>(?:25[0-5]|2[0-4][0-9]|[01]?[0-9][0-9]?)(?:\.(?:25[0-5]|2[0-4][0-9]|[01]?[0-9][0-9]?)){3})\b", logdata):
    print(item.groupdict())

See Python proof.

Result: {'host': '146.204.224.152'}.

See also: extracting an IP address from a string using regex.
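As an alternative to encoding the 0-255 range in the regex itself, the validation could be delegated to the standard-library `ipaddress` module. This is a different technique from the pattern above, sketched here under the assumption that a loose regex is acceptable for finding candidates; the helper name `find_ipv4` is hypothetical.

```python
import ipaddress
import re

# Loose candidate pattern: four dot-separated groups of 1-3 digits.
CANDIDATE = re.compile(r"\b\d{1,3}(?:\.\d{1,3}){3}\b")


def find_ipv4(text):
    """Return the candidates in text that ipaddress accepts as IPv4 addresses."""
    found = []
    for candidate in CANDIDATE.findall(text):
        try:
            ipaddress.IPv4Address(candidate)
        except ValueError:
            continue  # rejected, e.g. an octet above 255
        found.append(candidate)
    return found


logdata = '146.204.224.152 - feest6811 [21/Jun/2019:15:45:24 -0700] "POST /incentivize HTTP/1.1" 302 4622'
print(find_ipv4(logdata))
```

This keeps the regex short at the cost of a second validation step per candidate.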

To get the host and time from a log line like yours:

import re
logdata = r'146.204.224.152 - feest6811 [21/Jun/2019:15:45:24 -0700] "POST /incentivize HTTP/1.1" 302 4622'
match_data = re.search(r'^(?P<host>\S+).*?\[(?P<time>.*?)]', logdata)
if match_data:
    print(match_data.groupdict())

See Python proof.

Explanation

--------------------------------------------------------------------------------
  ^                        the beginning of the string
--------------------------------------------------------------------------------
  (?P<host>                  group and capture to (?P=host):
--------------------------------------------------------------------------------
    \S+                      non-whitespace (all but \n, \r, \t, \f,
                             and " ") (1 or more times (matching the
                             most amount possible))
--------------------------------------------------------------------------------
  )                        end of (?P=host)
--------------------------------------------------------------------------------
  .*?                      any character except \n (0 or more times
                           (matching the least amount possible))
--------------------------------------------------------------------------------
  \[                       '['
--------------------------------------------------------------------------------
  (?P<time>                  group and capture to (?P=time):
--------------------------------------------------------------------------------
    .*?                      any character except \n (0 or more times
                             (matching the least amount possible))
--------------------------------------------------------------------------------
  )                        end of (?P=time)
--------------------------------------------------------------------------------
  ]                        ']'
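The same breakdown can be kept next to the pattern itself by compiling it with `re.VERBOSE`, which ignores whitespace in the pattern and treats `#` as the start of a comment. A minimal sketch, assuming the question's log line; the names `LOG_LINE` and `fields` are illustrative.

```python
import re

logdata = '146.204.224.152 - feest6811 [21/Jun/2019:15:45:24 -0700] "POST /incentivize HTTP/1.1" 302 4622'

LOG_LINE = re.compile(
    r"""
    ^                # beginning of the string
    (?P<host>\S+)    # host: one or more non-whitespace characters
    .*?              # as few characters as possible, up to the bracket
    \[               # a literal opening square bracket
    (?P<time>.*?)    # time: as few characters as possible
    ]                # a literal closing square bracket
    """,
    re.VERBOSE,
)

match = LOG_LINE.search(logdata)
fields = match.groupdict() if match else {}
print(fields)
```

Note that `\S+` captures whatever the first whitespace-delimited token is; it does not check that the token is an IP address.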

Solution 1
1 Accepted 2021-03-26 21:53:13

Solution 2
1 2021-03-26 22:22:59
