How to extract text starting from a specific word in a string?

Question

I'm trying to extract only the address from this string, but I'm having trouble with it. The string looks like this:

1040 S. Vintage Ave.
Building A Ontario, CA 91761
United States Phone: 9099725134 Fax: 9099065401

Web: http://www.aareninc.com

I want to extract only the text that comes before the word 'Phone', that is, only the address.

I tried strip('Phone') and then took the first element of the result, but that gives me only the first character of the string.

address = contacts.strip('Phone')
print(address[0])

Answer 1

Use the split function, not strip.

address = contacts.split('Phone')
print(address[0])

This should work.
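To see why strip fails here, a minimal sketch may help (the shortened sample string and the variable names are only for illustration). str.strip treats its argument as a set of characters to trim from both ends and returns a string, whereas str.split cuts the string at the separator and returns a list.

```python
contacts = "1040 S. Vintage Ave. United States Phone: 9099725134"

# strip('Phone') removes any of the characters P, h, o, n, e from both
# ends of the string. The result is still a string, so [0] is one character.
stripped = contacts.strip('Phone')
print(stripped[0])

# split('Phone') returns a list of the pieces around the separator,
# so [0] is everything before the first 'Phone'.
pieces = contacts.split('Phone')
print(pieces[0])
```

Here the string neither starts nor ends with one of those characters, so strip returns it unchanged, which is why the original attempt printed just the first character.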

Answer 2

Suppose you have a string like this:

st = '1040 S. Vintage Ave.Building A Ontario, CA 91761 United States Phone: 9099725134 Fax: 9099065401 Web: http://www.aareninc.com'

v = st.split("Phone")
print(v[0])

This works in Python 3. In Python 2, print is a statement, so the parentheses are optional.

Answer 3

As @JonClements said, the solution is:

contacts.partition('Phone')[0]
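As a rough sketch of what partition returns (the sample strings below are made up for illustration): it always gives a 3-tuple of the text before the separator, the separator itself, and the text after it. If the separator is missing, the first element is the whole string and the other two are empty, which gives a simple way to check whether 'Phone' was found at all.

```python
with_phone = "United States Phone: 9099725134"
without_phone = "United States"

# Separator present: (text before, separator, text after)
head, sep, tail = with_phone.partition('Phone')
print(repr(head), repr(sep), repr(tail))

# Separator absent: (whole string, '', '')
head2, sep2, tail2 = without_phone.partition('Phone')
found = bool(sep2)  # False, because the separator element is empty
print(repr(head2), found)
```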

Answer 4

For this task you can use a so-called zero-length assertion (a positive lookahead in this case):

import re
text = '''1040 S. Vintage Ave.
Building A Ontario, CA 91761
United States Phone: 9099725134 Fax: 9099065401 

Web: http://www.aareninc.com'''
address = re.findall('.*(?=Phone)', text, re.DOTALL)[0]
print(address)

output

1040 S. Vintage Ave.
Building A Ontario, CA 91761
United States

Note that this raises an IndexError if text does not contain the substring Phone. Note also the re.DOTALL flag, which makes . match the newline character (\n) as well; without that flag the output would be only United States .
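One way to avoid the error when the word is missing is to use re.search and check for None before reading the match. The sketch below is only an illustration: text_before is a hypothetical helper name, and re.escape is added on the assumption that the word should be matched literally.

```python
import re

def text_before(word, text):
    """Return the text preceding the last occurrence of `word`, or None if absent."""
    # .* is greedy, so with re.DOTALL it runs to the end of the text and
    # backtracks to the last position that is followed by `word`.
    match = re.search('.*(?=' + re.escape(word) + ')', text, re.DOTALL)
    return match.group(0) if match else None

print(repr(text_before('Phone', "Ontario, CA 91761\nUnited States Phone: 9099725134")))
print(text_before('Phone', "Ontario, CA 91761"))  # no 'Phone' in the text
```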

Answer 5

I hope this works.

Tested on Python 2.7.

import re

string = r"1040 S. Vintage Ave. Building A Ontario, CA 91761 United States Phone: 9099725134 Fax: 9099065401 Web: http://www.aareninc.com"

# Split at the space that is immediately followed by "Phone:"
f = re.split(' (?=Phone:)', string)

print 'String before Phone:', f[0]

Answer 6

Using regular expressions:

import re
re.split('(Phone)', strng)
['1040 S. Vintage Ave. Building A Ontario, CA 91761 United States ',
'Phone',
': 9099725134 Fax: 9099065401 Web: http://www.aareninc.com']
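The 'Phone' element appears in that list because the pattern contains a capturing group; re.split then includes the captured text in its result. A small sketch comparing the two forms (strng is given a shortened, illustrative value here):

```python
import re

strng = "United States Phone: 9099725134"

# With a capturing group, the matched separator is kept in the list.
with_group = re.split('(Phone)', strng)
print(with_group)

# Without the group, only the surrounding pieces are returned.
without_group = re.split('Phone', strng)
print(without_group)

# Either way, the text before 'Phone' is element 0.
address = with_group[0]
```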

Answer 7

Suppose your string is defined as:

contacts = """1040 S. Vintage Ave.
Building A Ontario, CA 91761
United States Phone: 9099725134 Fax: 9099065401

Web: http://www.aareninc.com"""

Then contacts.split('Phone')[0] and contacts.partition('Phone')[0] both give you the same result.

Answer 8

You can first split to get a list of the strings on both sides of "Phone". Then use strip to remove leading and trailing whitespace.

contacts.split('Phone')[0].strip()

This works.
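As a small illustration of what the strip() call changes (the variable names are assumptions, and the last step is an optional extra rather than part of the answer above): the piece before 'Phone' ends with a space, which strip() removes, and the internal newlines can be collapsed if a one-line address is wanted.

```python
contacts = """1040 S. Vintage Ave.
Building A Ontario, CA 91761
United States Phone: 9099725134 Fax: 9099065401"""

raw = contacts.split('Phone')[0]   # ends with a trailing space
clean = raw.strip()                # leading/trailing whitespace removed

# Optional: collapse newlines and repeated spaces into single spaces.
one_line = ' '.join(clean.split())

print(repr(raw))
print(repr(clean))
print(one_line)
```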

Answer 9

You can use re.search():

import re

# s is the contact string from the question
address = re.search(r'^(.+?)\sPhone', s, flags=re.MULTILINE | re.DOTALL)
print(address.group(1))

# 1040 S. Vintage Ave.
# Building A Ontario, CA 91761
# United States
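The ? after .+ makes the quantifier lazy, so the match stops at the first 'Phone'. This only matters when the word occurs more than once; the sketch below uses a made-up string with two occurrences to show the difference from the greedy form.

```python
import re

s = "Main St Phone: 111 Alt Phone: 222"

# Lazy: captures as little as possible, up to the first ' Phone'.
lazy = re.search(r'^(.+?)\sPhone', s, flags=re.DOTALL).group(1)

# Greedy: captures as much as possible, up to the last ' Phone'.
greedy = re.search(r'^(.+)\sPhone', s, flags=re.DOTALL).group(1)

print(lazy)
print(greedy)
```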

9 answers

solution1
1 2019-02-20 11:50:10

solution2
1 2019-02-20 12:10:22

solution3
0 ACCEPTED 2019-02-20 11:54:52

solution4
0 2019-02-20 12:00:35

solution5
0 2019-02-20 12:00:52

solution6
0 2019-02-20 12:14:56

solution7
0 2019-02-20 12:38:04

solution8
0 2019-02-21 06:35:22

solution9
0 2019-02-21 06:58:13
