Punctuation not detected between words with no space

Question

How can I split sentences, when punctuation is detected (.?!) and occurs between two words without a space?

Example:

>>> splitText = re.split("(?<=[.?!])\s+", "This is an example. Not 
    working as expected.Because there isn't a space after dot.")

output:

['This is an example.', 
"Not working as expected.Because there isn't a space after dot."]

expected:

['This is an example.', 
'Not working as expected.', 
'Because there isn't a space after dot.']`

Answer 1

splitText = re.split("[.?!]\s*", "This is an example. Not working as expected.Because there isn't a space after dot.")

+ is used for 1 or more of something, * for zero of more.

if you need to keep the . you probably don't want to split, instead you could do:

splitText = re.findall(".*?[.?!]", "This is an example. Not working as expected.Because there isn't a space after dot.")

which gives

['This is an example.',
 ' Not working as expected.',
 "Because there isn't a space after dot."]

you can trim those by playing with the regex (eg '\\s*.*?[.?!]' ) or just using .trim()

Answer 2

Use https://regex101.com/r/icrJNl/3/ .

import re
from pprint import pprint

split_text = re.findall(".*?[?.!]", "This is an example! Working as "
                        "expected?Because.")

pprint(split_text)

Note: .*? is a lazy (or non-greedy) quantifier in opposite to .* which is a greedy quantifier.

Output:

['This is an example!', 
 ' Working as expected?', 
 'Because.']

Another solution:

import re
from pprint import pprint

split_text = re.split("([?.!])", "This is an example! Working as "
    "expected?Because.")

pprint(split_text)

Output:

['This is an example', 
'!', 
' Working as expected', 
'?', 
'Because', 
'.', 
'']

Punctuation not detected between words with no space

Question

2 answers

solution1
1 ACCPTED 2017-06-30 11:13:16

solution2
0 2017-06-30 11:21:05

Punctuation not detected between words with no space

Question

2 answers

solution1 1 ACCPTED 2017-06-30 11:13:16

solution2 0 2017-06-30 11:21:05

solution1
1 ACCPTED 2017-06-30 11:13:16

solution2
0 2017-06-30 11:21:05