Split string if separator is not in-between two characters

Question

I want to write a script that reads from a csv file and splits each line by comma except any commas in-between two specific characters.

In the below code snippet I would like to split line by commas except the commas in-between two $ s.

line = "$abc,def$,$ghi$,$jkl,mno$"

output = line.split(',')

for o in output:
   print(o)

How do I write output = line.split(',') so that I get the following terminal output?

~$ python script.py
$abc,def$
$ghi$
$jkl,mno$

Answer 1

One solution (maybe not the most elegant but it will work) is to replace the string $,$ with something like $,,$ and then split ,, . So something like this

output = line.replace('$,$','$,,$').split(',,')

Using regex like mousetail suggested is the more elegant and robust solution but requires knowing regex (not that anyone KNOWS regex)

Answer 2

You can do this with a regular expression:

In re, the (?<!\$) will match a character not immediately following a $ .

Similarly, a (?!\$) will match a character not immediately before a dollar.

The | character cam match multiple options. So to match a character where either side is not a $ you can use:

expression = r"(?<!\$),|,(?!\$)"

Full program:

import re
expression = r"(?<!\$),|,(?!\$)"
print(re.split(expression, "$abc,def$,$ghi$,$jkl,mno$"))

Answer 3

Try regular expressions :

import re

line = "$abc,def$,$ghi$,$jkl,mno$"

output = re.findall(r"\$(.*?)\$", line)

for o in output:
    print('$'+o+'$')

$abc,def$
$ghi$
$jkl,mno$

Answer 4

First, you can identify a character that is not used in that line:

c = chr(max(map(ord, line)) + 1)

Then, you can proceed as follows:

line.replace('$,$', f'${c}$').split(c)

Here is your example:

>>> line = '$abc,def$,$ghi$,$jkl,mno$'
>>> c = chr(max(map(ord, line)) + 1)
>>> result = line.replace('$,$', f'${c}$').split(c)
>>> print(*result, sep='\n')
$abc,def$
$ghi$
$jkl,mno$

Answer 5

Solution using regex:

import re

output = re.split('(?<=\$),(?=\$)', line)

for o in output:
    print(o)

Explanation: regex expression (?<=\$),(?=\$) splits the string by commas that are between two dollar ( $ ) signs, but keeps $ signs in the parts of string after the splitting. See also Regex Lookahead and Lookbehind .

Split string if separator is not in-between two characters

Question

4 answers

solution1
1 2022-07-22 09:39:04

solution2
1 2022-07-22 09:40:27

solution3
1 2022-07-22 09:43:44

solution4
0 2022-07-22 09:49:12

solution5
0 2022-07-22 09:52:58

Split string if separator is not in-between two characters

Question

4 answers

solution1 1 2022-07-22 09:39:04

solution2 1 2022-07-22 09:40:27

solution3 1 2022-07-22 09:43:44

solution4 0 2022-07-22 09:49:12

solution5 0 2022-07-22 09:52:58

solution1
1 2022-07-22 09:39:04

solution2
1 2022-07-22 09:40:27

solution3
1 2022-07-22 09:43:44

solution4
0 2022-07-22 09:49:12

solution5
0 2022-07-22 09:52:58