regex - python re.split() to split by spaces, commas, and periods, but not in cases like 1,000 or 1.50

Question

Welcome To Ask or Share your Answers For Others

regex - python re.split() to split by spaces, commas, and periods, but not in cases like 1,000 or 1.50

asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points)

regex - python re.split() to split by spaces, commas, and periods, but not in cases like 1,000 or 1.50

I want to use python re.split() to split a string into individual words by spaces, commas and periods. But I don't want "1,200" to be split into ["1", "200"] or ["1.2"] to be split into ["1", "2"].

Example

l = "one two 3.4 5,6 seven.eight nine,ten"

The result should be ["one", "two", "3.4", "5,6" , "seven", "eight", "nine", "ten"]

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Answer

深蓝 · Answer 1 · 2021-10-23T17:44:43+0000

Use a negative lookahead and a negative lookbehind:

> s = "one two 3.4 5,6 seven.eight nine,ten"
> parts = re.split('s|(?<!d)[,.](?!d)', s)
['one', 'two', '3.4', '5,6', 'seven', 'eight', 'nine', 'ten']

In other words, you always split by s (whitespace), and only split by commas and periods if they are not followed (?!d) or preceded (?<!d) by a digit.

DEMO.

EDIT: As per @verdesmarald comment, you may want to use the following instead:

> s = "one two 3.4 5,6 seven.eight nine,ten,1.2,a,5"
> print re.split('s|(?<!d)[,.]|[,.](?!d)', s)
['one', 'two', '3.4', '5,6', 'seven', 'eight', 'nine', 'ten', '1.2', 'a', '5']

This will split "1.2,a,5" into ["1.2", "a", "5"].

DEMO.

Categories

regex - python re.split() to split by spaces, commas, and periods, but not in cases like 1,000 or 1.50

regex - python re.split() to split by spaces, commas, and periods, but not in cases like 1,000 or 1.50

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags