Regular expression for matching specific words in a string

Question

I'm looking for a regular expression to match the beginning of specific words throughout a string. Say I have this:

This is the example string of text.

I would be looking to match each T that starts a word:

[T]his is [t]he example string of [t]ext.

Any help would be much appreciated.

Answer 1

Trying it in irb (Ruby):

irb(main):001:0> "This is the example string of text.".scan(/\bt\w+/i)
=> ["This", "the", "text"]

In /\bt\w+/i, \b is a word boundary, t is the letter the word must start with, and \w+ is one or more word characters (letters, digits, or underscore). The trailing i makes the match case-insensitive.

If you want only letters, and want a lone t to match as well, you can use

irb(main):002:0> "This is the example string of text.".scan(/\bt[a-z]*/i)
=> ["This", "the", "text"]

[a-z] is the character class covering a to z, and * means 0 or more occurrences.
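
On the example sentence both patterns return the same list, so the difference is easier to see on another input. The sketch below uses a made-up sample string (an assumption, not from the question) that contains a lone "t" and a word with a digit. It also tries a third variant with a trailing \b, which is an addition here and not part of the answer above.

```ruby
# Hypothetical sample input: includes a lone "t" and the token "t2".
sample = "Take a t or t2 to town"

# \w+ needs at least one more word character, so the lone "t" is skipped,
# but digits count as word characters, so "t2" is matched whole.
with_w = sample.scan(/\bt\w+/i)
# => ["Take", "t2", "to", "town"]

# [a-z]* allows zero letters, so the lone "t" matches; for "t2" only the
# leading "t" is matched, because the digit is outside [a-z].
letters_only = sample.scan(/\bt[a-z]*/i)
# => ["Take", "t", "t", "to", "town"]

# Adding a closing \b rejects "t2" entirely instead of matching part of it.
letters_whole = sample.scan(/\bt[a-z]*\b/i)
# => ["Take", "t", "to", "town"]

p with_w, letters_only, letters_whole
```

Whether a partial match such as the "t" of "t2" is acceptable depends on the input you expect, so pick the variant that fits your data.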

Answer 2

The \b metacharacter matches at a "word boundary": the position between a word character and a non-word character, or at the start or end of the string. By using \b at the beginning of your regular expression you can match whatever you want, as long as it starts a word.

In your example: \b[tT]
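
As a rough illustration of this pattern in Ruby, the sketch below runs \b[tT] against the sentence from the question. The bracket marking done with gsub is only one assumed way to display the matched letters.

```ruby
text = "This is the example string of text."

# Only the single letter at the start of each word is matched;
# the "t" inside "string" and the final "t" of "text" are not.
starts = text.scan(/\b[tT]/)
# => ["T", "t", "t"]

# Wrap each matched letter in brackets to show where the matches fall.
marked = text.gsub(/\b[tT]/) { |m| "[#{m}]" }
# => "[T]his is [t]he example string of [t]ext."

p starts
puts marked
```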

Similarly, if you wish to match the letter T at the end of words, place the \b at the end of the regular expression: [tT]\b
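
A minimal sketch of the end-of-word case, again in Ruby. The second input string is a made-up sample, and the \w* prefix is an assumption added here to capture the whole word instead of only its final letter.

```ruby
text = "This is the example string of text."

# Only "text" ends in t; the "t" in "string" is followed by another letter.
ending_letters = text.scan(/[tT]\b/)
# => ["t"]

# Hypothetical sample with several words that end in t.
sample = "Start at the last text"

# \w* pulls in the rest of the word before the final t.
ending_words = sample.scan(/\w*[tT]\b/)
# => ["Start", "at", "last", "text"]

p ending_letters, ending_words
```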

2 answers

solution1
1 ACCEPTED 2014-01-13 07:45:39

solution2
1 2014-01-13 07:50:16
