<p>I am learning regex but have not been able to find the right regex in python for selecting characters that start with a particular alphabet.</p> <p>Example below</p> <pre class="prettyprint"><code>text='this is a test' match=re.findall('(?!t)\w*',text) # match returns ['his', '', 'is', '', 'a', '', 'est', ''] match=re.findall('[^t]\w+',text) # match ['his', ' is', ' a', ' test'] </code></pre> <p>Expected : <code>['is','a']</code></p>

<h3>With regex</h3> <p>Use the negative set <code>[^\Wt]</code> to match any alphanumeric character that is not <em>t</em>. To avoid matching subsets of words, add the word boundary metacharacter, <code>\b</code>, at the beginning of your pattern.</p> <p>Also, do not forget that you should use raw strings for regex patterns.</p> <pre class="prettyprint"><code>import re text = 'this is a test' match = re.findall(r'\b[^\Wt]\w*', text) print(match) # prints: ['is', 'a'] </code></pre> <p>See the demo here.</p> <h3>Without regex</h3> <p>Note that this is also achievable without regex.</p> <pre class="prettyprint"><code>text = 'this is a test' match = [word for word in text.split() if not word.startswith('t')] print(match) # prints: ['is', 'a'] </code></pre>

Match words that don't start with a certain letter using regex

Tags:

python

regex

regex-negation

regex-lookarounds

I am learning regex but have not been able to find the right regex in python for selecting characters that start with a particular alphabet.

Example below

text='this is a test'
match=re.findall('(?!t)\w*',text)

# match returns
['his', '', 'is', '', 'a', '', 'est', '']

match=re.findall('[^t]\w+',text)

# match
['his', ' is', ' a', ' test']

Expected : ['is','a']

329

asked May 16 '18 15:05

Priya

1 Answers

With regex

Use the negative set [^\Wt] to match any alphanumeric character that is not t. To avoid matching subsets of words, add the word boundary metacharacter, \b, at the beginning of your pattern.

Also, do not forget that you should use raw strings for regex patterns.

import re

text = 'this is a test'
match = re.findall(r'\b[^\Wt]\w*', text)

print(match) # prints: ['is', 'a']

See the demo here.

Without regex

Note that this is also achievable without regex.

text = 'this is a test'
match = [word for word in text.split() if not word.startswith('t')]

print(match) # prints: ['is', 'a']

111

answered Sep 19 '22 16:09

Olivier Melançon

Related questions
                            
                                import scipy error: cannot import name '_ccallback_c'
                            
                                Group list of dictionaries by value [duplicate]
                            
                                ValueError: `decode_predictions` expects a batch of predictions (i.e. a 2D array of shape (samples, 1000)). Found array with shape: (1, 7)
                            
                                Python Matplotlib - Plotting cuboids
                            
                                Using sklearn StandardScaler on only select columns
                            
                                PEP 3106 suggests slower way? Why?
                            
                                Parsing elements from list of list of strings
                            
                                Find period of a signal out of the FFT
                            
                                What is the recommended way to serialize a collection of spaCy Docs?
                            
                                python 'module' object is not callable when calling a function
                            
                                get-pip.py broken on Windows 10
                            
                                OpenCV Masking Image - error: (-215) (mtype == 0 || mtype == 1) && _mask.sameSize(*psrc1) in function cv::binary_op
                            
                                Add labels to Seaborn bivariate KDE plot
                            
                                Anaphora resolution in stanford-nlp using python
                            
                                How to initialize variables defined in tensorflow function?
                            
                                How to find an optimum number of processes in GridSearchCV( ..., n_jobs = ... )?
                            
                                NumPy: Where in the source code are `arange` and `array` functions defined?
                            
                                How to replace accents in a column of a pandas dataframe
                            
                                Django aggregate(sum error
                            
                                Python set operations - complement union of set

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With