Welcome to OStack Knowledge Sharing Community for programmer and developer-Open, Learning and Share
matlab - Removing stop words from single string

My query is string = 'Alligator in water', where in is a stop word. How can I remove it so that I get stop_remove = 'Alligator water' as output? I have tried ismember, but it only returns a value indicating which word matched; I want to get the remaining words as output.

in is just an example; I'd like to remove all possible stop words.


1 Answer


A slightly more elegant way than Luis Mendo's solution is to use regexprep, which does exactly what you want:

>> result = regexprep( 'Alligator in water', 'in\s*', '' ); % replace 'in' plus trailing whitespace with an empty string
result =    
   Alligator water
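
One caveat: a pattern that matches the bare letters in will also match them inside longer words. A minimal sketch of a safer variant, assuming stop words should only be removed when they stand as whole words, uses MATLAB's word-boundary anchors \< and \>. The sample string is made up for illustration:

```matlab
% Hypothetical sample string in which 'in' also occurs inside other words.
str = 'Penguin in rain';

% \< and \> anchor the match to the start and end of a word, so only the
% standalone word 'in' (plus any whitespace after it) is removed.
bounded = regexprep(str, '\<in\>\s*', '');

disp(bounded)   % Penguin rain
```

If matching should ignore case (for example 'In' at the start of a sentence), passing 'ignorecase' as an extra argument to regexprep is one option.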

If you have several stop words you can simply add them to the pattern (in this example I consider 'in' and 'near' as stop words):

>> result = regexprep( 'Alligator in water near land', {'in\s*','near\s*'}, '' )
result =
   Alligator water land
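
For a longer stop-word list, writing each pattern by hand gets tedious. The sketch below is one possible way to handle it, not part of the original answer: it builds a single alternation pattern from a cell array, then shows a token-based alternative with ismember, whose logical output can be used as a mask. The variable names and the stopWords list are hypothetical.

```matlab
str = 'Alligator in water near land';
stopWords = {'in', 'near'};              % hypothetical stop-word list

% Option 1: one regular expression of the form \<(in|near)\>\s*
pattern = ['\<(' strjoin(stopWords, '|') ')\>\s*'];
resultRegex = strtrim(regexprep(str, pattern, ''));

% Option 2: split into words, keep those that are not stop words, rejoin.
words = strsplit(str, ' ');
keep = ~ismember(words, stopWords);      % logical mask, true = keep the word
resultTokens = strjoin(words(keep), ' ');

disp(resultRegex)    % Alligator water land
disp(resultTokens)   % Alligator water land
```

Option 1 assumes the stop words contain no regular-expression metacharacters; if they might, escaping them first with regexptranslate('escape', ...) is a reasonable precaution. Option 2 compares whole tokens, so punctuation attached to a word (such as 'in,') would prevent a match.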
