r - Regular expressions (RegEx) and dplyr::filter()

Question

Welcome To Ask or Share your Answers For Others

r - Regular expressions (RegEx) and dplyr::filter()

asked Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points)

r - Regular expressions (RegEx) and dplyr::filter()

I have a simple data frame that looks like this:

x <- c("aa", "aa", "aa", "bb", "cc", "cc", "cc")
y <- c(101, 102, 113, 201, 202, 344, 407)
df = data.frame(x, y)    

    x   y
1   aa  101
2   aa  102
3   aa  113
4   bb  201
5   cc  202
6   cc  344
7   cc  407

I would like to use a dplyr::filter() and a RegEx to filter out all the y observations that start with the number 1

I'm imagining that the code will look something like this:

df %>%
  filter(y != grep("^1"))

But I am getting an Error in grep("^1") : argument "x" is missing, with no default

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Answer

深蓝 · Answer 1 · 2021-10-23T19:02:51+0000

You need to double check the documentations for grepl and filter.

For grep/grepl you have to also supply the vector that you want to check in (y in this case) and filter takes a logical vector (i.e. you need to use grepl). If you want to supply an index vector (from grep) you can use slice instead.

df %>% filter(!grepl("^1", y))

Or with an index derived from grep:

df %>% slice(grep("^1", y, invert = TRUE))

But you can also just use substr because you are only interested in the first character:

df %>% filter(substr(y, 1, 1) != 1)

Categories

r - Regular expressions (RegEx) and dplyr::filter()

r - Regular expressions (RegEx) and dplyr::filter()

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags