简体   繁体   中英

How can I remove certain part of row names in data frame

I have a data set with the following format:

ID                         | Value
-------------------------- | -------------------------------
AAA1|404744                | 1.7554
ANKHD1-EIF4EBP3|404734     | 0.5174     
HLA-B|3106                 | 11.7659               
HLA-A|3105                 | 18.0851  

What I want is removing certain part of the row names like this:

ID                    | Value
--------------------- | -------------------------------
AAA1                  | 1.7554
ANKHD1-EIF4EBP3       | 0.5174     
HLA-B                 | 11.7659               
HLA-A                 | 18.0851  

Thanks a lot!

We can do this with sub . Match the | (a metacharacter implies or - so either escape \\\\| it or place it in brackets to get the literal character) followed by characters ( .* ) and replace it with blank ( "" )

df$ID <- sub("[|].*", "", df$ID)

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM