创建字典并用R替换拉丁词

Question

I have dataset with latin words 我有带有拉丁语单词的数据集

text<-c("TESS",
"MAG")

I want to set transliteration from latin-cyrillic 我想设置拉丁西里尔字母的音译

library(stringi)
d=stri_trans_general(mydat$text, "latin-cyrillic")

But I want to manually create the translit dictionary. 但是我想手动创建翻译字典。 For example: 例如：

dictionary<-c("Tess"="ТЕСС"
"MAG"="МАГ"
.......
......
)

when dictionary is created, in mydat$text,all latin words must be replaced by cyrillic words, which i set. 创建字典时，在mydat $ text中，所有拉丁词都必须替换为我设置的西里尔字母。 something like this 像这样的东西

d=dictionary(mydat$text)

How perform such replacing? 如何进行这种替换？

input 输入

text<-c("TESS",
"MAG")

file with translit 带翻译的文件

dict=path.csv

it containt 它包含

dict=

structure(list(old = structure(c(2L, 1L), .Label = c("mag", "tess"
), class = "factor"), new = structure(c(2L, 1L), .Label = c("маг", 
"тесс"), class = "factor")), .Names = c("old", "new"), class = "data.frame", row.names = c(NA, 
-2L))

#output #output

text<-c("ТЕСС",
"МАГ")

that's all 就这样

Answer 1

There you go! 你去！

dict <- structure(list(
  old = structure(c(2L, 1L), .Label = c("mag", "tess"),class = "factor"),
  new = structure(c(2L, 1L), .Label = c("маг", "тесс"), class = "factor")),
  .Names = c("old", "new"), class = "data.frame", row.names = c(NA, -2L))

input<-c("TESS","MAG")

output <- with(lapply(dict,as.character), new[match(tolower(input),old)])
output
# [1] "тесс" "маг"

创建字典并用R替换拉丁词

问题描述

input 输入

file with translit 带翻译的文件

1 个解决方案

解决方案1
1 已采纳 2018-10-07 12:09:42

创建字典并用R替换拉丁词

问题描述

input 输入

file with translit 带翻译的文件

1 个解决方案

解决方案1 1 已采纳 2018-10-07 12:09:42

解决方案1
1 已采纳 2018-10-07 12:09:42