简体   繁体   中英

Is any Machine Learning model appropriate for this dataset and desired output?

My dataset consists of video game titles from various websites, formatted in different ways. Here's my example:

"The Legend Of Zelda: Wind Waker, Nintendo"
"The Legend Of Zelda: The Wind Waker"
"The Legend Of Zelda: Wind Waker, Nintendo"
"The Legend Of Zelda: Wind Waker, Nintendo"
"Zelda: Wind Waker Hd Nintendo Wii U Game"
"The Legend Of Zelda: The Wind Waker"
"Legend Of Zelda: The Wind Waker Hd (nintendo Wii"
"The Legend Of Zelda: Wind Waker Of Game (nintendo"
"The Legend Of Zelda: The Wind Waker Nintendo Wii"
"Nintendo Wii U Game Zelda: Wind Waker Hd"
"The Legend Of Zelda: The Wind Waker Hd Wii U"
"The Legend Of Zelda: Wind Waker, Nintendo Pinterest"
"Zelda: Hd (nintendo Wii The"
"The Legend Of Zelda: The Wind Waker Hd Wii U Pinterest"
"The Legend Of Zelda: The Wind Waker Hd"
"Legend Of Zelda: Wind Waker Hd (nintendo Wii"
"The Legend Of Zelda: The Wind Waker Hd"
"The Legend Of Zelda: Wind Waker, Nintendo Wii U"
"The Legend Of Zelda Wind Hd"
"Zelda Wind Waker Hd"
"The Legend Of Zelda: Wind Waker, Nintendo Pinterest"
"The Legend Of Zelda Wind Waker Wii U Nintendo"
"Wii U The Legend Of Zelda: The Wind Waker Hd"
"Zelda: Wind Waker Hd"
"The Legend Of Zelda: The Wind Waker Hd Game Wii"
"The Legend Of Zelda: The Wind Waker Hd Nintendo Wii U"
"Zelda: Wind Waker Hd"
"The Legend Of Zelda The Wind Waker Hd Wii U"

The correct output for this data would be:

The Legend Of Zelda: The Wind Waker HD - Title

Wii U - Platform

Nintendo - Publisher

I can feed a model 100's of these datasets, with what I would then expect as the correct output, and then hope that the model "learns" for future datasets of titles what an expected output might be.

Is this something that Machine Learning can do? What model should I use? I have never done anything with ML before so I'm unsure if this is a good use case for it.

正如我在您的问题中所看到的,标题、平台和发布者(输出)是从原始数据(输入)中提取的,因此您可以使用类似于命名实体识别的内容,您应该查看文献以了解更多信息,但这是最有可能的方向。

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM