繁体 English 中英

非结构化数据的文本分析

[英]Text analysis for unstructured data

原文 2019-03-07 09:41:21 5 1 python/ classification/ naivebayes

我有一个问题，我确实有大量的非结构化文本数据，我想将其分类为不同的扇区。

我正在为此使用朴素贝叶斯分类器

现在，我的问题是我应该通过什么？ 因为我没有目标值

并且根据语法我必须通过它。

mnb = MultinomialNB()

mnb.fit(X,y)

TypeError: fit() missing 1 required positional argument: 'y'

如我所说，我没有目标价值。

我怎样才能做到这一点？

帮助将不胜感激

1 个解决方案

朴素贝叶斯分类器是一种有监督的学习方法，它要求您使用预先知道目标的带标签数据进行训练。 然后，您可以将其用于未标记的数据以预测将来的值，但不能针对没有目标值的数据进行训练。

在不了解您的任务的情况下很难推荐一种不同的方法，但是听起来您想研究无监督的聚类算法。 k均值是一个相对简单的起点。

非结构化文本到结构化数据

[英]Unstructured Text to Structured Data

Python 如何处理来自文本文件的非结构化数据

[英]Python How to Handle Data Unstructured From Text File

从非结构化文本中提取特定类型的数据，即研究所

[英]Extracting a particular type of data from unstructured text namely Institutes

从文本文件中读取（有点）非结构化数据以创建 Python 字典

[英]Reading (somewhat) unstructured data from a text file to create Python Dictionary

使用Python从大型非结构化文本文件中提取数据元素

[英]Extracting data elements from large unstructured text files with Python

使用 Python 从 docx 中提取非结构化数据/文本

[英]Exracting unstructured data/text from docx using Python

在Python中解析非结构化文本

[英]Parsing unstructured text in Python

解析 python 中的非结构化文本

[英]Parse unstructured text in python

改进非结构化文本的解析

[英]Improving parsing of unstructured text

读取非结构化数据熊猫

[英]Read unstructured data pandas

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 非结构化文本到结构化数据 Python 如何处理来自文本文件的非结构化数据从非结构化文本中提取特定类型的数据，即研究所从文本文件中读取（有点）非结构化数据以创建 Python 字典使用Python从大型非结构化文本文件中提取数据元素使用 Python 从 docx 中提取非结构化数据/文本在Python中解析非结构化文本解析 python 中的非结构化文本改进非结构化文本的解析读取非结构化数据熊猫

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM