简体   繁体   English

回归,机器学习分类

[英]Regression, classification on Machine Learning

I have a classification and regression question on machine learning. 我有一个关于机器学习的分类和回归问题。 First question, the following dataset http://it.tinypic.com/view.php?pic=oh3gj7&s=8#.VIjhRDGG_lF 第一个问题,以下数据集http://it.tinypic.com/view.php?pic=oh3gj7&s=8#.VIjhRDGG_lF

Can we say, the data set is linearly separable? 我们可以说数据集是线性可分离的吗? In order to apply a linear model for classi cation, a transformation of the input space is not needed for this dataset, or is not possible for this dataset? 为了将线性模型应用于分类,该数据集不需要输入空间的转换,或者对于该数据集是不可能的? My answer is no, but I am not sure for the second, I am not sure a transformation is possible for the dataset. 我的回答是“否”,但我不确定第二点,也不确定该数据集是否可以进行转换。

Second question about regression probl: Give the following data set f : R -> R http://it.tinypic.com/view.php?pic=madsmr&s=8#.VIjhVjGG_lE 关于回归概率的第二个问题:给出以下数据集f:R-> R http://it.tinypic.com/view.php?pic=madsmr&s=8#.VIjhVjGG_lE

Can we say that : A linear model for regression can be used to learn the function associated to this data set ? 我们可以这样说吗:可以使用线性回归模型来学习与此数据集相关的函数? Given this data set, it is not possible to determine an optimal con guration of the linear model? 给定此数据集,无法确定线性模型的最佳配置?

I am reading the book of Tom Mitchell Machine learning, and Pattern Recognition and Machine Learning Bishop, but I still have trouble giving the right answer. 我正在阅读《汤姆·米切尔机器学习》,《模式识别和机器学习Bishop》这本书,但是我仍然很难给出正确的答案。 Thanks in advance. 提前致谢。

Neither of this datasets can be modeled using linear classification/regression. 使用线性分类/回归无法对这两个数据集进行建模。

In case of the "input data transfromation" if only dataset is consistent (there are no two exact same points with two different labels) there always exists transformation after which data is linearly separable . 在“输入数据转换”的情况下,如果仅数据集是一致的(不存在带有两个不同标签的两个完全相同的点), 则始终存在变换,此后数据是线性可分离的 In particular one can construct it with: 特别是可以使用以下方法构造它:

phi(x) = 1 iff label of x is "1"

in other words, you map all positive samples to "1" and negatives to "0", so your data is now trivialy linearly separable. 换句话说,您将所有正样本映射为“ 1”,将负样本映射为“ 0”,因此您的数据现在可以线性地分离。 Or simply map your N points into N unit vectors in R^N space in such a way that i'th point is mapped to [0 0 0 ... 1 ... 0 0 0]^T where this "1" appears at i'th place. 或简单地将您的N个点映射到R ^ N空间中的N个单位向量,以使第i个点映射到[0 0 0 ... 1 ... 0 0 0] ^ T,其中出现“ 1”在我的地方。 Such dataset is trivialy linearly separable for any labeling. 这样的数据集对于任何标记都是线性可分离的。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM