繁体 English 中英

如何在Python的MeanShift模块中估算带宽时选择合适的分位数？

[英]How to choose appropriate quantile value while estimating bandwidth in MeanShift module of python?

原文 2015-02-05 02:15:13 3 1 python/ cluster-analysis/ mean-shift

我正在对数据集执行均值漂移聚类。 Estimate_bandwidth函数估计适当的带宽以执行均值漂移聚类。

句法：

sklearn.cluster.estimate_bandwidth(X, quantile=0.3, n_samples=None, random_state=0)

我发现，估计带宽随分位数的增加而增加，从而导致簇数减少。 类似地，分位数的减少会减少带宽，因此不会增加带宽。 集群。

因此，似乎没有。 簇的数量取决于选择的分位数。

如何选择最佳分位数？

1 个解决方案

分位数用于KNN（在estimate_bandwidth函数内部使用）来确定带宽。
具体来说：

**n = KNN中的样本数=批次中的样本数*分位数**

然后，将基于同一群集中样本之间的平均成对距离（由KNN返回）来计算带宽。 因此，您可以使用它来弄清楚如何设置带宽。 该函数返回的带宽平均将覆盖n个样本，这将严重影响平均移位将返回的簇数。

如何将值列转换为熊猫python的分位数？

[英]How transform value column to quantile at pandas python?

如何使用Python解析ini文件并选择适当的功能

[英]How to parse ini files and choose appropriate function with python

Python MeanShift内存错误

[英]Python MeanShift Memory Error

如何返回分位数剪切范围的最大值而不是分位数标签

[英]How to return max value of quantile cut range instead of quantile label

使用Python根据给定的数据估算值

[英]Estimating the value based on given data using Python

登录python时如何选择处理程序

[英]How to choose handler while logging in python

是否可以通过 python 中的确切值获得分位数？

[英]Is it possible to get quantile by the exact value in python?

如何提高在Python中估计`Pi`的性能

[英]How to increase the performance for estimating `Pi`in Python

如何在熊猫中找到每个数字的分位数

[英]How to find the quantile value of each number in pandas

Python：如何从发行版中选择一个值？

[英]Python: how to choose a value from a distribution?

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 如何将值列转换为熊猫python的分位数？如何使用Python解析ini文件并选择适当的功能 Python MeanShift内存错误如何返回分位数剪切范围的最大值而不是分位数标签使用Python根据给定的数据估算值登录python时如何选择处理程序是否可以通过 python 中的确切值获得分位数？如何提高在Python中估计`Pi`的性能如何在熊猫中找到每个数字的分位数 Python：如何从发行版中选择一个值？

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM