大数据集上的主成分分析

1条回答

网友

1楼 · 发布于 2024-05-15 00:15:46

我建议在OpenTURNS中使用KarhunenLoeveSVDAlgorithm。它提供了随机SVD算法的4种实现。约束条件是必须预先设置要计算的奇异值的数量

为了启用该算法，我们必须在ResourceMap中设置KarhunenLoeveSVDAlgorithm-UseRandomSVD键。然后KarhunenLoeveSVDAlgorithm-RandomSVDMaximumRank键设置要计算的奇异值的数量（默认值为1000）

提供了两种实现：

Nathan Halko，Per Gunnar Martinsson，Joel A.Tropp。寻找随机结构：构造近似矩阵分解的概率算法
Nathan Halko，Per Gunnar Martisson，Yoel Shkolnisky和Mark Tygert。大数据集主成分分析的一种算法

可以使用KarhunenLoeveSVDAlgorithm-RandomSVDVariant键选择这些算法

在下面的示例中，我使用AbsoluteExponential协方差模型模拟了一个来自高斯过程的大过程样本

import openturns as ot
mesh = ot.IntervalMesher([10]*2).build(ot.Interval([-1.0]*2, [1.0]*2))
s = 0.01
model = ot.AbsoluteExponential([1.0]*2)
sampleSize = 100000
sample = ot.GaussianProcess(model, mesh).getSample(sampleSize)

然后使用随机SVD算法：

ot.ResourceMap_SetAsBool('KarhunenLoeveSVDAlgorithm-UseRandomSVD', True)
algorithm = ot.KarhunenLoeveSVDAlgorithm(sample, s)
algorithm.run()
result = algorithm.getResult()

result对象包含进程的Karhunen Loève分解。这对应于具有规则网格（和相等权重）的PCA

相关问题更多 >

编程相关推荐

热门问题

热门文章

大数据集上的主成分分析

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >