稀疏主成分分析(Sparse PCA)概述

稀疏主成分分析(Sparse PCA)由Hui Zou等人于2006年提出,旨在解决PCA解释性不强的问题。通过引入lasso(弹性网络)约束,SPCA能生成具有稀疏载荷的改进主成分,提高可解释性。算法已成功应用于常规多变量数据和基因表达数组,实验结果令人鼓舞。
AI助手已提取文章相关产品:

Hui Zou et al. 2006年发表在《Journal of computational and graphical statistics》上的文章“Sparse principal component analysis” [1] 首次提出SparsePCA 的概念,截止到目前(2014年4月3日)该文章已经被引用853次(参考谷歌学术搜索数据)[2]。

Literature Sparse principal component analysis”的摘要如下:

Principal component analysis (PCA) is widelyused in data processing and dimensionality reduction. However, PCA suffers fromthe fact that each principal component is a linear combination of all theoriginal variables, thus it is often difficult to interpret the results. We introducea new method called sparse principal component analysis (SPCA) using the lasso (elasticnet) to produce modified principal components with sparse loadings. We firstshow that PCA can be formulated as a regression-type optimization problem;sparse loadings are then obtained by imposing the lasso (elastic net)constraint on the regression coefficients. Efficient algorithms are proposed tofit our SPCA models for both regular multivariate data and gene expressionarrays. We also give a new formula to compute the total variance of modifiedprincipal components. As illustrations, SPCA is applied to real and simulateddata with encouraging results.


Motivation

PCA方法的不足:

       PCA方法的主成分(Principal Component)是原有变量的线性组合,因此很难地接受其结果。


 未完待续。。。

 

[1]Zou,Hui, Trevor Hastie, and Robert Tibshirani."Sparse principal component analysis." Journal of computational and graphical statistics 15.2(2006): 265-286.

[2]http://scholar.google.com/citations?view_op=view_citation&hl=en&user=dJfEfJgAAAAJ&citation_for_view=dJfEfJgAAAAJ:lSLTfruPkqcC

您可能感兴趣的与本文相关内容

评论 2
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值