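Journal of Zhengzhou University (Natural Science Edition)

2025, No. 01, Vol. 57, pp. 67-73

Attribute Tri-Factorization Model with Regularization of Relation Digraph

1. School of Electrical Engineering and Automation, Jiangsu Normal University 2. School of Information and Control Engineering, China University of Mining and Technology 3. State Key Laboratory of Intelligent Construction and Healthy Operation and Maintenance of Deep Underground Engineering, China University of Mining and Technology

Foundation: National Natural Science Foundation of China (42372329); Basic Science (Natural Science) Research Project of Higher Education Institutions of Jiangsu Province (21KJB520005); Natural Science Foundation of Jiangsu Province (BK20200632); Xuzhou Basic Research Program (KC23019); Jiangsu Normal University Research Project (21XSRS001)

Email: Ruilin.Li@cumt.edu.cn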

DOI: 10.13705/j.issn.1671-6841.2023147

Submitted: 2023-06-14

Submission year: 2023

Revised: 2024-04-19

Final review: 2025-01-14

Final review year: 2025

Review cycle (years): 2

Released: 2024-04-29

Published: 2024-04-29

Published online: 2024-04-29


Downloads: 105; Citations: 1; Views: 112


Abstract:

To address the incomplete mapping between attributes and features and the insufficient mining of attribute-space structure in zero-shot image classification, an attribute tri-factorization model regularized by a relation digraph is proposed. First, tri-factorization of the attribute matrix establishes the mapping between the attribute space and the feature space. Second, an attribute relation digraph is constructed from the weight matrix. Finally, the similarity between each test sample and each test class is computed in either the attribute space or the feature space to perform image classification. Experimental results on the aPY and SUN datasets show that the proposed model effectively improves zero-shot image classification accuracy.

Keywords: zero-shot image; attribute tri-factorization; relation digraph; regularization
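The abstract names three stages: a mapping between feature space and attribute space, a directed graph over attributes, and similarity-based classification of test samples. The sketch below illustrates that general pipeline shape only and is not the authors' model. It swaps in plain ridge regression for the tri-factorization, and it assumes (hypothetically) that the directed edge weights are row-normalized attribute co-occurrences. All function names and the toy data are illustrative assumptions, not taken from the paper.

```python
import numpy as np


def fit_feature_to_attribute_map(X, A, lam=1.0):
    """Ridge-regression stand-in for the paper's tri-factorization (assumption).

    Solves min_W ||X W - A||_F^2 + lam * ||W||_F^2.
    X: (n_samples, n_features) features of seen-class samples.
    A: (n_samples, n_attributes) class attributes of those samples.
    """
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ A)


def attribute_digraph_weights(class_attrs, eps=1e-12):
    """Hypothetical directed attribute-relation weights.

    W[i, j] is the share of attribute i's co-occurrence mass that falls on
    attribute j. Row normalization makes W asymmetric in general, so it
    can be read as the weight matrix of a directed graph.
    """
    co = (class_attrs.T @ class_attrs).astype(float)
    np.fill_diagonal(co, 0.0)  # ignore self-relations
    return co / (co.sum(axis=1, keepdims=True) + eps)


def classify_in_attribute_space(X_test, W, unseen_prototypes, eps=1e-12):
    """Assign each test sample to the unseen class whose attribute
    prototype has the highest cosine similarity with the projected sample."""
    pred = X_test @ W
    pred = pred / (np.linalg.norm(pred, axis=1, keepdims=True) + eps)
    proto = unseen_prototypes / (
        np.linalg.norm(unseen_prototypes, axis=1, keepdims=True) + eps
    )
    return np.argmax(pred @ proto.T, axis=1)


if __name__ == "__main__":
    # Toy data: 3 attributes embedded in the first 3 of 5 feature dimensions.
    seen_attrs = np.array(
        [[1, 0, 0], [0, 1, 0], [0, 0, 1], [1, 1, 0]], dtype=float
    )
    embed = np.hstack([np.eye(3), np.zeros((3, 2))])
    W = fit_feature_to_attribute_map(seen_attrs @ embed, seen_attrs, lam=1e-3)

    unseen_protos = np.array([[1, 0, 1], [0, 1, 1]], dtype=float)
    X_test = unseen_protos @ embed
    print(classify_in_attribute_space(X_test, W, unseen_protos))
    print(attribute_digraph_weights(seen_attrs))
```

In this sketch the digraph weights are computed but not yet used; in a regularized model they would enter the training objective as a penalty term, whose exact form is given in the full paper rather than in this abstract.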


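Basic information:

CLC number: TP391.41

Citation:

[1] ZHANG Jiarui, LI Ruilin, KONG Yi, et al. Attribute tri-factorization model with regularization of relation digraph[J]. Journal of Zhengzhou University (Natural Science Edition), 2025, 57(01): 67-73. DOI: 10.13705/j.issn.1671-6841.2023147.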
