83 | 0 | 38 |
下载次数 | 被引频次 | 阅读次数 |
针对基于约束的因果关系发现方法中的马尔科夫等价类问题及函数因果模型对噪声的非高斯性假设问题,使用Cai-伪残差的三个定理,提出了Cai-伪残差因果定向算法。首先,假设变量之间关系线性且不限制噪声类型,在此条件下,对于贝叶斯网络的三种结构,Cai-伪残差与变量间的独立性表现出不同的结果。其次,利用基于约束的方法构建马尔科夫等价类之后,通过不同结果进一步发现并区分三种结构,对马尔科夫等价类中部分未定向的边进一步定向。最后,在不同因果网络构成的线性高斯数据集和线性非高斯数据集上分别进行了实验,结果表明,所提算法不仅显著减少了马尔科夫等价类中无向边的数量,同时也有效地提高了因果关系定向的准确性。
Abstract:In addressing the issues of Markov equivalence class in constraint-based causal discovery methods and the non-Gaussian noise assumption in functional causal models, the Cai-pseudo residual causal orientation algorithm was proposed using the three theorems of the Cai-pseudo residuals. Firstly, the relationships between variables were assumed to be linear, and no restrictions were imposed on the type of noise. With these conditions, the independence between the Cai-pseudo residuals and variables was manifested in diverse ways across the three distinct structures of Bayesian networks. Secondly, after construction of the Markov equivalence class using a constraint-based method, such varying associations were exploited to further distinguish the three structures and direct some previously undirected edges within the Markov equivalence class. Finally, experiments were performed on both linear Gaussian datasets and non-Gaussian datasets made up of different causal network structures. The results highlighted that the proposed algorithm not only greatly lessened the quantity of undirected edges in the Markov equivalence class, but also notably enhanced the accuracy of causal direction determination.
[1] SIDDIQI S H,KORDING K P,PARVIZI J,et al.Causal mapping of human brain function[J].Nature reviews neuroscience,2022,23(6):361-375.
[2] 曹小敏,刘进锋.基于因果推断的两阶段长尾分类研究[J].郑州大学学报(理学版),2024,56(5):31-38.CAO X M,LIU J F.A study of two-stage long-tail classification based on causal inference[J].Journal of Zhengzhou university (natural science edition),2024,56(5):31-38.
[3] SCH?LKOPF B,LOCATELLO F,BAUER S,et al.Toward causal representation learning[J].Proceedings of the IEEE,2021,109(5):612-634.
[4] CAMPS-VALLS G,GERHARDUS A,NINAD U,et al.Discovering causal relations and equations from data[J].Physics reports,2023,1044:1-68.
[5] ARONOW P M,S?VJE F.The book of why:the new science of cause and effect[J].Journal of the American statistical association,2020,115(529):482-485.
[6] SPIRTES P,GLYMOUR C.An algorithm for fast recovery of sparse causal graphs[J].Social science computer review,1991,9(1):62-72.
[7] COLOMBO D,MAATHUIS M H.Order-independent constraint-based causal structure learning[J].Journal of machine learning research,2014,15(1):3741-3782.
[8] RAMSEY J.Improving accuracy and scalability of the PC algorithm by maximizing P-value[EB/OL].(2016-10-05)[2024-04-23].https://doi.org/10.48550/arXiv.1610.00378.
[9] SPIRTES P,MEEK C,RICHARDSON T.An algorithm for causal inference in the presence of latent variables and selection bias[M]//Computation,Causation,and Discovery.Palo Alto:AAAI Press,1999:211-252.
[10] VERMA T S,PEARL J.Equivalence and synthesis of causal models[M]//Probabilistic and Causal Inference.New York:ACM Press,2022:221-236.
[11] SHIMIZU S,HOYER P O,HYV?RINEN A,et al.A linear non-Gaussian acyclic model for causal discovery[J].Journal of machine learning research,2006,7:2003-2030.
[12] KITSON N K,CONSTANTINOU A C,GUO Z G,et al.A survey of Bayesian network structure learning[J].Artificial intelligence review,2023,56(8):8721-8814.
[13] RUNGE J,GERHARDUS A,VARANDO G,et al.Causal inference for time series[J].Nature reviews earth & environment,2023,4:487-505.
[14] YAO L,CHU Z,LI S,et al.A survey on causal inference[J].ACM transactions on knowledge discovery from data,2021,15(5):1-46.
[15] CAI R C,XIE F,GLYMOUR C,et al.Triad constraints for learning causal structure of latent variables[C]//Proceedings of the 33rd International Conference on Neural Information Processing Systems.Cambridge:MIT Press,2019:12863-12872.
基本信息:
DOI:10.13705/j.issn.1671-6841.2024102
中图分类号:TP18
引用信息:
[1]牛瑞琪,原泽鹏,翟岩慧等.基于Cai-伪残差与变量独立性的因果定向方法[J].郑州大学学报(理学版),2025,57(06):24-33.DOI:10.13705/j.issn.1671-6841.2024102.
基金信息:
国家自然科学基金项目(62072294,61972238)