节点文献
基于子图同构的Hive数据操作合规分析方法
Compliance Analysis Method of Hive Data Operation Based on Subgraph Isomorphism
【摘要】 Hive现有的审计功能不能对数据操作目的进行合规判断。针对以上问题,该文提出一种基于子图同构的Hive数据操作合规分析方法。首先,提出基于图的Hive数据操作和合规规则的建模方法,形成数据溯源图和合规规则图;然后,将数据操作合规判断建模为溯源图和合规图的匹配问题,并提出基于子图同构的求解算法。最后,在数据治理平台Apache Atlas及Hive中进行了实验验证,实验结果表明,相比于基于集合、VF2以及Ullmann的合规验证,该文方法具有更高的合规验证效率。
【Abstract】 Hive’s existing audit function can not make compliance judgment on the purpose of data operation.To solve the above problems,a Hive data operation compliance analysis method based on subgraph isomorphism is proposed.Firstly,the modeling method of Hive data operation and compliance rules based on graph is proposed to form data traceability graph and compliance rule graph;Then,the compliance judgment of data operation is modeled as the matching problem of traceability graph and compliance graph,and a solution algorithm based on subgraph isomorphism is proposed.Finally,the experimental verification is carried out in the data governance platforms Apache Atlas and Hive.The experimental results show that the proposed method has higher compliance verification efficiency than the collection based,VF2 and Ullmann compliance verification.
【Key words】 Hive database; Apache Atlas; Compliance analysis; Subgraph isomorphism;
- 【文献出处】 电子与信息学报 ,Journal of Electronics & Information Technology , 编辑部邮箱 ,2022年12期
- 【分类号】TP311.13
- 【下载频次】12