Robust Attributed Graph Alignment via Joint
Structure Learning and Optimal Transport
Jianheng Tang
2†
, Weiqi Zhang
2
, Jiajin Li
3
, Kangfei Zhao
4
, Fugee Tsung
1,2
, Jia Li
1,2∗
1
Hong Kong University of Science and Technology (Guangzhou),
2
Hong Kong University of Science and Technology,
3
Stanford University,
4
Tencent AI Lab
{jtangbf,wzhangcd}@connect.ust.hk, jiajinli@stanford.edu, zkf1105@gmail.com, {season,jialee}@ust.hk
Abstract—Graph alignment, which aims at identifying cor-
responding entities across multiple networks, has been widely
applied in various domains. As the graphs to be aligned are usu-
ally constructed from different sources, the inconsistency issues
of structures and features between two graphs are ubiquitous
in real-world applications. Most existing methods follow the
“embed-then-cross-compare” paradigm, which computes node
embeddings in each graph and then processes node correspon-
dences based on cross-graph embedding comparison. However,
we find these methods are unstable and sub-optimal when
structure or feature inconsistency appears. To this end, we
propose SLOTAlign, an unsupervised graph alignment frame-
work that jointly performs Structure Learning and Optimal
Transport Alignment. We convert graph alignment to an optimal
transport problem between two intra-graph matrices without the
requirement of cross-graph comparison. We further incorporate
multi-view structure learning to enhance graph representation
power and reduce the effect of structure and feature inconsistency
inherited across graphs. Moreover, an alternating scheme based
algorithm has been developed to address the joint optimization
problem in SLOTAlign, and the provable convergence result is
also established. Finally, we conduct extensive experiments on
six unsupervised graph alignment datasets and the DBP15K
knowledge graph (KG) alignment benchmark dataset. The pro-
posed SLOTAlign shows superior performance and strongest
robustness over seven unsupervised graph alignment methods
and five specialized KG alignment methods.
1
Index Terms—Graph alignment, Unsupervised learning, Struc-
ture learning, Optimal transport
I. INTRODUCTION
Graph alignment refers to the problem of identifying the
node correspondences (i.e., anchor links) across different
graphs. With graph data becoming ubiquitous in the Web
era, graph alignment establishes connections between multiple
networks and integrates them into a world-view network for
subsequent analysis and downstream applications. Thus, graph
alignment provides a comprehensive perspective for structured
data compared with mining each individual network. As a
well-established problem, graph alignment has received much
attention due to its vast applicable tasks, e.g., linking accounts
in different social network platforms [25], [26], [40], matching
entities across different knowledge graphs [34], [45], [63],
∗
Corresponding author.
†
Work done during an internship at Tencent AI Lab.
1
Code and data are released at https://github.com/squareRoot3/SLOTAlign
[69], integrating protein-protein interactions of different species
[21], [35], merging scholar profiles of academic collaboration
networks [47], [67].
Graph alignment is usually treated as a supervised problem
[33], [36], [67], [68], in which a set of ground-truth node
correspondences is given. However, these correspondences
are usually unavailable and further suffer from the labor
expensiveness issue in real-world applications. Thus, unsu-
pervised graph alignment methods have attracted increasing
attention [4], [9], [14], [17], [19], [31], [72]. Also, graph
nodes are often associated with wealthy side information,
such as the user information of social network accounts
or the embedding of knowledge graph entities. These high-
dimensional node features/attributes can serve as an additional
source of knowledge in graph alignment, especially under the
unsupervised setting.
Most existing graph alignment methods rely on high-quality
and well-measured graph structures. They require that the
structure of the overlapped parts between two graphs is similar,
which is named structure consistency thereafter. However, real-
world graphs are often coupled with outliers [46] or with
missing/irrelevant edges [28], [54], [61], leading to structure
inconsistency across graphs. It is often observed that the same
entities in different networks have quite different neighbors
due to the structural noise [4], [72]. Figure 1 demonstrates an
example of graph alignment on two social network platforms.
Black dashed lines are anchor links that connect the same copies
of users across two networks. As can be seen, two circled nodes
are connected on Platform A, but their corresponding nodes
on Platform B are not connected.
Besides structure inconsistency, another largely overlooked
issue is that node features in different graphs are usually
unaligned and inconsistent. Due to various functionalities of
different networks (e.g., LinkedIn for job seeking and Twitter
for opinion sharing), the same user in different networks
commonly does not share the same features. Taking Figure 1
as an example, user information in Platform A includes the real
name, gender, and education experiences, while Platform B
contains the anonymized username, locations, posts, etc. Under
this situation, corresponding nodes across two networks are
not similar to each other and are incomparable. Likewise, in
cross-lingual knowledge graph alignment, entities in different
arXiv:2301.12721v2 [cs.DB] 20 Apr 2023
评论