
ery Driven-Graph Neural Networks for Community Search:
From Non-Aributed, Aributed, to Interactive Aributed
Yuli Jiang
∗1
, Yu Rong
†2
, Hong Cheng
∗3
, Xin Huang
‡4
, Kangfei Zhao
†5
, Junzhou Huang
†6
∗
The Chinese University of Hong Kong,
†
Tencent AI Lab,
‡
Hong Kong Baptist University, China
{
1
yljiang,
3
hcheng,
5
kfzhao}@se.cuhk.edu.hk,
2
yu.rong@hotmail.com,
4
xinhuang@comp.hkbu.edu.hk,
5
zkf1105@gmail.com,
6
jzhuang@uta.edu
ABSTRACT
Given one or more query vertices, Community Search (CS) aims
to nd densely intra-connected and loosely inter-connected struc-
tures containing query vertices. Attributed Community Search
(ACS), a related problem, is more challenging since it nds com-
munities with both cohesive structures and homogeneous vertex
attributes. However, most methods for the CS task rely on inexible
pre-dened structures and studies for ACS treat each attribute inde-
pendently. Moreover, the most popular ACS strategies decompose
ACS into two separate sub-problems, i.e., the CS task and subse-
quent attribute ltering task. However, in real-world graphs, the
community structure and the vertex attributes are closely corre-
lated to each other. This correlation is vital for the ACS problem. In
this vein, we argue that the separation strategy cannot fully capture
the correlation between structure and attributes simultaneously
and it would compromise the nal performance.
In this paper, we propose Graph Neural Network (GNN) models
for both CS and ACS problems, i.e., Query Driven-GNN (
QD
-
GNN
)
and Attributed Query Driven-GNN (
AQD
-
GNN
). In
QD
-
GNN
, we
combine the local query-dependent structure and global graph
embedding. In order to extend
QD
-
GNN
to handle attributes, we
model vertex attributes as a bipartite graph and capture the relation
between attributes by constructing GNNs on this bipartite graph.
With a Feature Fusion operator,
AQD
-
GNN
processes the structure
and attribute simultaneously and predicts communities according
to each attributed query. Experiments on real-world graphs with
ground-truth communities demonstrate that the proposed mod-
els outperform existing CS and ACS algorithms in terms of both
eciency and eectiveness. More recently, an interactive setting
for CS is proposed that allows users to adjust the predicted com-
munities. We further verify our approaches under the interactive
setting and extend to the attributed context. Our method achieves
2
.
37% and 6
.
29% improvements in F1-score than the state-of-the-art
model without attributes and with attributes respectively.
PVLDB Reference Format:
Yuli Jiang, Yu Rong, Hong Cheng, Xin Huang, Kangfei Zhao, Junzhou
Huang. Query Driven-Graph Neural Networks for Community Search:
From Non-Attributed, Attributed, to Interactive Attributed. PVLDB, 15(6):
XXX-XXX, 2022.
doi:10.14778/3514061.3514070
This work is licensed under the Creative Commons BY-NC-ND 4.0 International
License. Visit https://creativecommons.org/licenses/by-nc-nd/4.0/ to view a copy of
this license. For any use beyond those covered by this license, obtain permission by
emailing info@vldb.org. Copyright is held by the owner/author(s). Publication rights
licensed to the VLDB Endowment.
Proceedings of the VLDB Endowment, Vol. 15, No. 6 ISSN 2150-8097.
doi:10.14778/3514061.3514070
1 INTRODUCTION
Graph is an essential data structure to represent entities and their
relationships, e.g., social networks, protein-protein interaction net-
works, web graphs, and knowledge graphs, to name a few. Com-
munity, a subgraph of densely intra-connected and loosely inter-
connected structure, naturally exists as a functional module in real-
world graphs. Community Search (CS) [
7
,
11
,
19
,
21
,
25
,
37
] is a vital
application in graph analytics. Concretely, given any query vertices,
CS aims to nd a vertex set with cohesive structure according to the
query, i.e., query-dependent communities. Attributed Community
Search (ACS), a related but more challenging problem, has attracted
a lot of attention recently [
10
,
11
,
22
,
23
,
25
]. Given any query vertex
and attribute set, ACS aims at nding query-dependent commu-
nities with homogeneous attributes, which means the community
members share similar attributes with the query attributes.
For the CS and ACS problems, existing studies suer from two
serious limitations, that is, structure inflexibility and aribute
irrelevance
. Structure inexibility refers to the problem that most
community search models are based on a pre-dened subgraph
pattern, such as
𝑘
-core [
8
,
10
,
37
],
𝑘
-truss [
1
,
21
,
22
],
𝑘
-clique [
7
,
44
],
and
𝑘
-edge connected component (
ECC
) [
6
,
19
]. The pre-dened
subgraph pattern imposes a very rigid requirement on topological
structure of communities, which may not perfectly hold in real-
world communities. Attribute irrelevance means existing models
treat each attribute independently [
10
,
22
]. However, in real graphs,
vertex attributes are not independent of each other. Ignoring such
implicit relations would harm the quality of queried communities.
Figure 1 depicts a toy example illustrating the limitation of exist-
ing algorithms. The faculty hierarchy is a tree-like structure from
the faculty dean, department chairman to the professors in each
department. Using existing methods based on pre-dened subgraph
patterns, we can only nd a 1-core community of vertex 6 in
𝐻
1
and a 2-truss community in
𝐻
2
, which are the entire graph. These
𝑘
-core [
37
] and
𝑘
-truss [
21
] patterns cannot discover the tree-like
department communities owing to the structure inexibility. For
attributed community search, when querying the community of
vertex 6 and attribute “ML”, current methods [
10
,
22
] nd the com-
munity
𝐻
3
since they ignore the implicit relations between “ML”,
“DL” and “CV”. Thus, existing studies suer from these two inade-
quacies on structure and attribute respectively.
Moreover, for the ACS problem, existing studies [
10
,
22
] usually
adopt a two-stage strategy which rst nds the candidate com-
munity by considering the topological structure only, and then
performs a ltering on the candidate community by considering
the attribute similarity. The two-stage strategy treats the structure
cohesiveness and attribute homogeneity separately. But there is usu-
ally a correlation between structure and attribute, for instance, in
arXiv:2104.03583v2 [cs.DB] 23 Mar 2022
评论