update processing, which often leads to an imbalance of
computing resource allocation between queries and updates.
As the contention intensifies, the user waiting time for queries
can be extended, resulting in lengthy query response time
for the users. In a high-load situation, the queue builds up
quickly, and the response time of a query can be significantly
exaggerated, impacting user satisfaction. Therefore, the query
response time closely reflects the Quality-of-Service (QoS) to
the user in a PPR-based application.
The problem: QoS-Aware PPR Computations. Despite the
importance of queue-based PPR query response time opti-
mization in practical applications, most of the existing PPR
algorithms [10], [11], [12], [13], [14], [15], [16] focus on a
stand-alone PPR query or update, as reviewed in Section III.
However, the query response time does not solely depend
on query or update processing time, but is more related to
the behavior combination of PPR queries and data updates
in a waiting queue, as we discussed earlier. In this paper,
we make the first attempt at PPR computation with a focus
on optimizing the query response time for single-source PPR
(SSPPR) queries or top-k PPR queries over dynamic graphs,
and we name the problem as QoS-Aware PPR Computation.
In particular, given any PPR query and update arrival rates, we
aim to design an algorithmic framework that can adapt itself
to suit the workload in producing a reasonable average query
response time.
QoS-Aware PPR computation is a non-trivial problem. First,
directly applying the state-of-the-art PPR query algorithms
(e.g., Agenda [4], FORA [13]) does not ensure a reasonable
query response time, because they mainly optimize for a
shorter query time at the expense of update time (e.g., updating
index structures). As we have argued, faster query processing
is not necessarily translated into a shorter query response time.
Second, the workload of PPR queries and graph updates may
change over time in recommendation systems. However, the
mainstream PPR algorithms often entail many hyperparam-
eters, making it difficult to set justifiable parameters under
different workloads.
These challenges motivate us to design Quota,
1
a system
that aims to optimize PPR query response time for a wide
spectrum of query and update workload configurations. We
emphasize several main design philosophies of Quota. First,
our purpose is not to design a faster PPR query algorithm
than existing ones which are already close to the optimum
in many aspects due to the promising development in recent
decades. Instead, given the maturity of the PPR algorithms, it
can be more important to design an algorithmic framework to
accommodate some state-of-the-art PPR algorithms as base
algorithms and allow them to be transparently adapted for
optimizing query response time. In this paper, we discuss three
base algorithms: Agenda [4], FORA [13], and SpeedPPR [16],
because Agenda [4] is the state-of-the-art dynamic PPR al-
gorithm, FORA [13] is a representative PPR algorithm that
motivates many later algorithms, and SpeedPPR is one of the
1
Quality of Service Optimization for Personalized PageRank over Evolving
Graphs
most query-efficient PPR algorithms in static graphs. Second,
the design of Quota integrates several different branches
of mathematical tools including queuing theory, constrained
optimization, and complexity analysis. Particularly, Quota ab-
stracts the tuning parameters of a typical PPR query framework
and builds a mathematical relationship between the parameters
and the real cost. It then incorporates an in-depth queuing
model to facilitate the transformation of the problem into a
constrained optimization problem. We also design algorithms
to solve the optimization problem to precisely determine the
most suitable parameters for the given workload.
We summarize our main contributions as follows:
• Problem Formulation. We define the problem of QoS-
Aware PPR optimization, which reflects the issues when
applying PPR measures in practical recommendation systems.
The problem is significantly different from the existing PPR
optimization problems which optimize a stand-alone PPR
query or update.
• Auto-Configuration System. We present Quota, a con-
figuration system that can be deployed with some state-of-
the-art PPR algorithms to optimize the query response time.
Quota is an auto-configuration system to enhance the state-of-
the-art approach in adaptability to various query and update
workloads. Quota can be applied to four recent state-of-the-art
algorithms, namely, Agenda [4], FORA [13], SpeedPPR [16],
and TopPPR [17]. To the best of our knowledge, Quota is the
first configuration system that aims to optimize the PPR query
response time for PPR-based systems.
• Reordering Algorithm. In order to further optimize the
query response time, Quota further allows violation of the
FCFS queuing policy with controllable shuffling of the queries
and updates, without affecting the accuracy guarantees. This
forms the reordering algorithm named Seed.
• Extensive Experiments. We have conducted extensive
experiments in various real datasets and settings. Our results
demonstrate that Quota can achieve up to 86%, 40%, 34%,
50% and 33% shorter response time than the state-of-the-
art PPR algorithms Agenda, FORA, SpeedPPR, FORA-TopK
and TopPPR respectively, if deployed with the corresponding
algorithm.
II. PROBLEM DEFINITION
A. PPR Queries
Let G = (V, E) be a directed graph. Given a source node
s ∈ V and a probability α ∈ (0, 1), a random walk from s
is a traversal in G where at each walk step it will move to a
neighbor chosen uniformly at random with probability 1 −α,
and otherwise terminate at the current node. The Personalized
PageRank (PPR) value from s to t, denoted by π(G, s, t),
equals the probability that a random walk starting from s
terminates at t. An important task of PPR computation is the
single-source PPR query, defined as follows.
Definition 1. (SSPPR queries) Given a source node s, a
threshold δ ∈ (0, 1), an error bound ϵ, and a failure prob-
ability p
f
, a Single-Source PPR (SSPPR) query returns an
estimated PPR value ˆπ(G, s, t) for each vertex t ∈ V , such
that for any π(G, s, t) > δ, we have:
评论