
UniView: A Unified Autonomous Materialized View Management
System for Various Databases
Zhenrong Xu, Pengfei Wang
Zhejiang University
Hangzhou, China
{xuzhenrong,wangpf}@zju.edu.cn
Guoze Xue, Qitong Yan
Zhejiang University
Hangzhou, China
{xuegz,qitong.yan}@zju.edu.cn
Shenghao Gong, Yelan Jiang
Zhejiang University
Hangzhou, China
{gongshenghao,jiangyelan}@zju.edu.cn
Yuren Mao, Yunjun Gao
Zhejiang University
Hangzhou, China
{yuren.mao,gaoyj}@zju.edu.cn
Shu Shen, Wei Zhang, Dan Luo
Huawei
Hangzhou, China
{shenshu,zhangwei09,luodan2}@huawei.com
Lu Chen
Zhejiang University
Hangzhou, China
luchen@zju.edu.cn
ABSTRACT
Materialized views (MVs) are critical for improving query perfor-
mance of database systems, especially in online analytical process-
ing (OLAP) databases. Typically, MVs are maintained by DBAs,
which relies on prior knowledge and manual operations. Recently,
autonomous solutions are designed for specic databases. How-
ever, a data warehouse for OLAP is typically hierarchical, which
uses dierent database engines at dierent stages. Hence, existing
methods have limitations in terms of autonomy and unication to
support practical applications.
Motivated by these, we develop UniView, a unied autonomous
materialized view management system that supports various popu-
lar databases, including Spark SQL, PostgreSQL, and ClickHouse.
Moreover, we provide a cross-platform web user interface, where
users can carry out the process of materialized views and evaluate
the optimization performance. In the demonstration, we show that
UniView is user-friendly and can achieve superior performance in
the practical industry scenarios.
PVLDB Reference Format:
Zhenrong Xu, Pengfei Wang, Guoze Xue, Qitong Yan, Shenghao Gong,
Yelan Jiang, Yuren Mao, Yunjun Gao, Shu Shen, Wei Zhang, Dan Luo,
and Lu Chen. UniView: A Unied Autonomous Materialized View
Management System for Various Databases. PVLDB, 17(12): 4353 - 4356,
2024.
doi:10.14778/3685800.3685873
PVLDB Artifact Availability:
The source code, data, and/or other artifacts have been made available at
https://github.com/ZJU-DAILY/UniView.
1 INTRODUCTION
Materialized views (MVs) are of critical importance to the query
performance of database systems. As shown in Figure 1, we can
materialize a view to speed up the query process. In online analyti-
cal processing (OLAP) databases, many SQL queries share common
This work is licensed under the Creative Commons BY-NC-ND 4.0 International
License. Visit https://creativecommons.org/licenses/by-nc-nd/4.0/ to view a copy of
this license. For any use beyond those covered by this license, obtain permission by
emailing info@vldb.org. Copyright is held by the owner/author(s). Publication rights
licensed to the VLDB Endowment.
Proceedings of the VLDB Endowment, Vol. 17, No. 12 ISSN 2150-8097.
doi:10.14778/3685800.3685873
OrderKey CustKey TotalPrice OrderDate Name Address Zipcode
0001 0001 10.0 09152022 A E 310024
0002 0002 100.0 09152022 B F 310023
0003 0004 50.0 09152022 C G 310024
0004 0003 2000.0 09152022 D H 310020
Select Order.TotalPrice
From View
Where Zipcode=310024
Select Order.TotalPrice
From Orders, Custormers
Where Oders.CustKey = Custormers.CustKey
AND Zipcode=310024
View
Figure 1: An example of materialized views.
subqueries and there are lots of redundant computations among
these queries. Materializing views on these subqueries can avoid
redundant computation and improve query performance, which is
a space-for-time trade-o principle. Therefore, it is vital to select
the optimal MVs that can bring the most query performance im-
provement within a space budget. For example, in Amazon Redshift,
automated materialized views are a powerful tool for improving
query performance.
Most existing methods rely on DBAs to generate and maintain
MVs [
3
]. Nevertheless, these methods require prior knowledge and
manual operations, which are costly, and thus, cannot eciently
and eectively support large-scale databases. Motivated by this,
autonomous MV selection methods are proposed recently. The MV
selection process within a space budget can be regarded as a 0-
1 integer linear programming (0-1 ILP) problem. Solving 0-1 ILP
is too expensive for large workloads since the complexity of the
0-1 ILP approach is
𝑂 (
2
𝑛
)
. To eciently solve the MV selection,
two lines of methods exist. One line of MV selection methods is
heuristics, such as the greedy strategy [
1
,
2
]. Another line of studies
uses reinforcement learning (RL) [
3
,
4
,
7
,
10
] or graph algorithm [
5
]
to solve the 0-1 ILP problem faster.
There exist many works[
3
,
4
,
6
,
7
,
9
–
11
] that leverage ML meth-
ods to solve database problems, indicating a growing trend.
However, existing automatic MV selection methods are designed
for specic database systems, e.g., DQM [
6
] designed for Spark
SQL, AutoView [
3
] designed for PostgreSQL, DBMind [
11
] designed
for OpenGauss, and SOFOS [
8
] designed for knowledge graphs.
4353
文档被以下合辑收录
评论