【精品篇】_如何在不影响Oracle生产库性能的评估整库大小

原创杨磊 2019-12-12

3555

【引言】
最近碰到一个小问题：一TB级的Oracle生产库，因为要走数据迁移，需要先行评估整个库的迁移数据量大小，但又不得影响生产库运行性能。如何搞？大家都知道，expdp数据泵有两个很好用的参数ESTIMATE和ESTIMATE_ONLY，此两个参数可以保证在不真正发起逻辑备份的情况下评估整个迁移生产库的大小。
今天念叨下这个小问题。

这里使用$ expdp -help先看expdp的ESTIMATE和ESTIMATE_ONLY两个参数的介绍：

ESTIMATE
Calculate job estimates.
Valid keyword values are: [BLOCKS] and STATISTICS.

ESTIMATE_ONLY
Calculate job estimates without performing the export.

ESTIMATE
默认：blocks
指定计算每张表使用磁盘空间的方法
ESTIMATE=[BLOCKS | STATISTICS]
1.BLOCKS - 通过块数和块大小计算
2.STATISTICS -每张表的统计信息计算

Expdp可计算导出数据大小容量，通过两种方式进行容量估算，一种是通过数据块数量、一种是通过统计信息中记录的内容估算。两者在不同版本中，可能有巨大的性能差异。

通过expdp的参数ESTIMATE_ONLY和ESTIMATE来评估导出的性能，ESTIMATE_ONLY仅作评估不会导出数据，通过ESTIMATE参数指定statistics和blocks参数来测试两者的差异。

以下是Oracle 11.2.0.4中的测试数据输出，在此版本中，我们来看下ESTIMATE的statistics和blocks两个参数各自评估大小和用时。

两条命令如下：

$expdp \'/ as sysdba\' ESTIMATE_ONLY=y FULL=y ESTIMATE=blocks
$expdp \'/ as sysdba\' ESTIMATE_ONLY=y FULL=y ESTIMATE=statistics

具体执行如下：

$ expdp \'/ as sysdba\' ESTIMATE_ONLY=y FULL=y ESTIMATE=blocks

Export: Release 11.2.0.4.0 - Production on Thu Dec 12 16:57:57 2019

Copyright (c) 1982, 2011, Oracle and/or its affiliates.  All rights reserved.

Connected to: Oracle Database 11g Enterprise Edition Release 11.2.0.4.0 - 64bit Production
With the Partitioning, OLAP, Data Mining and Real Application Testing options
Starting "SYS"."SYS_EXPORT_FULL_01":  "/******** AS SYSDBA" ESTIMATE_ONLY=y FULL=y ESTIMATE=blocks 
Estimate in progress using BLOCKS method...
Processing object type DATABASE_EXPORT/SCHEMA/TABLE/TABLE_DATA
.  estimated "BIDW"."W_LOY_ACCRUAL_ITEM_F"               834.1 GB
.  estimated "BIDW"."W_LOY_ACTIVITY_F"                   261.6 GB
.
.
.
.  estimated "S_NQ_SCHED"."S_NQ_JOB"                         0 KB
.  estimated "S_NQ_SCHED"."S_NQ_JOB_PARAM"                   0 KB
**Total estimation using BLOCKS method: 2599. GB**
Job "SYS"."SYS_EXPORT_FULL_01" successfully completed at Thu Dec 12 17:00:48 2019 **elapsed 0 00:02:50**

****如上可以看出，使用ESTIMATE=blocks评估出来的大小为2599. GB，耗时：00:02:50；

接下来再看ESTIMATE=statistics方式。命令如下：

$expdp \'/ as sysdba\' ESTIMATE_ONLY=y FULL=y ESTIMATE=statistics

Export: Release 11.2.0.4.0 - Production on Thu Dec 12 16:37:08 2019

Copyright (c) 1982, 2011, Oracle and/or its affiliates.  All rights reserved.

Connected to: Oracle Database 11g Enterprise Edition Release 11.2.0.4.0 - 64bit Production
With the Partitioning, OLAP, Data Mining and Real Application Testing options
Starting "SYS"."SYS_EXPORT_FULL_01":  "/******** AS SYSDBA" ESTIMATE_ONLY=y FULL=y ESTIMATE=statistics 
Estimate in progress using STATISTICS method...
Processing object type DATABASE_EXPORT/SCHEMA/TABLE/TABLE_DATA
.  estimated "BIDW"."W_LOY_ACCRUAL_ITEM_F"               676.2 GB
.  estimated "BIDW"."W_LOY_ACTIVITY_F"                   226.4 GB
.
.
.
.  estimated "S_NQ_SCHED"."S_NQ_JOB"                         0 KB
.  estimated "S_NQ_SCHED"."S_NQ_JOB_PARAM"                   0 KB
**Total estimation using STATISTICS method: 2132. GB**
Job "SYS"."SYS_EXPORT_FULL_01" successfully completed at Thu Dec 12 16:41:55 **2019 elapsed 0 00:04:40**

如上，使用ESTIMATE=STATISTICS 评估出来的大小为2132. GB，耗时：00:04:40；

可以看出，两种方式统计的大小不一样，为啥？

个人分析推断：
ESTIMATE的默认方式是blocks。个人理解为：Oracle的块大小默认为8196B，直接对库使用的块数计算使用量，简单明了，但考虑到块有高水位线、碎片等因素，故统计不精确。而STATISTICS 方式因计算的是对每张表的实际使用情况，故更为精确。

但STATISTICS 的方式也非很精确，原因为：1. 该方式只是对表做统计，没有对索引、列、系统做统计，2. 一个表中被修改的行数超过stale_percent(缺省值10%)时才会认为这个表的统计数据过时，需要重新搜集。

注意：
如果压缩了表，那么使用ESTIMATE=BLOCKS计算的值时不准确的，这个时候就应该使用ESTIMATE=STATISTICS。

推荐：
使用ESTIMATE的默认方式blocks进行估算，原因很简单，估值按最大值估算申请空间更靠谱。

=========================================================
为了加深理解，这里介绍下Oracle 统计信息收集

大家都知道，Oracle较优执行计划的挑选是基于CBO（cost based optimized）判断，而CBO对哪个执行计划较优的判断是基于统计信息。优化器统计范围包含：

1. 表统计：行数，块数，行平均长度；
    all_tables：NUM_ROWS，BLOCKS，AVG_ROW_LEN；

2. 列统计：列中唯一值的数量（NDV），NULL值的数量，数据分布；
     DBA_TAB_COLUMNS：NUM_DISTINCT，NUM_NULLS，HISTOGRAM；
     
3. 索引统计：叶块数量，等级，聚簇因子；
      DBA_INDEXES：LEAF_BLOCKS，CLUSTERING_FACTOR，BLEVEL；
      
4. 系统统计：I/O性能与使用率；
     CPU性能与使用率；
      存储在aux_stats$中，需要使用dbms_stats收集，I/O统计在X$KCFIO中；

查询表上一次收集统计信息的时间：
SQL> select owner,table_name,last_analyzed from dba_tables where owner=‘SCOTT’;

Oracle中如何搜集统计信息？主要有2种方法：

方式1： analyze语句
analyze可以用来收集表，索引，列以及系统的统计信息和直方图，以下为一些典型用法：

analyze table scott.emp compute statistics; 
--收集所有的统计信息和直方图信息，包括表、列、索引。

analyze table scott.emp compute statistics for table; 
--收集emp表的统计信息，不含列、索引统计信息和直方图。

analyze table scott.emp compute statistics for all columns; 
--收集所有列的统计信息和直方图（超大表较耗资源，因为只要列中有非空值，那么就会收集这个列的统计信息和直方图）。

analyze table scott.emp compute statistics for all indexed columns;  
--收集所有索引列的统计信息和直方图。

analyze table scott.emp compute statistics for all indexes; 
--收集所有索引统计信息，不含列的统计信息和直方图。

analyze table scott.emp compute statistics for columns 列1,列2； 
--收集2个列的统计信息和直方图。

analyze index idx_ename delete statistics; 
--删除索引idx_ename的统计信息。

analyze table scott.emp delete statistics; 
--删除表t1所有的表，列，索引的统计信息和列直方图。

analyze table scott.emp estimate statistics sample 15 percent for table; 
--收集emp表的统计信息，以估算模式采样比例为15%进行收集，不含列、索引统计信息和直方图。

从语法可以看出，只有指定列统计信息收集时，才会收集相关列的直方图，此外收集直方图时for子句还可以加size子句，size的取值范围是1-254，默认值是75，表示直方图的buckets的最大数目。而dbms_stats包的size选择则有：数字|auto|repeat|skewonly选项，但analyze的size只能是数字。

关于直方图介绍：
A histogram is a special type of column statistic that provides more detailed information about the data distribution in a table column. A histogram sorts values into “buckets,” as you might sort coins into buckets.

从官网解释（参考第四个网址）来看，直方图就是一种特殊的列统计信息，这也与我们上边的推断相符，只有列才有直方图。
这里贴一个用于查看analyze后统计信息的SQL：

select t.num_rows as num_rows_in_table, i.index_name, i.num_rows as num_rows_in_index, co.num_analyzed_cols,ch.histogram_cnt
from (select num_rows from user_tables where table_name ='EMP') t,
     (select index_name,num_rows from user_indexes where table_name = 'EMP') i,
     (select count(*) as num_analyzed_cols from user_tab_columns where table_name='EMP' and num_distinct is not null) co,
     (select count(distinct column_name) histogram_cnt from user_tab_histograms where table_name = 'EMP' ) ch;

需要注意的一点是for table选项并不只收集表统计信息，将列和索引的统计信息一块收集了。

可调用dbms_stats.flush_database_monitoring_info过程来手动刷新这些数据。如果想在查询时得到最新信息(在所有统计数据收集之前内部监控数据会被刷新)。可通过查询user_tab_statistics视图中的stale_stats列来查看哪个表的统计数据过时。表的stale_stats被设置为NO,统计数据是最新的。表的stale_stats被设置为YES,统计数据是过时的，表的stale_stats没有被设置说明丢失统计数据。

方式2：调用dbms_stats包
dbms_stats中负责收集统计信息的几个存储过程：

**GATHER_DATABASE_STATS**
    --This procedure gathers statistics for all objects in the database.
    
**GATHER_DICTIONARY_STATS**
    --This procedure gathers statistics for dictionary schemas 'SYS', 'SYSTEM' and schemas of RDBMS components.
    
**GATHER_FIXED_OBJECTS_STATS**
    --This procedure gathers statistics for all fixed objects (dynamic performance tables).
    
**GATHER_INDEX_STATS**
    --This procedure gathers index statistics. It attempts to parallelize as much of the work as possible. Restrictions are described in the individual parameters. This operation will not parallelize with certain types of indexes, including cluster indexes, domain indexes, and bitmap join indexes. The granularity and no_invalidate arguments are not relevant to these types of indexes.
    
**GATHER_SCHEMA_STATS**
    --This procedure gathers statistics for all objects in a schema.
    
**GATHER_SYSTEM_STATS**
    --This procedure gathers system statistics.
    
**GATHER_TABLE_STATS**
    --This procedure gathers table and column (and index) statistics. It attempts to parallelize as much of the work as possible, but there are some restrictions as described in the individual parameters.

具体使用案例：

1. EXEC DBMS_STATS.GATHER_SCHEMA_STATS('SCOTT',estimate_percent=>80,method_opt=>'FOR ALL COLUMNS SIZE AUTO',degree=>4,cascade=>TRUE);

2. EXEC DBMS_STATS.GATHER_TABLE_STATS('SCOTT','EMP',estimate_percent=>80,method_opt=>'FOR ALL COLUMNS SIZE AUTO',degree=>4,cascade=>TRUE);

3. EXEC DBMS_STATS.GATHER_INDEX_STATS('SCOTT','PK_EMP',estimate_percent=>80,degree=>4);

dbms_stats与analyze的区别：

analyze收集系统内部对象会报错，而dbms_stats不会；
analyze不能正确的收集分区表的统计信息，而dbms_stats可以通过指定粒度来实现（granularity）。
analyze不能并行的收集统计信息，而dbms_stats可以（可以加上degree=>4来实现并行度为4的收集）。
Oracle推荐使用dbms_stats来收集统计信息，analyze将会被逐渐抛弃。

关于权限，取巧的办法，示例如下：

conn hr/hr
create or replace procedure gather_stats is
begin 
dbms_stats.gather_table_stats('HR', 'EMPLOYEES');
end gather_stats;
/ 

grant select on hr.employees to scott;
grant execute on gather_stats to scott; 

conn scott/scott  
exec hr.gather_stats;

总结：
1.本文介绍了在不影响生产库运行性能的前提下。使用expdp数据泵参数ESTIMATE和ESTIMATE_ONLY，在不真正发起逻辑备份的情况下，可以评估整个迁移生产库大小的用法和差别及分析；
2.同时介绍了Oracle的CBO，统计信息、直方图，如何收集统计的两种方法调用dbms_stats包和analyze两种方式，推荐使用调用dbms_stats包；
3.使用expdp评估库大小时，推荐使用ESTIMATE的默认方式blocks进行估算，原因很简单，估值按最大值估算申请空间更靠谱。

【参考】
https://docs.oracle.com/cd/B19306_01/appdev.102/b14258/d_stats.htm#CIHBIEII
【参考】
https://docs.oracle.com/cd/B12037_01/server.101/b10759/statements_4005.htm#i2150533
【参考】
https://docs.oracle.com/database/121/TGSQL/tgsql_histo.htm#TGSQL36
【参考】
https://www.cnblogs.com/leohahah/p/9413513.html

以下是个人微信公众号“一森咖记”，欢迎关注

oracle

「喜欢这篇文章，您的关注和赞赏是给作者最好的鼓励」

关注作者

文章被以下合辑收录

Oracle专栏——杨磊（共127篇）

本专栏将定期推动Oracle的一些有技术、有温度的系列文章。

【精品篇】_如何在不影响Oracle生产库性能的评估整库大小

文章被以下合辑收录

评论