- 微信
- 微博
  
  分享文章到微博
- 复制链接
  
  复制链接到剪贴板

GaussDB(DWS)调优分享：delete inlist改写优化

Arrow0lf 发表于 2024/02/04 11:45:29 2024/02/04

【摘要】 GaussDB(DWS)对查询场景下，inlist长的场景进行了优化，可以将inlist的过滤转化为join操作提高性能，由参数qrw_inlist2join_optmode控制。但是该优化仅针对查询select场景生效，对delete场景无效，示例如下：postgres=# set qrw_inlist2join_optmode = rule_base;SETpostgres=# expl...

GaussDB(DWS)对查询时inlist过长的场景进行了优化，可以将inlist的过滤转化为join操作提高性能，由参数qrw_inlist2join_optmode控制，详情参考：https://bbs.huaweicloud.com/blogs/285431。

但是该优化仅针对查询select场景生效，对delete场景无效，delete场景示例如下：

postgres=# set qrw_inlist2join_optmode = rule_base;
SET
postgres=# explain verbose delete from student where sno in(1,2,3,4,5);
                                            QUERY PLAN                                            
--------------------------------------------------------------------------------------------------
  id |              operation               | E-rows | E-distinct | E-memory | E-width | E-costs 
 ----+--------------------------------------+--------+------------+----------+---------+---------
   1 | ->  Streaming (type: GATHER)         |      1 |            |          |      14 | 5.38    
   2 |    ->  Delete on public.student      |      5 |            |          |      14 | 5.23    
   3 |       ->  Seq Scan on public.student |      5 |            | 1MB      |      14 | 1.23    
 
         Predicate Information (identified by plan id)         
 --------------------------------------------------------------
   3 --Seq Scan on public.student
         Filter: (student.sno = ANY ('{1,2,3,4,5}'::integer[]))
 
 Targetlist Information (identified by plan id)
 ----------------------------------------------
   1 --Streaming (type: GATHER)
         Node/s: All datanodes
   3 --Seq Scan on public.student
         Output: sno, ctid, xc_node_id
 
   ====== Query Summary =====   
 -------------------------------
 System available mem: 4710400KB
 Query Max mem: 4710400KB
 Query estimated mem: 1024KB
(23 rows)

当student表数据量很大，in的列表很长时，该语句性能很差，可以通过如下改写方法，手动改写为join的方式，提高delete性能：

上述查询改写为：

delete from student USING (
    WITH val(id) AS(
        VALUES (1), (2), (3), (4), (5)
    )
    SELECT * FROM val
) tmp 
where student.sno = tmp.id;

改写后计划如下：

                                              QUERY PLAN                                               
-------------------------------------------------------------------------------------------------------
  id |                 operation                 | E-rows | E-distinct | E-memory | E-width | E-costs 
 ----+-------------------------------------------+--------+------------+----------+---------+---------
   1 | ->  Streaming (type: GATHER)              |      1 |            |          |      18 | 5.52    
   2 |    ->  Delete on public.student           |      5 |            |          |      18 | 5.36    
   3 |       ->  Hash Join (4,5)                 |      5 |            | 1MB      |      18 | 1.36    
   4 |          ->  Seq Scan on public.student   |     57 | 14         | 1MB      |      14 | 1.14    
   5 |          ->  Hash                         |     20 | 5          | 16MB     |       4 | 0.11    
   6 |             ->  Values Scan on "*VALUES*" |     20 |            | 1MB      |       4 | 0.06    
 
     Predicate Information (identified by plan id)    
 -----------------------------------------------------
   3 --Hash Join (4,5)
         Hash Cond: (student.sno = "*VALUES*".column1)
 
                  Targetlist Information (identified by plan id)                  
 ---------------------------------------------------------------------------------
   1 --Streaming (type: GATHER)
         Node/s: All datanodes
   3 --Hash Join (4,5)
         Output: student.sno, student.ctid, student.xc_node_id, "*VALUES*".column1
   4 --Seq Scan on public.student
         Output: student.sno, student.ctid, student.xc_node_id
   5 --Hash
         Output: "*VALUES*".column1
   6 --Values Scan on "*VALUES*"
         Output: "*VALUES*".column1
 
   ====== Query Summary =====   
 -------------------------------
 System available mem: 4710400KB
 Query Max mem: 4710400KB
 Query estimated mem: 3328KB
(32 rows)

可以看到原先的单表inlist条件被转化成了join操作，该方式对于inlist过长的场景可以极大提升效率。

【声明】本内容来自华为云开发者社区博主，不代表华为云及华为云开发者社区的观点和立场。转载时必须标注文章的来源（华为云社区）、文章链接、文章作者等基本信息，否则作者和本社区有权追究责任。如果您发现本社区中有涉嫌抄袭的内容，欢迎发送邮件进行举报，并提供相关证据，一经查实，本社区将立刻删除涉嫌侵权内容，举报邮箱： cloudbbs@huaweicloud.com

点赞
收藏
关注作者

0/1000

抱歉，系统识别当前为高风险访问，暂不支持该操作

全部回复

上滑加载中

设置昵称

在此一键设置昵称，即可参与社区互动！

*长度不超过10个汉字或20个英文字符，设置后3个月内不可修改。

确认取消

加入云驻计划，成为创作者

华为云周边好礼
免费体验产品
特殊身份标识
线下官方门票
内部专家零距离
与10000+优质创作者共同成长

立即加入

GaussDB(DWS)调优分享：delete inlist改写优化

全部回复

设置昵称

关于作者

目录

加入云驻计划，成为创作者

GaussDB(DWS)调优分享：delete inlist改写优化

全部回复

设置昵称

关于作者

目录

热门推荐查看更多

相关文章

加入云驻计划，成为创作者

相关产品