Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[fix](mtmv) Fix refresh materialized view fail when mv def contains cte #44857

Merged
merged 3 commits into from
Dec 5, 2024

Conversation

seawinde
Copy link
Contributor

@seawinde seawinde commented Dec 2, 2024

What problem does this PR solve?

if mv def contain cte and is partition mv, such as following:

    CREATE MATERIALIZED VIEW mv_name
        BUILD IMMEDIATE REFRESH COMPLETE ON MANUAL
        PARTITION BY (l_shipdate)
        DISTRIBUTED BY RANDOM BUCKETS 2
        PROPERTIES ('replication_num' = '1') 
        AS
    WITH scan_data_cte as (
        select t1.l_shipdate, t1.L_LINENUMBER, orders.O_CUSTKEY, l_suppkey
        from (select * from lineitem where L_LINENUMBER > 1) t1
        left join orders on t1.L_ORDERKEY = orders.O_ORDERKEY
    )
    SELECT *  FROM scan_data_cte; 

if run refresh cmd as following, this would fail, throw exception no partition for this tuple, this pr fix this

refresh materialized view mv_name partitions(p_20231210_20231211);

Issue Number: close #xxx

Related PR: #33988

Problem Summary:

Release note

None

Check List (For Author)

  • Test

    • Regression test
    • Unit Test
    • Manual test (add detailed scripts or steps below)
    • No need to test or manual test. Explain why:
      • This is a refactor/code format and no logic has been changed.
      • Previous test can cover this change.
      • No code files have been changed.
      • Other reason
  • Behavior changed:

    • No.
    • Yes.
  • Does this need documentation?

    • No.
    • Yes.

Check List (For Reviewer who merge this PR)

  • Confirm the release note
  • Confirm test cases
  • Confirm document
  • Add branch pick label

@doris-robot
Copy link

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR.

Please clearly describe your PR:

  1. What problem was fixed (it's best to include specific error reporting information). How it was fixed.
  2. Which behaviors were modified. What was the previous behavior, what is it now, why was it modified, and what possible impacts might there be.
  3. What features were added. Why was this function added?
  4. Which code was refactored and why was this part of the code refactored?
  5. Which functions were optimized and what is the difference before and after the optimization?

@seawinde seawinde changed the title Fix cte filter add fail [fix](mtmv) Fix refresh materialized view fail when mv def contains cte Dec 2, 2024
@seawinde
Copy link
Contributor Author

seawinde commented Dec 2, 2024

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 40153 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 33cca52f58895ae7c926c664005eaf65580d55c4, data reload: false

------ Round 1 ----------------------------------
q1	17602	7451	7274	7274
q2	2043	188	170	170
q3	10802	1088	1222	1088
q4	10471	790	676	676
q5	7608	2782	2742	2742
q6	238	144	146	144
q7	1029	635	604	604
q8	9237	1845	1952	1845
q9	6731	6582	6508	6508
q10	7022	2310	2344	2310
q11	482	267	257	257
q12	431	231	225	225
q13	18503	3075	3057	3057
q14	275	222	210	210
q15	589	534	524	524
q16	669	595	588	588
q17	981	544	561	544
q18	7321	6766	6698	6698
q19	1332	928	922	922
q20	458	183	184	183
q21	4073	3265	3364	3265
q22	391	328	319	319
Total cold run time: 108288 ms
Total hot run time: 40153 ms

----- Round 2, with runtime_filter_mode=off -----
q1	7387	7373	7269	7269
q2	347	232	236	232
q3	3073	3142	3173	3142
q4	2157	1917	1945	1917
q5	5852	5812	5691	5691
q6	231	144	142	142
q7	2350	1825	1747	1747
q8	3438	3572	3538	3538
q9	9024	9084	9155	9084
q10	3585	3546	3583	3546
q11	617	531	523	523
q12	850	624	617	617
q13	16107	3241	3211	3211
q14	308	276	260	260
q15	586	523	519	519
q16	696	627	644	627
q17	1844	1636	1624	1624
q18	8365	7974	7563	7563
q19	1702	1422	1613	1422
q20	2110	1865	1920	1865
q21	5587	5377	5507	5377
q22	613	551	556	551
Total cold run time: 76829 ms
Total hot run time: 60467 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 198052 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 33cca52f58895ae7c926c664005eaf65580d55c4, data reload: false

query1	1237	955	954	954
query2	6233	2087	2064	2064
query3	10942	4409	4435	4409
query4	66660	28865	23496	23496
query5	4905	481	457	457
query6	410	196	178	178
query7	5555	310	302	302
query8	330	263	246	246
query9	8755	2695	2688	2688
query10	440	242	240	240
query11	17386	15400	16082	15400
query12	145	100	101	100
query13	1453	424	409	409
query14	10925	7637	7609	7609
query15	218	180	187	180
query16	7066	466	456	456
query17	1186	621	599	599
query18	1882	315	315	315
query19	212	167	168	167
query20	121	116	112	112
query21	221	105	104	104
query22	4817	4552	4559	4552
query23	35109	34261	34365	34261
query24	6014	2644	2594	2594
query25	484	393	382	382
query26	648	151	152	151
query27	1952	289	287	287
query28	4256	2465	2464	2464
query29	683	438	423	423
query30	218	157	150	150
query31	1020	833	845	833
query32	65	55	55	55
query33	410	355	329	329
query34	949	518	547	518
query35	915	785	779	779
query36	1095	971	950	950
query37	118	66	74	66
query38	4485	4557	4383	4383
query39	1528	1522	1480	1480
query40	200	104	101	101
query41	46	43	42	42
query42	113	100	101	100
query43	542	505	506	505
query44	1182	818	821	818
query45	202	171	178	171
query46	1199	730	777	730
query47	2060	1912	1936	1912
query48	418	308	319	308
query49	726	407	420	407
query50	861	397	401	397
query51	7419	7214	7133	7133
query52	98	88	92	88
query53	254	183	178	178
query54	522	397	392	392
query55	79	74	74	74
query56	241	236	242	236
query57	1273	1121	1105	1105
query58	218	215	236	215
query59	3250	3010	2795	2795
query60	263	234	239	234
query61	109	106	108	106
query62	816	691	678	678
query63	219	201	186	186
query64	1466	707	660	660
query65	3308	3237	3228	3228
query66	717	340	304	304
query67	16277	15662	15687	15662
query68	3832	578	559	559
query69	444	249	254	249
query70	1142	1135	1134	1134
query71	360	248	265	248
query72	6435	4051	4097	4051
query73	751	367	365	365
query74	10376	9083	9222	9083
query75	3358	2706	2655	2655
query76	1854	1025	1123	1025
query77	511	291	276	276
query78	10490	9481	9396	9396
query79	1407	621	605	605
query80	889	446	511	446
query81	486	229	234	229
query82	1295	125	119	119
query83	253	148	139	139
query84	278	70	71	70
query85	889	289	292	289
query86	333	308	299	299
query87	4650	4590	4628	4590
query88	3720	2213	2174	2174
query89	421	311	305	305
query90	2021	193	194	193
query91	136	100	105	100
query92	69	56	52	52
query93	1952	535	544	535
query94	792	291	287	287
query95	348	249	248	248
query96	619	275	276	275
query97	2844	2648	2715	2648
query98	222	191	204	191
query99	1647	1354	1310	1310
Total cold run time: 319950 ms
Total hot run time: 198052 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 33.44 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 33cca52f58895ae7c926c664005eaf65580d55c4, data reload: false

query1	0.03	0.02	0.03
query2	0.07	0.03	0.03
query3	0.23	0.07	0.07
query4	1.63	0.10	0.11
query5	0.40	0.41	0.41
query6	1.17	0.66	0.66
query7	0.02	0.01	0.01
query8	0.05	0.03	0.03
query9	0.59	0.51	0.51
query10	0.54	0.55	0.57
query11	0.14	0.10	0.10
query12	0.13	0.11	0.11
query13	0.60	0.62	0.61
query14	2.84	2.74	2.87
query15	0.92	0.83	0.83
query16	0.39	0.37	0.38
query17	1.03	1.04	1.01
query18	0.21	0.21	0.20
query19	1.94	1.82	2.01
query20	0.01	0.01	0.01
query21	15.36	0.60	0.59
query22	2.61	2.09	2.09
query23	16.89	1.07	0.88
query24	2.59	1.46	2.12
query25	0.35	0.10	0.12
query26	0.61	0.15	0.15
query27	0.05	0.05	0.04
query28	9.87	1.11	1.09
query29	12.52	3.19	3.18
query30	0.25	0.06	0.07
query31	2.87	0.40	0.37
query32	3.27	0.46	0.48
query33	2.98	3.05	3.05
query34	17.09	4.46	4.46
query35	4.48	4.51	4.48
query36	0.67	0.49	0.50
query37	0.09	0.06	0.06
query38	0.05	0.04	0.03
query39	0.03	0.03	0.02
query40	0.16	0.12	0.13
query41	0.08	0.02	0.02
query42	0.04	0.02	0.02
query43	0.03	0.04	0.03
Total cold run time: 105.88 s
Total hot run time: 33.44 s

morrySnow
morrySnow previously approved these changes Dec 2, 2024
@github-actions github-actions bot added the approved Indicates a PR has been approved by one committer. label Dec 2, 2024
Copy link
Contributor

github-actions bot commented Dec 2, 2024

PR approved by at least one committer and no changes requested.

Copy link
Contributor

github-actions bot commented Dec 2, 2024

PR approved by anyone and no changes requested.

@seawinde
Copy link
Contributor Author

seawinde commented Dec 4, 2024

run buildall

@github-actions github-actions bot removed the approved Indicates a PR has been approved by one committer. label Dec 4, 2024
@doris-robot
Copy link

TPC-H: Total hot run time: 40068 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 4f6c7f5c6d0be032af47d0d389c7df502eda46d9, data reload: false

------ Round 1 ----------------------------------
q1	17603	7491	7292	7292
q2	2040	177	166	166
q3	10596	1084	1180	1084
q4	10542	728	721	721
q5	7596	2744	2654	2654
q6	235	148	146	146
q7	998	610	596	596
q8	9251	1870	1898	1870
q9	6720	6520	6536	6520
q10	7047	2305	2318	2305
q11	475	257	261	257
q12	413	228	223	223
q13	17769	3003	3011	3003
q14	243	205	217	205
q15	579	535	524	524
q16	657	572	574	572
q17	983	587	524	524
q18	7263	6778	6708	6708
q19	1333	1043	969	969
q20	466	178	187	178
q21	4105	3259	3232	3232
q22	383	328	319	319
Total cold run time: 107297 ms
Total hot run time: 40068 ms

----- Round 2, with runtime_filter_mode=off -----
q1	7334	7282	7264	7264
q2	327	233	238	233
q3	2944	2875	2957	2875
q4	2057	1808	1841	1808
q5	5701	5813	5642	5642
q6	238	144	147	144
q7	2238	1801	1804	1801
q8	3426	3602	3550	3550
q9	8997	9063	9051	9051
q10	3614	3563	3623	3563
q11	611	507	510	507
q12	822	617	633	617
q13	10163	3206	3257	3206
q14	309	275	274	274
q15	580	535	535	535
q16	701	664	642	642
q17	1868	1626	1573	1573
q18	8186	7688	7602	7602
q19	1721	1659	1504	1504
q20	2088	1862	1857	1857
q21	5750	5491	5476	5476
q22	688	578	566	566
Total cold run time: 70363 ms
Total hot run time: 60290 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 196035 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 4f6c7f5c6d0be032af47d0d389c7df502eda46d9, data reload: false

query1	1269	940	935	935
query2	6246	2064	2081	2064
query3	10927	4343	4404	4343
query4	67175	28526	23435	23435
query5	4992	457	428	428
query6	422	195	182	182
query7	5578	299	296	296
query8	320	239	248	239
query9	8756	2741	2738	2738
query10	450	249	253	249
query11	17222	15120	15826	15120
query12	158	100	101	100
query13	1495	447	420	420
query14	9941	7682	6646	6646
query15	216	190	208	190
query16	7119	509	444	444
query17	1267	577	562	562
query18	1805	292	296	292
query19	199	143	146	143
query20	119	114	139	114
query21	205	101	98	98
query22	4726	4609	4576	4576
query23	35018	34328	34562	34328
query24	5481	2411	2524	2411
query25	480	400	392	392
query26	666	151	150	150
query27	1732	278	291	278
query28	4712	2491	2496	2491
query29	683	443	393	393
query30	208	148	150	148
query31	1018	852	810	810
query32	67	56	55	55
query33	397	293	291	291
query34	932	526	531	526
query35	895	740	737	737
query36	1101	977	958	958
query37	119	76	72	72
query38	4434	4407	4413	4407
query39	1514	1476	1476	1476
query40	209	96	95	95
query41	45	41	42	41
query42	106	99	96	96
query43	541	504	503	503
query44	1198	816	817	816
query45	186	167	172	167
query46	1160	739	702	702
query47	2080	1946	1975	1946
query48	415	311	307	307
query49	713	382	398	382
query50	834	386	381	381
query51	7316	7219	7028	7028
query52	96	86	88	86
query53	258	176	174	174
query54	504	375	402	375
query55	73	75	69	69
query56	271	218	221	218
query57	1252	1142	1137	1137
query58	214	207	203	203
query59	3097	2891	3031	2891
query60	270	238	252	238
query61	118	107	108	107
query62	770	679	656	656
query63	207	183	187	183
query64	1383	662	634	634
query65	3241	3178	3209	3178
query66	695	292	300	292
query67	16044	15554	15657	15554
query68	4240	559	550	550
query69	410	259	246	246
query70	1160	1153	1100	1100
query71	344	247	242	242
query72	6426	4098	4115	4098
query73	756	349	355	349
query74	10296	9071	9174	9071
query75	3388	2637	2661	2637
query76	1837	1162	1061	1061
query77	485	255	256	255
query78	10303	9475	9411	9411
query79	1302	590	591	590
query80	924	414	476	414
query81	523	237	234	234
query82	211	116	113	113
query83	173	139	143	139
query84	285	69	66	66
query85	984	295	296	295
query86	380	296	293	293
query87	4704	4643	4600	4600
query88	4005	2182	2174	2174
query89	402	292	280	280
query90	2139	183	182	182
query91	135	102	102	102
query92	60	49	49	49
query93	2145	541	513	513
query94	918	286	285	285
query95	342	246	255	246
query96	616	278	275	275
query97	2831	2645	2652	2645
query98	214	195	194	194
query99	1592	1335	1330	1330
Total cold run time: 317890 ms
Total hot run time: 196035 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 32.74 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 4f6c7f5c6d0be032af47d0d389c7df502eda46d9, data reload: false

query1	0.04	0.03	0.04
query2	0.06	0.03	0.03
query3	0.23	0.07	0.08
query4	1.63	0.11	0.11
query5	0.43	0.42	0.43
query6	1.19	0.66	0.64
query7	0.02	0.02	0.02
query8	0.04	0.03	0.03
query9	0.58	0.51	0.50
query10	0.56	0.55	0.57
query11	0.14	0.10	0.10
query12	0.13	0.11	0.10
query13	0.62	0.60	0.59
query14	2.81	2.74	2.83
query15	0.90	0.83	0.82
query16	0.39	0.39	0.38
query17	1.00	1.02	1.02
query18	0.22	0.21	0.21
query19	1.85	1.84	2.02
query20	0.02	0.01	0.01
query21	15.40	0.61	0.59
query22	2.73	2.55	2.14
query23	17.17	1.05	0.89
query24	2.85	1.72	0.71
query25	0.30	0.05	0.14
query26	0.48	0.14	0.14
query27	0.04	0.04	0.05
query28	10.51	1.11	1.08
query29	12.58	3.26	3.20
query30	0.26	0.07	0.06
query31	2.87	0.37	0.39
query32	3.27	0.46	0.46
query33	2.98	3.09	3.15
query34	17.33	4.42	4.54
query35	4.49	4.53	4.51
query36	0.68	0.47	0.50
query37	0.09	0.06	0.06
query38	0.05	0.04	0.04
query39	0.03	0.02	0.03
query40	0.17	0.12	0.13
query41	0.08	0.02	0.02
query42	0.03	0.02	0.02
query43	0.03	0.03	0.03
Total cold run time: 107.28 s
Total hot run time: 32.74 s

Copy link
Contributor

github-actions bot commented Dec 5, 2024

PR approved by at least one committer and no changes requested.

@github-actions github-actions bot added the approved Indicates a PR has been approved by one committer. label Dec 5, 2024
@morrySnow morrySnow merged commit 8ea48af into apache:master Dec 5, 2024
23 of 24 checks passed
github-actions bot pushed a commit that referenced this pull request Dec 5, 2024
…te (#44857)

### What problem does this PR solve?

Related PR: #33988

Problem Summary:

if mv def contain cte and is partition mv, such as following:

    CREATE MATERIALIZED VIEW mv_name
        BUILD IMMEDIATE REFRESH COMPLETE ON MANUAL
        PARTITION BY (l_shipdate)
        DISTRIBUTED BY RANDOM BUCKETS 2
        PROPERTIES ('replication_num' = '1') 
        AS
    WITH scan_data_cte as (
        select t1.l_shipdate, t1.L_LINENUMBER, orders.O_CUSTKEY, l_suppkey
        from (select * from lineitem where L_LINENUMBER > 1) t1
        left join orders on t1.L_ORDERKEY = orders.O_ORDERKEY
    )
    SELECT *  FROM scan_data_cte; 

if run refresh cmd as following, this would fail, throw exception `no
partition for this tuple`, this pr fix this

refresh materialized view mv_name partitions(p_20231210_20231211);

### Release note

Fix refresh materialized view fail when mv def contains cte
github-actions bot pushed a commit that referenced this pull request Dec 5, 2024
…te (#44857)

### What problem does this PR solve?

Related PR: #33988

Problem Summary:

if mv def contain cte and is partition mv, such as following:

    CREATE MATERIALIZED VIEW mv_name
        BUILD IMMEDIATE REFRESH COMPLETE ON MANUAL
        PARTITION BY (l_shipdate)
        DISTRIBUTED BY RANDOM BUCKETS 2
        PROPERTIES ('replication_num' = '1') 
        AS
    WITH scan_data_cte as (
        select t1.l_shipdate, t1.L_LINENUMBER, orders.O_CUSTKEY, l_suppkey
        from (select * from lineitem where L_LINENUMBER > 1) t1
        left join orders on t1.L_ORDERKEY = orders.O_ORDERKEY
    )
    SELECT *  FROM scan_data_cte; 

if run refresh cmd as following, this would fail, throw exception `no
partition for this tuple`, this pr fix this

refresh materialized view mv_name partitions(p_20231210_20231211);

### Release note

Fix refresh materialized view fail when mv def contains cte
yiguolei pushed a commit that referenced this pull request Dec 6, 2024
…te (#44857)

### What problem does this PR solve?

Related PR: #33988

Problem Summary:

if mv def contain cte and is partition mv, such as following:

    CREATE MATERIALIZED VIEW mv_name
        BUILD IMMEDIATE REFRESH COMPLETE ON MANUAL
        PARTITION BY (l_shipdate)
        DISTRIBUTED BY RANDOM BUCKETS 2
        PROPERTIES ('replication_num' = '1') 
        AS
    WITH scan_data_cte as (
        select t1.l_shipdate, t1.L_LINENUMBER, orders.O_CUSTKEY, l_suppkey
        from (select * from lineitem where L_LINENUMBER > 1) t1
        left join orders on t1.L_ORDERKEY = orders.O_ORDERKEY
    )
    SELECT *  FROM scan_data_cte; 

if run refresh cmd as following, this would fail, throw exception `no
partition for this tuple`, this pr fix this

refresh materialized view mv_name partitions(p_20231210_20231211);

### Release note

Fix refresh materialized view fail when mv def contains cte
seawinde added a commit to seawinde/doris that referenced this pull request Dec 6, 2024
…te (apache#44857)

### What problem does this PR solve?

Related PR: apache#33988

Problem Summary:

if mv def contain cte and is partition mv, such as following:

    CREATE MATERIALIZED VIEW mv_name
        BUILD IMMEDIATE REFRESH COMPLETE ON MANUAL
        PARTITION BY (l_shipdate)
        DISTRIBUTED BY RANDOM BUCKETS 2
        PROPERTIES ('replication_num' = '1') 
        AS
    WITH scan_data_cte as (
        select t1.l_shipdate, t1.L_LINENUMBER, orders.O_CUSTKEY, l_suppkey
        from (select * from lineitem where L_LINENUMBER > 1) t1
        left join orders on t1.L_ORDERKEY = orders.O_ORDERKEY
    )
    SELECT *  FROM scan_data_cte; 

if run refresh cmd as following, this would fail, throw exception `no
partition for this tuple`, this pr fix this

refresh materialized view mv_name partitions(p_20231210_20231211);

### Release note

Fix refresh materialized view fail when mv def contains cte
seawinde added a commit to seawinde/doris that referenced this pull request Dec 6, 2024
…te (apache#44857)

### What problem does this PR solve?

Related PR: apache#33988

Problem Summary:

if mv def contain cte and is partition mv, such as following:

    CREATE MATERIALIZED VIEW mv_name
        BUILD IMMEDIATE REFRESH COMPLETE ON MANUAL
        PARTITION BY (l_shipdate)
        DISTRIBUTED BY RANDOM BUCKETS 2
        PROPERTIES ('replication_num' = '1') 
        AS
    WITH scan_data_cte as (
        select t1.l_shipdate, t1.L_LINENUMBER, orders.O_CUSTKEY, l_suppkey
        from (select * from lineitem where L_LINENUMBER > 1) t1
        left join orders on t1.L_ORDERKEY = orders.O_ORDERKEY
    )
    SELECT *  FROM scan_data_cte; 

if run refresh cmd as following, this would fail, throw exception `no
partition for this tuple`, this pr fix this

refresh materialized view mv_name partitions(p_20231210_20231211);

### Release note

Fix refresh materialized view fail when mv def contains cte
morrySnow pushed a commit that referenced this pull request Dec 9, 2024
@yiguolei yiguolei mentioned this pull request Jan 19, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by one committer. dev/2.1.8-merged dev/3.0.4-merged reviewed
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants