Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[feature](cast) Support cast complex type to json #36548

Merged
merged 6 commits into from
Jun 24, 2024

Conversation

amorynan
Copy link
Contributor

Proposed changes

in this pr we do

  1. support cast complex type to json which we can use array_agg/map_agg result into json_object
    Issue Number: close #xxx

@doris-robot
Copy link

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR

Since 2024-03-18, the Document has been moved to doris-website.
See Doris Document.

@morrySnow morrySnow changed the title [Feature](function) Support complex type to json [feature](cast) Support cast complex type to json Jun 19, 2024
@amorynan
Copy link
Contributor Author

run buildall

@@ -834,6 +834,10 @@ public static boolean isImplicitlyCastable(Type t1, Type t2, boolean strict, boo
}

public static boolean canCastTo(Type sourceType, Type targetType) {
// In BE Code , we make any data type can cast to jsonb
if (targetType.isJsonbType()) {
return true;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

agg_state, variant could cast to json? what we will get if cast ipv4 /ipv6 to json.

@@ -834,6 +834,10 @@ public static boolean isImplicitlyCastable(Type t1, Type t2, boolean strict, boo
}

public static boolean canCastTo(Type sourceType, Type targetType) {
// In BE Code , we make any data type can cast to jsonb
if (targetType.isJsonbType()) {
return true;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

if support all type, u should add all type cast to json test case

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

add limits

Comment on lines 123 to 131
if (complexType.isMapType()) {
return ((MapType) complexType).getKeyType().isStringType();
} else if (complexType.isArrayType()) {
return checkMapKeyIsStringLikeForJson(((ArrayType) complexType).getItemType());
} else if (complexType.isStructType()) {
for (StructField f : ((StructType) complexType).getFields()) {
return checkMapKeyIsStringLikeForJson(f.getType());
}
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should check in nereids, not here

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done!

@doris-robot
Copy link

TPC-H: Total hot run time: 39887 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 5e4b75d8fc60f19b9cabffa2794dc88f4078510c, data reload: false

------ Round 1 ----------------------------------
q1	13691	4448	4339	4339
q2	1693	195	196	195
q3	8162	1112	1119	1112
q4	7834	830	760	760
q5	5746	2671	2625	2625
q6	218	143	137	137
q7	964	620	632	620
q8	7083	2078	2070	2070
q9	7073	6470	6465	6465
q10	9028	3732	3755	3732
q11	428	245	247	245
q12	405	235	231	231
q13	14209	3015	2992	2992
q14	272	224	220	220
q15	518	475	470	470
q16	483	396	374	374
q17	957	707	669	669
q18	8038	7491	7372	7372
q19	2056	1536	1463	1463
q20	657	315	318	315
q21	5068	3147	3958	3147
q22	395	334	336	334
Total cold run time: 94978 ms
Total hot run time: 39887 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4356	4255	4287	4255
q2	377	265	258	258
q3	2980	2753	2740	2740
q4	1913	1621	1651	1621
q5	5288	5289	5330	5289
q6	222	129	132	129
q7	2207	1806	1783	1783
q8	3272	3399	3368	3368
q9	8711	8731	8729	8729
q10	4080	3862	3974	3862
q11	595	510	526	510
q12	814	649	623	623
q13	14470	3144	3072	3072
q14	308	268	261	261
q15	517	469	485	469
q16	504	447	450	447
q17	1823	1505	1481	1481
q18	8191	7877	7438	7438
q19	1745	1520	1479	1479
q20	2966	1812	1785	1785
q21	4856	4710	4703	4703
q22	648	535	564	535
Total cold run time: 70843 ms
Total hot run time: 54837 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 172075 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 5e4b75d8fc60f19b9cabffa2794dc88f4078510c, data reload: false

query1	893	381	378	378
query2	6484	2556	2378	2378
query3	6413	204	215	204
query4	18870	17509	17157	17157
query5	4138	489	452	452
query6	243	154	166	154
query7	4539	295	293	293
query8	332	294	295	294
query9	8605	2399	2362	2362
query10	558	295	274	274
query11	10576	10143	10171	10143
query12	146	86	81	81
query13	1604	356	358	356
query14	9301	7548	7086	7086
query15	236	193	197	193
query16	7145	261	259	259
query17	1049	528	567	528
query18	1910	270	270	270
query19	197	151	159	151
query20	94	82	83	82
query21	203	128	125	125
query22	4294	4116	3955	3955
query23	33766	32856	32997	32856
query24	10801	2835	2808	2808
query25	588	350	355	350
query26	705	152	153	152
query27	2290	317	310	310
query28	5794	2047	2037	2037
query29	871	628	604	604
query30	269	153	152	152
query31	973	722	747	722
query32	95	52	55	52
query33	697	280	269	269
query34	868	454	454	454
query35	712	613	617	613
query36	1117	888	933	888
query37	141	72	72	72
query38	2871	2751	2737	2737
query39	857	789	778	778
query40	198	130	123	123
query41	56	51	50	50
query42	119	94	103	94
query43	558	568	557	557
query44	1063	714	719	714
query45	197	168	164	164
query46	1069	733	690	690
query47	1872	1798	1806	1798
query48	354	296	289	289
query49	979	400	425	400
query50	753	405	374	374
query51	6800	6707	6657	6657
query52	97	96	92	92
query53	362	291	282	282
query54	860	434	425	425
query55	74	74	74	74
query56	275	256	257	256
query57	1115	1048	1051	1048
query58	250	233	248	233
query59	3278	3230	3061	3061
query60	290	277	268	268
query61	91	89	121	89
query62	622	453	430	430
query63	320	286	286	286
query64	8645	2240	1707	1707
query65	3214	3283	3115	3115
query66	1050	377	334	334
query67	15328	14845	14874	14845
query68	4559	532	538	532
query69	457	299	310	299
query70	1157	1027	1082	1027
query71	403	265	266	265
query72	7234	5661	5896	5661
query73	754	322	322	322
query74	5988	5525	5451	5451
query75	3386	2639	2674	2639
query76	2261	962	858	858
query77	476	298	288	288
query78	10281	9931	9736	9736
query79	2232	508	504	504
query80	1003	453	462	453
query81	588	221	214	214
query82	710	104	100	100
query83	245	165	171	165
query84	276	88	84	84
query85	1772	300	299	299
query86	491	382	313	313
query87	3250	3112	3051	3051
query88	4217	2320	2313	2313
query89	467	379	373	373
query90	1830	188	186	186
query91	128	103	95	95
query92	57	47	49	47
query93	2458	514	502	502
query94	1237	183	182	182
query95	413	309	312	309
query96	602	265	259	259
query97	3211	3035	3026	3026
query98	220	210	205	205
query99	1227	861	833	833
Total cold run time: 264724 ms
Total hot run time: 172075 ms

@amorynan
Copy link
Contributor Author

run buildall

@amorynan
Copy link
Contributor Author

run buildall

Copy link
Contributor

PR approved by at least one committer and no changes requested.

@github-actions github-actions bot added approved Indicates a PR has been approved by one committer. reviewed labels Jun 20, 2024
Copy link
Contributor

PR approved by anyone and no changes requested.

Copy link
Member

@eldenmoon eldenmoon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@doris-robot
Copy link

TPC-H: Total hot run time: 39579 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 65d09a3e2ab7b6e2a92a5fd51e27140570e9fc4f, data reload: false

------ Round 1 ----------------------------------
q1	17874	4512	4376	4376
q2	2018	211	194	194
q3	10514	1073	1168	1073
q4	10253	851	697	697
q5	7466	2645	2593	2593
q6	217	130	132	130
q7	931	593	600	593
q8	9224	2065	2053	2053
q9	9116	6519	6531	6519
q10	8969	3715	3730	3715
q11	457	244	235	235
q12	460	221	223	221
q13	17802	3001	2977	2977
q14	261	216	213	213
q15	544	479	468	468
q16	529	371	370	370
q17	958	692	701	692
q18	7982	7385	7305	7305
q19	5738	1428	1380	1380
q20	667	317	318	317
q21	4968	3125	3918	3125
q22	384	334	333	333
Total cold run time: 117332 ms
Total hot run time: 39579 ms

----- Round 2, with runtime_filter_mode=off -----
q1	4462	4314	4230	4230
q2	379	269	271	269
q3	2953	2876	2878	2876
q4	2004	1699	1692	1692
q5	5633	5515	5488	5488
q6	220	130	136	130
q7	2213	1827	1802	1802
q8	3273	3427	3407	3407
q9	8710	8730	8861	8730
q10	4110	3926	3772	3772
q11	582	521	500	500
q12	824	656	663	656
q13	16259	3189	3145	3145
q14	311	278	270	270
q15	553	479	482	479
q16	494	435	440	435
q17	1827	1528	1504	1504
q18	8124	7720	7785	7720
q19	1840	1552	1484	1484
q20	3032	1884	1879	1879
q21	5253	5069	4994	4994
q22	634	561	590	561
Total cold run time: 73690 ms
Total hot run time: 56023 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 173087 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 65d09a3e2ab7b6e2a92a5fd51e27140570e9fc4f, data reload: false

query1	939	399	363	363
query2	6437	2585	2237	2237
query3	6622	207	210	207
query4	19427	17281	17413	17281
query5	3622	489	472	472
query6	231	161	157	157
query7	4586	293	289	289
query8	331	320	278	278
query9	8553	2387	2374	2374
query10	574	290	276	276
query11	10633	10049	10075	10049
query12	111	91	81	81
query13	1629	378	372	372
query14	10196	6768	7045	6768
query15	241	190	188	188
query16	7489	264	267	264
query17	1467	570	539	539
query18	1947	287	278	278
query19	198	157	161	157
query20	90	85	86	85
query21	211	124	129	124
query22	4329	4150	4037	4037
query23	33894	33750	33564	33564
query24	11173	2895	2934	2895
query25	623	380	376	376
query26	1187	158	162	158
query27	2254	314	329	314
query28	6900	2113	2107	2107
query29	911	631	627	627
query30	253	159	155	155
query31	970	743	764	743
query32	91	50	54	50
query33	762	280	310	280
query34	1003	501	470	470
query35	799	637	638	637
query36	1102	988	997	988
query37	153	77	75	75
query38	2972	2831	2883	2831
query39	906	812	850	812
query40	222	128	130	128
query41	62	50	51	50
query42	106	101	106	101
query43	603	567	539	539
query44	1213	735	716	716
query45	188	167	155	155
query46	1080	706	717	706
query47	1852	1761	1769	1761
query48	372	328	290	290
query49	848	414	415	414
query50	775	380	379	379
query51	6869	6866	6736	6736
query52	117	91	96	91
query53	368	292	281	281
query54	861	431	435	431
query55	74	72	73	72
query56	279	269	264	264
query57	1170	1028	1049	1028
query58	248	238	276	238
query59	3259	3237	3164	3164
query60	291	269	275	269
query61	97	96	93	93
query62	605	440	438	438
query63	316	300	304	300
query64	9007	2308	1753	1753
query65	3158	3066	3103	3066
query66	762	324	329	324
query67	15492	15101	14894	14894
query68	4479	520	511	511
query69	525	439	360	360
query70	1175	1148	1144	1144
query71	392	292	277	277
query72	7210	5481	5200	5200
query73	752	324	323	323
query74	5931	5601	5494	5494
query75	3353	2659	2658	2658
query76	2293	953	870	870
query77	434	300	303	300
query78	10568	9788	9750	9750
query79	2512	524	501	501
query80	2315	468	472	468
query81	573	229	232	229
query82	829	114	105	105
query83	313	177	172	172
query84	264	93	84	84
query85	1957	299	274	274
query86	481	342	282	282
query87	3293	3095	3070	3070
query88	4253	2339	2349	2339
query89	484	382	382	382
query90	1718	195	190	190
query91	131	101	102	101
query92	63	52	52	52
query93	2354	491	496	491
query94	1073	189	188	188
query95	409	311	311	311
query96	596	268	269	268
query97	3242	3097	3027	3027
query98	214	205	194	194
query99	1128	833	849	833
Total cold run time: 271424 ms
Total hot run time: 173087 ms

@doris-robot
Copy link

ClickBench: Total hot run time: 30.49 s
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/clickbench-tools
ClickBench test result on commit 65d09a3e2ab7b6e2a92a5fd51e27140570e9fc4f, data reload: false

query1	0.04	0.03	0.03
query2	0.08	0.04	0.04
query3	0.22	0.05	0.04
query4	1.69	0.06	0.07
query5	0.52	0.48	0.50
query6	1.13	0.72	0.72
query7	0.02	0.02	0.01
query8	0.04	0.04	0.04
query9	0.55	0.50	0.48
query10	0.54	0.53	0.55
query11	0.15	0.11	0.11
query12	0.15	0.12	0.12
query13	0.59	0.58	0.59
query14	0.77	0.79	0.77
query15	0.84	0.82	0.80
query16	0.37	0.38	0.35
query17	1.04	0.98	0.96
query18	0.24	0.22	0.25
query19	1.82	1.72	1.73
query20	0.01	0.01	0.01
query21	15.41	0.67	0.67
query22	4.03	7.49	1.78
query23	18.27	1.43	1.28
query24	2.13	0.22	0.23
query25	0.16	0.08	0.08
query26	0.27	0.17	0.18
query27	0.08	0.09	0.09
query28	13.24	1.01	1.01
query29	12.60	3.30	3.31
query30	0.25	0.06	0.06
query31	2.88	0.39	0.38
query32	3.27	0.47	0.48
query33	2.85	2.95	2.94
query34	17.15	4.50	4.40
query35	4.52	4.52	4.50
query36	0.65	0.48	0.46
query37	0.17	0.15	0.16
query38	0.16	0.14	0.14
query39	0.04	0.04	0.04
query40	0.16	0.14	0.17
query41	0.09	0.05	0.05
query42	0.06	0.05	0.05
query43	0.05	0.05	0.04
Total cold run time: 109.3 s
Total hot run time: 30.49 s

@morrySnow morrySnow merged commit d2b0cd5 into apache:master Jun 24, 2024
28 of 31 checks passed
dataroaring pushed a commit that referenced this pull request Jun 26, 2024
support cast complex type to json which we can use array_agg/map_agg
result into json_object
@xiaokang xiaokang added usercase Important user case type label dev/2.1.x labels Jul 31, 2024
yiguolei pushed a commit that referenced this pull request Aug 1, 2024
## Proposed changes
backport: #36548
Issue Number: close #xxx

<!--Describe your changes.-->
@yiguolei yiguolei removed the dev/2.1.x label Aug 1, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by one committer. dev/2.1.6-merged dev/3.0.0-merged reviewed usercase Important user case type label
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants