copybook meta data for RDBMS #634

sree018 · 2023-07-27T12:13:58Z

Background

Currently, copybook metadata comes as spark schema, we need schema as rdbms level

Example [Optional]

'''
01 MASTER-RECORD.
02 RDT-TLF-MTHD-NM PIC X(08).
02 RDT-ADJ-ORGN-TRAN-DT PIC 9(06).
02 FILLER PIC X(03).
02 RDT-ADDL-DATA-GROUP.
05 RDT-ADDL-DATA OCCURS 0 TO 2 TIMES
DEPENDING ON RDT-ADDL-SEGS-NO.
10 RDT-ADDL-SEG-KEY.
15 RDT-ADDL-SEG-KEY-PROD PIC X(02).
15 RDT-ADDL-SEG-KEY-TYPE PIC S9(15)V99 COMP-3.
'''
Current Schema:
root
|-- RDT-TLF-MTHD-NM String
|-- RDT-ADJ-ORGN-TRAN-DT integer
|-- RDT-ADDL-DATA-GROUP
|-- RDT-ADDL-SEG-KEY
|-- RDT-ADDL-SEG-KEY-PROD String
|-- RDT-ADDL-SEG-KEY-TYPE DECIMAL (15,2)

we are able get parent-level element lengths only before flattening

df.schema.fields(0).metadata.getLong("maxLength")

is there any option to get the expected schema?

yruslan · 2023-08-01T06:57:02Z

Spark does not have varchar() type, nor integer(6) data types, only string and integer, so the expected output you specified is not possible.

However, it could be possible to retain metadata after schema flattening. How do you flat the schema?

sree018 · 2023-08-01T11:39:55Z

SparkUtils.flattenSchema(df,useShortFieldManes=false)

yruslan · 2023-08-03T13:36:18Z

I've tested if retaining the metadata is possible, and it is.

This PR makes SparkUtils.flattenSchema() retain metadata: #635

It is already merged into master. Please, test if you can and let me know if it works for you.

sree018 · 2023-08-07T10:47:06Z

@yruslan

New feature working.

thanks for feature

yruslan · 2023-08-08T08:43:51Z

Awesome! Thanks for letting me know

sree018 added the enhancement New feature or request label Jul 27, 2023

yruslan added a commit that referenced this issue Aug 3, 2023

#634 Retain metadata on schema flattening.

ac1e7f3

yruslan added a commit that referenced this issue Aug 3, 2023

#634 Retain metadata on schema flattening.

7462ecf

yruslan closed this as completed Aug 8, 2023

yruslan mentioned this issue Oct 16, 2023

Release Cobrix v.2.6.9 #646

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

copybook meta data for RDBMS #634

copybook meta data for RDBMS #634

sree018 commented Jul 27, 2023

yruslan commented Aug 1, 2023

sree018 commented Aug 1, 2023

yruslan commented Aug 3, 2023 •

edited

Loading

sree018 commented Aug 7, 2023

yruslan commented Aug 8, 2023

copybook meta data for RDBMS #634

copybook meta data for RDBMS #634

Comments

sree018 commented Jul 27, 2023

Background

Example [Optional]

yruslan commented Aug 1, 2023

sree018 commented Aug 1, 2023

yruslan commented Aug 3, 2023 • edited Loading

sree018 commented Aug 7, 2023

yruslan commented Aug 8, 2023

yruslan commented Aug 3, 2023 •

edited

Loading