Our ETL program need to handle different format and encoding coming from different sources. Below listed some common technical challenges.
- Encoding: ASCII vs EBCDIC
- Character Set: UTF8, Unicode, etc.
- Format: Fixed length, Delimited, JSON, XML, Excel Spreadsheet and more
- Numeric Format: Binary, Packed-decimal, Zoned-decimal and more
Some of the above format or encoding may be challenging in hand coding ETL. However, most modern ETL engine could help to deal with the above different format and encoding.
No comments:
Post a Comment