Labeler and Standardizer Transformation

Both transformations are used for data cleansing and standardization of addresses and company names (for example, LTD for Limited, TAS for Trading As).

Labeler

Characteristics

  • Labels incoming values according to the type of data they contain
  • e.g. #$^& labeled as symbols (S)
  • e.g. 017242 labeled as 999999 (one 9 per digit)

Properties/Strategies

  • Label using Reference Table
  • Label Using Character Set.
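
The two labeling strategies above can be illustrated outside of IDQ with a small Python sketch. This is not Informatica code: the function names `label_characters` and `label_tokens`, the label characters (9, A, S, _), and the sample reference table are all assumptions made for the example.

```python
def label_characters(value: str) -> str:
    """Character-set labeling: emit one label character per input character.

    Label choices are illustrative assumptions:
    9 = digit, A = letter, _ = whitespace, S = any other symbol.
    """
    labels = []
    for ch in value:
        if ch.isdigit():
            labels.append("9")
        elif ch.isalpha():
            labels.append("A")
        elif ch.isspace():
            labels.append("_")
        else:
            labels.append("S")
    return "".join(labels)


def label_tokens(value: str, reference_table: dict, default: str = "UNKNOWN") -> list:
    """Reference-table labeling: look up each whitespace-separated token.

    Tokens missing from the (hypothetical) reference table get the default label.
    """
    return [reference_table.get(token.upper(), default) for token in value.split()]


if __name__ == "__main__":
    print(label_characters("017242"))
    print(label_characters("#$^&"))
    # Hypothetical reference table mapping known tokens to labels.
    print(label_tokens("12 Main AVE", {"AVE": "STREET_TYPE"}))
```

The character-level function mirrors the examples in the list above: digits become a run of 9s and special characters become S.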


Standardizer

Characteristics

  • Used for standardizing data to a consistent form
  • e.g. AVE or AVNUE to AVENUE

Properties/Strategies

  • Offers removal of extra spaces.
  • Label Using Character Set.
  • Remove Reference Table Matches
  • Remove Custom Strings
  • Replace Reference Table Matches with Valid Values
  • Replace Reference Table Matches with Custom Values
  • Replace Custom Strings.
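
As a rough illustration of what these strategies do to a value, here is a minimal Python sketch. It is not how IDQ implements the Standardizer: the `standardize` function, its parameters, and the `REFERENCE_TABLE` contents are hypothetical, and only space removal, custom string removal, and replacement of reference table matches with valid values are covered.

```python
import re

# Hypothetical reference table: variant (upper case) -> valid value.
REFERENCE_TABLE = {
    "AVE": "AVENUE",
    "AVNUE": "AVENUE",
    "LTD": "LIMITED",
}


def standardize(value: str, reference_table: dict, remove_strings: tuple = ()) -> str:
    """Apply a few Standardizer-style operations in a fixed order."""
    # Remove extra spaces: collapse runs of whitespace, trim both ends.
    cleaned = re.sub(r"\s+", " ", value).strip()
    # Remove custom strings.
    for unwanted in remove_strings:
        cleaned = cleaned.replace(unwanted, "")
    # Replace reference table matches with valid values, token by token.
    tokens = [reference_table.get(token.upper(), token) for token in cleaned.split()]
    return " ".join(tokens)


if __name__ == "__main__":
    print(standardize("  12   Main  AVNUE ", REFERENCE_TABLE))
    print(standardize("Acme #LTD", REFERENCE_TABLE, remove_strings=("#",)))
```

In this sketch the order of operations is fixed inside the function, and tokens that do not match the reference table pass through unchanged.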
