Clean, Standardize, and Reformat Any Data Type
In the era of big data, data stewards spend a large amount of time doing “data janitor” work – the time-consuming, mundane task of collecting, preparing, and cleaning disparate data. We offer the Melissa Cleanser transform for Pentaho® and Microsoft SQL Server® Integration Services (SSIS) to help automate and prepare data for the cleansing process. This empowers users to build custom data cleansing scripts for data suffering from a wide range of errors and inconsistencies. With the component, data stewards have the ability to standardize and validate inventory lists to better prep and cleanse data prior to analysis.
- Cleanse any type of data and achieve a higher standard of data quality for integration, warehousing, and analytics
- Gain greater control of your data when optimized, and save your business time and resources
- Customize and create rules (triggering) to standardize data
How Cleanser Works
The Cleanser transform enables users to clean, standardize, and reformat any data type – from changing casing or capitalization, adding or removing punctuation, expanding or contracting abbreviations, and searching and replacing any parts of a string. The tool applies different cleansing operations to your data integration and warehousing efforts. The transform has five main cleansing operations:
View Cleansing Options+
Add or remove punctuation.
Expand or contract abbreviations, for example: CA to California
Search & Replace
Replace portions of a string
Create programmatic expressions to make sense of data values
Use regular expressions to extract, validate, etc.