Pentaho Data Quality
Harness Quality Big Data for Business Intelligence
Big Data can be powerful, but Big Data done wrong can end up just Big Bad Data. That’s where Data Quality Components for Pentaho® PDI become relevant. This unique set of customer data management tools leverages the integration power of Pentaho and Melissa’s suite of global data quality and enhancement solutions to empower businesses to collect data from any source, cleanse and transform it, and gain immediate insight for actionable intelligence.
- Increase productivity with drag and drop visual development for Big Data integration and accessibility to all DBAs and analysts
- Boost operational intelligence with the ability to embed enriched analytics into actionable line-of-business applications
- Accelerate Time to Value employing powerful data quality routines using minimal time and effort
- Improve Return on Investment (ROI) with quality data for better customer relationships, advanced analytics, and effective marketing segmentation.
Melissa Data Quality and Enhancement Transforms
Melissa leverages the integration power of SQL Server to provide a full spectrum of data quality with the following transforms:
The Profiler transform provides intelligent data profiling to identify weak points in your data collection process, enforce business rules on incoming records and monitor improvements over time.
The Contact Verify transform verifies, corrects, and standardizes U.S and Canada postal addresses, email addresses, and phone numbers.
The Global Verify transform verifies, corrects, and standardizes postal addresses, email addresses, and phone numbers for 240+ countries.
The Personator transform verifies identity, validates postal address, email address, and phone number, plus updates the current address for a contact and enriches records with missing email, phone, address data, and demographic info.
The Personator World transform verifies an individual’s national ID, provides age verification, and provides watchlist/sanctions screening for fraud prevention and KYC/AML compliance.
The MatchUp transform provides advanced deduping/matching capabilities and survivorship rules to eliminate duplicates and create a single customer view.
The Fuzzy Matching component provides sophisticated fuzzy matching capabilities utilizing more than 12 fuzzy matching and proprietary algorithms to identify hard-to-spot similar records.
The IP Locator transform geolocates a given IP address and returns latitude/longitude, city, state, postal code, county, and ISP information.
The Business Coder transform provides detailed firmographic data on over 25 million U.S. businesses – ideal for standardizing business entities, consolidating records, and lead scoring.
The Property transform enriches records with over 400 fields of property and mortgage data on over 140 million U.S. properties.
The SmartMover transform matches your address records against the USPS NCOALink database or Canada Post NCOA database & returns the current address of customers that have moved.
The Cleanser transform utilizes a variety of programmatic and regular expressions to correct data inconsistencies fast.