WHAT DOES IT DO?
Bumblebee automates data transformation and cleansing activities while using data science to provide fast and accurate duplicate identification and record matching.
Bumblebee transforms your data in four key ways:
- Duplication ID & Matching Bumblebee's sophisticated matching technology analyzes your data to identify duplicates and return predicted matches with shared group ID, a confidence score, and a single record flagged as the "Queen Bee" or primary record saving you countless hours in the merging process. No more reading through each duplicate to see which is best.
- Standardization Where possible, your data is transformed to conform to the most commonly used formats existing in your data including, cASe, state, postal code, country and phone number.
- Data Cleansing Leading/trailing white spaces and leading punctuation are removed, leading zeroes are appended to US postal codes, and invalid data. (e.g. emails without '@', URLs having 'wwww', phones/zips without required numeric characters, etc.) are flagged.
- Email & Postal Validation Emails undergo validation testing and are flagged with a risk score. Postal addresses are passed through deliverability & change of address databases for US and Canada to identify address corrections and append mailability scores.