Home > DemandTools for AppExchange Modules > DemandTools Cleansing Modules > Single Table Dedupe Wizard

Single Table Dedupe Wizard

 Capabilities & Features:


 
DeDuplication Best Practices & Guidelines:

Please test your deduplication scenarios thoroughly before deduping a large number of records, as there is no automatic "undo"/"rollback" option.  If you have a Salesforce sandbox, we recommended that you test in this environment first before deduping your production data.

More information on pointing DemandTools to the Salesforce Sandbox can be found
here.

1. Dedupe in the following order: 

2. Use at least a 3 pass strategy for identifying duplicates

3.  It is recommended that name, address, phone fields are standardized prior to searching for duplicates.

Screen One:

Step 1. Select the Object to De-Duplicate

The object for de-duplication can be selected by using one of the pre-built scenarios or by manual selection.

Pre-Built Scenario:


 

Manual  Object Selection:


Step 2: Select the fields to view on the dupe grid


Step 3: Select the records to search

Conditions are used when subsetting which objects will be updated, exported, and/or searched, depending on the module.  In most cases the option to use all objects or specify conditions on which object(s) to use will be available.

 

When "Use Conditions" is selected the conditions area is no longer greyed out and open for criteria specification.



Step 1.  Select the Field to Base a Condition On

Step 2.  Select the Operator

  • Available operators will change based on the type of field selected.
  • "!=" represents "not equal to".
  • When using dates, operators will include "Today", "Yesterday", etc.".

Step 3.  Specify a Value(s)

  • Text Fields:  Type in the value.  Multiple values for the same field are to be separated by commas.  Commas are translated as "or".  For example, "ma,nh,me" will be translated to "ma or nh or me".  Note:  values entered are not case sensitive.
  • Picklists:  Select the value from the specified list.  Multiple values can be selected and will be translated as "or".
    • Valid picklist values are shown (as defined in Salesforce.com).  Right click on a list of picklist values to "Add a Value".  This is used to add a value not currently in the picklist and make it available for use in the criteria. The value is added for the purpose of selection and will not be added to the actual picklist in Salesforce.
    • Right click on a list of picklist values to "Filter List" to subset a long picklist based on a search string to quickly find a value(s).
  • Dates:  The current date will default in the value window. Click the arrow on the right to access a calendar and select a specific date.


    • To isolate a specific day regardless of time when dealing with a Date/Time field (i.e. created date), specify 2 conditions, greater than the day before and less than the day after.  This is not necessary when using an operator such as "Today" or "Yesterday".

Step 4.  Add Condition(s)

  • Select "Add Condition" to add the condition to the list of current conditions.
  • By default all conditions are and'd together.  To create "or" groups see Step 5.
  • To delete one condition from the list, click the box to the left to highlight the condition and press the delete key on the keyboard.

Step 5.  Creating "OR Groups" (this option is not applicable to all modules)

  • To create "or" groups, check the "Expert Mode" checkbox.
  • Change the "OR Group" number for specific conditions (click on the number on the right and adjust up or down).  All conditions WITHIN a group are and'd and BETWEEN groups are or'd.  For example:


    Reads as:

    (MailingState=ma AND LeadSource=External Referral) OR (MailingState=ny AND LeadSource= Trade Show)
  • Unchecking the "Expert Mode" checkbox will cause all conditions to be and'd once again.



 
Screen Two:

Step 1: Editing the Match Criteria



 

 

Step 2: Understanding Mapping Types:

Match Categories

Mapping Types

Type Description:
Found in Category:
 

The Cleaned Account Name mapping type uses the built in Account Name Cleaning List.  To see the Cleaning list tool select the "Edit Account Cleaning List" button.

The cleaning list standardizes punctuation, spaces and removes common business prefixes and suffixes.  These lists are customizable for language(s) and/or line of business.

All, CRM

The Country Match mapping type is used to standardize field values for the recognized countries of the world.  It will recognize the long name of a country, the 2 digit ISO short form, the 3 digit ISO short form and the numeric ISO country value as possible matches of each other.

All, GEO

The Domain mapping type is used when mapping web pages and/or email addresses. It allows for the independent analysis of the domain information contained within the URL or the email address. For email addresses it uses any information to the right of the @ sign.  For web pages it parses the XXXXX.com portion.  This mapping types allows for easy comparison of web page field vs. web page field or email field vs. email field. It also allows for the comparison of email addresses to web pages and vice versa.

All, Internet
The Exact mapping type is a 100% match of every character (assuming no options apply (see Fuzzy, Transpose & Alpha Clean))

All, Alpha, Numeric

The First Name mapping type uses the built in Nickname List.  To see the Nickname list select the "Edit Nickname List" button at the top of the interface.

The Nickname list will see Bill, William and Billy as potential duplicates/matches of each other.  This list is also customizable by the end user for localization or for non contact substitution on any field by replacing the nickname list with synonyms.

All, Contact
The First XX Letters mapping technique is used to compare the first XX letters in a field.  When selected a pop up box will appear in order to specify the number of letters on which to match. Applicable to text fields only.

All, Alpha
The Numeric mapping type is used to compare only the numeric values in a field. All other characters, such as spaces or punctuation, will be ignored by the deduper. A field with a value of "Apt # 31" is seen by the deduper as the numeric characters "31", ignoring "Apt #".

All, Numeric

The Relaxed Address Match mapping type parses the street address to the lowest common denominator.  Based on North American standards it has also proved effective with most country address formats.

Example:

Apt#4, 123 Pavillion Street
123 Pavillion, Apt 4
4-123 Pavillion Ave NW

Are all seen as "123 Pavillion"

All, CRM
The Relaxed NA Phone Match mapping type removes all non-numeric characters and spaces, leading 0's and 1's, area codes and extensions on the back end, returning the 7 primary digits of the phone number.  If just 7 digits are left use those 7 digits, else just return digits 4 - 10.  It will not match the "Phone-word" values and will trim off the "SPOT" in the phone number and only look at the numeric portion.

Example:

+1 879 555 1212 ext 500
(879) 555 1212
1.879.555.1212 ext 408

Are all seen as "5551212"

Note: This is based on a 10-digit North American phone number.  Although it can be used with international phone numbers (will just return 7 digits in the middle), the Numeric mapping type is the recommeded mapping type to be used with international phone numbers.

All, CRM

The Salesforce.com ID Match  mapping type is used to match any Salesforce.com objects 15 digit ID to its 18 digit equivalent ID and vice versa.

All, CRM
The Street Address Match mapping type is a slightly more rigid criteria than the relaxed address match mapping type. The deduper will ignore the differences in street type short forms such as crescent - cres, road - rd, street - st, etc.

All, CRM
The Zip 5 and 9 mapping type will automatically match USPS 5 and 9 digit zip codes without the need to standardize them to a common number of digits.
All, CRM

Additional Mapping Options


Option: Description: Applicable Mapping Types:
Fuzzy  Phonetics engine capable of analyzing words for how they sound when pronounced.  Through a technique of removing vowels and analyzing the remaining consonants the fuzzy engine works well for matching fields with spelling mistakes. Cleaned Account Name
Exact
FirstName

Transpose

 The transpositional engine allows for fields to appear as duplicates even when there are differences in their word order.  For example,  "Jones, Smith and Jackson" will appear to be a duplicate of Jackson, Smith and Jones.

Cleaned Account Name
Exact
FirstName
Street

Alpha-Clean

 The alpha cleaner extends some of the capabilities of the account name cleaner to other fields for easier matching.  The alpha cleaner is used for ascii (north american) data to ensure that the only characters that are analyzed are the 26 characters of the english alphabet and the numbers 0-9.  Any other character that the field may contain will be ignored by the deduplication matching algorithms.

Cleaned Account Name
Exact
FirstName
Numeric
Street
Zip 5 and 9


Select the "Screen 3 - Merge Control" Button to search for potential duplicates and move to screen 3 to analyze and eventually process.



Screen Three:
 
Step 1: Grid Options

 

 

 

 



 

                                                                                    

Step 2: Selecting a Master Record

  

NOTE:   All merges, whether manual or automatic, will be processed based upon the defined Merge Options listed along the right hand side!! (See Step 4)

Step 3: Choose or Design a Master Rule


Currently selected master rule

Back to top

Sample constraints on different field types
Rule Dialog Box


A complex rule with multiple expressions and a score delta set

If the score delta threshold is set and not met: (records tie and have the same point value)


 

Step 4: -Select the merge control options

 

  • Reassign to Master Owner





    • Allows for sub-objects to be reassigned to the ownership of the master record owner during the merger (for both subordinate record sub objects and master record sub objects)
  • Additional log fields

    • Logging of additional fields of information in the DemandTools log file

Step 4: Processing records in batch mode

Once the records to merge are checked, and the master records determined (either by master rule or manual selection) select "Merge Checked Objects" to begin the merge process.




Step 5: Saving A Custom Scenario:

  • Even if a user begins the de-dupe process using a pre-built scenario it is very likely that some changes/modifications will have been made to fit the users data needs.
  • Modified pre-built scenarios can be saved for future use with the changes made by the user
  • On screen 3, once all settings have been selected, click the "Save Scenario" button on the lower left hand side
    • The pop up screen will still show the pre-built settings
  • To save the modifications the user must click the "Re-Build Description" button
    • This forces the pop up to refresh and reflect the modifications
  • The user can rename the scenario (this is reccomended so there are not two scenarios with the same name; an effective naming convention is to use the company name or abbreviation as a prefix to the scenario name)



See also