Hi,
I seem to have a lot of duplicates (based on the built in duplicate check, i.e. the Email Address field) in my DB, where one of each "pair" is a lead or a contact, and the other one is from a list upload (not flagged as neither a lead nor a contact and not synced to SFDC). Since the 'Email Address' field is the duplicate matching criteria, the duplicates cannot have been created from uploading the list of email addresses to the wrong email field (we have a few), right?
I'm thinking that either
- the lead/contact was created first, and then the list upload person was created as a duplicate
- or of course the other way around
Either way, I would like to understand under what circumstances this can happen (and try to stop this in the future).
Any insights would help!
Solved! Go to Solution.
Hi Ludvig,
As Josh mentioned, dedupe occurs for list import, for your case you can check the following to find the root cause of the issue:
- Check if the file is properly formatted - there should not be any space before or after the email address
- Search a lead/contact for which you know the duplicate exists - then look at the activity log for both the records, you will find how they got created
- Another reason could be the data in SFDC, since SFDC have different definition for Lead and Contact but Marketo does not have that difference and treat everything as lead, so if any person exists in SFDC as contact and you upload the same in Marketo then once the sync between marketo and sfdc runs, there is a possibility of duplication.
If you read the docs:
Thanks Josh,
I have done quite extensive readings and in this case I am talking about list imports (called it uploads in OP) which as you say should not create dupes, hence my question.
Hi Ludvig,
As Josh mentioned, dedupe occurs for list import, for your case you can check the following to find the root cause of the issue:
- Check if the file is properly formatted - there should not be any space before or after the email address
- Search a lead/contact for which you know the duplicate exists - then look at the activity log for both the records, you will find how they got created
- Another reason could be the data in SFDC, since SFDC have different definition for Lead and Contact but Marketo does not have that difference and treat everything as lead, so if any person exists in SFDC as contact and you upload the same in Marketo then once the sync between marketo and sfdc runs, there is a possibility of duplication.
When you say "You can manage this by analyzing your entry sources (sales uploads, etc) and cut off people from being able to add records. You can add processes to block or force deduping."
Can you share some examples of how we can force dedupe a contact upon re-entry?
We have these issues.
You have to find out/tell us exactly what's causing multiple leads with the same address in the first place.
Tactics for dealing with dupes change depending on the cause(s).
My guess is that they were deleted at some point and then the lead re-entered the instance via list import or API (two most common cases). This also happens sometimes when there's clean up efforts in Salesforce. The lead/contact then gets merged but when this happens sometimes the older data will get prioritized before the latest data is populated. This then causes issues in triggering the lead to get qualified and routed.
Marketo has the duplicate matching criteria on the Standard "Email Address" field only. While uploading the list if you are mapping any other email address field except the standard one then Marketo treat that records as the records with email and create all the records with data as it is in the csv file.
If you make sure to map the correct or standard email address field, then it will not create any duplicates.
Thanks. I am looking at some examples and they have indeed "email address" field populated on both dupes most of the time. One of each pair typically has "person created" as the first activity history. I suspect that in historic list imports, someone might not have had the "de-dupe" checkbox clicked when doing the list import. Alternatively, it could be that the sync between SFDC and MKTO has been activated/performed AFTER the list import was made. Typical work flow for us is to set the sync field to true in SFDC and then make a list import of the target group, so if the sync has not yet finished (sometimes it takes 20 minutes) it could create a new person and then a duplicate when the sync is done like you flag Abhishek Chandra.
Boiled it down to 3 likely reasons:
- Wrong email field
- Sync made after list import
- De-dupe option not clicked
Related - Is there a way to choose which of the dupes that are added to the program, if the program members come from a list import? Right now it seems to be completely random. So person A and B have the same email, but maybe only B is synced to SFDC, and if B is added to the program, that activity will not be synced to the CRM.
Hi - I've run into the is same issue.
I created lists of leads to delete/reengage/mark safe based on their activity that comes through a separate source, our Drupal website's lead gen forms which pipes leads into Marketo, and uploaded them using only the "email address" column. It was a CSV, and the files had no additional spaces or any other characters before or after the email address. Virtually if not all of these names should still exist in our database, which we clean regularly.
By looking at the upload lists and clicking checking lead info, none of them have anything in their activity log. Marketo sees them as new. If I copy and paste any of them into the database search, I see two records, the old and the new, with the same exact email address and two different created dates/activity logs.
We've uploaded lists in the past like this and not seen duplicates created.
What's going on? How does this happen?