You’ve been in business for many years. You may have done it all on your own, or you may have enlisted the aid of unscrupulous SEO “services” to build links for you. Either way, you may be wondering whether you need to conduct a link cleanup. This column contains a five-step process outlining how to determine if you have a problem, how to analyze it, and then how to clean it up.
If you get this message, you have a problem. Skip to step two.
Obviously, you can’t email Google and ask them if you have some lousy links. But you can get a clue from the Google Webmaster Tools link list. To access it in the new navigation, go here:
Before you download anything, look at the numbers:
Compared to the total of 199k links, 76k is a large share. If you have a distribution like this, you may have some cleanup to do.
Also on this screen, look over the list of “Your most linked content.” If a large proportion of the links are pointing directly to your homepage, you may have a problem.
Finally, click on the More >> button below “Your most linked content.” If the number of source domains seems low for the number of links to a particular piece of content, you probably have what are referred to as “run of site” links. Those are often the source of penalties because they don’t really add any value, and they usually sit in an ad, a blogroll, a list of links, or the footer.
Notice that I use ambiguous words like “may” and “might” here. That’s because there are no hard and fast rules dictating which links are “bad” and which aren’t; most of it is a matter of judgment. While the one circled above does look suspicious, what you’d eventually discover (later in the process) is that it is an affiliate link formatted as a 302 redirect, so it does not pass PageRank.
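The links-versus-source-domains comparison described above can be roughed out in a few lines of Python. This is only a sketch: the field names, the sample numbers and especially the threshold of 50 are assumptions made for illustration, since there is no hard rule for what counts as too many links per domain.

```python
def links_per_domain(total_links, source_domains):
    """Average number of links each linking domain contributes to a page."""
    if source_domains <= 0:
        raise ValueError("source_domains must be positive")
    return total_links / source_domains


def flag_run_of_site(pages, threshold=50):
    """Return URLs whose links-per-domain ratio exceeds the threshold.

    The default threshold is an arbitrary assumption, not a Google rule.
    """
    return [
        page["url"]
        for page in pages
        if links_per_domain(page["links"], page["source_domains"]) > threshold
    ]


# Hypothetical rows in the spirit of "Your most linked content".
pages = [
    {"url": "/", "links": 12000, "source_domains": 15},
    {"url": "/blog/guide", "links": 900, "source_domains": 310},
]

suspects = flag_run_of_site(pages)
print(suspects)
```

A flagged page is only a candidate for a closer look; as the affiliate example shows, a suspicious ratio can turn out to be harmless.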
Okay, let’s assume you’ve determined that you most likely have a problem, or you’ve received a message that says you do. Where do you begin? The answer, from Google’s own blog, is to start with the link lists in Google Webmaster Tools. Begin by heading to your list of “Who links the most” in Webmaster Tools and downloading the following two reports:
Combine them in Excel, sort by Col A ascending and de-duplicate them (note that you’ll have to uncheck “First discovered” since that is only available on one of the reports):
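If you would rather script this than use a spreadsheet, the combine, sort and de-duplicate step might look like the sketch below. It assumes each report has already been read into a list of rows, that the URL column is called "Links", and that only one report has a "First discovered" column; those column names and the sample URLs are illustrative assumptions, not Google's guaranteed export format.

```python
def combine_link_reports(*reports):
    """Merge link reports, de-duplicating on the URL only.

    Each report is a list of dicts. "Links" is an assumed column name.
    Other columns, such as "First discovered", are ignored when comparing
    rows, which mirrors unchecking that column in Excel.
    """
    merged = {}
    for report in reports:
        for row in report:
            url = row["Links"].strip()
            # Keep the first row seen for each URL.
            merged.setdefault(url, row)
    # Sort ascending by URL, like sorting Col A in the spreadsheet.
    return [merged[url] for url in sorted(merged)]


# Hypothetical sample data standing in for the two downloads.
report_a = [
    {"Links": "http://b.example/page"},
    {"Links": "http://a.example/post"},
]
report_b = [
    {"Links": "http://a.example/post", "First discovered": "2013-05-01"},
    {"Links": "http://c.example/list", "First discovered": "2013-06-12"},
]

combined = combine_link_reports(report_a, report_b)
for row in combined:
    print(row["Links"])
```

Keying on the URL alone is the design choice that matters here: if the date column took part in the comparison, the same link would survive twice.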
Now that you have a deduplicated list, there are two things you should do:
Now, you should be left with a smaller list that includes followed links with 200 and 301 status codes. These you will have to check manually.
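Once every link has a status code and a followed or nofollow flag recorded next to it, the filter that produces this smaller list can be expressed as a short function. This is a sketch under assumptions: the `status` and `nofollow` field names describe how you might have recorded your own checks, and the sample rows are made up.

```python
# Status codes the process keeps for manual checking.
REVIEW_STATUSES = {200, 301}


def needs_manual_review(link):
    """Return True for followed links that resolve with 200 or 301."""
    return link["status"] in REVIEW_STATUSES and not link["nofollow"]


# Hypothetical checked links; URLs and values are illustrative only.
checked_links = [
    {"url": "http://a.example/post", "status": 200, "nofollow": False},
    {"url": "http://b.example/aff", "status": 302, "nofollow": False},
    {"url": "http://c.example/blog", "status": 200, "nofollow": True},
    {"url": "http://d.example/old", "status": 404, "nofollow": False},
    {"url": "http://e.example/moved", "status": 301, "nofollow": False},
]

manual_review = [link["url"] for link in checked_links if needs_manual_review(link)]
print(manual_review)
```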
Now for the last step, and this one gets confusing. You’ve collected and checked a list of all the links that Google reported in those downloads, but you’re not done. You still have two things to do:
When you download all the domains, you’ll notice that you only get a list of base URLs, like website.com. To check these against the list you’ve already made, add a new column in your spreadsheet labeled “Base Domain.”
Open up a new worksheet (this is important) and copy the list of links into it. Select [Data], [Text to Columns], [Delimited] and then make the delimiter a [/]. This will leave you with a list of base domains that you just need to clean up (possibly with a find and replace to strip “www.”) and then paste back into the “Base Domain” column. As long as you don’t sort or delete anything, the list will match up exactly with your list of links.
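The same base-domain extraction can be done without Text to Columns. Here is a minimal Python sketch using the standard library's `urllib.parse.urlsplit`; the `base_domain` helper name and the sample URLs are hypothetical, and the function returns the host as written (minus a leading "www."), which is what splitting on the slash gives you as well.

```python
from urllib.parse import urlsplit


def base_domain(url):
    """Return the host part of a URL, lowercased and without a leading www."""
    url = url.strip()
    # urlsplit only recognizes the host when a scheme or leading // is present.
    if not url.startswith(("http://", "https://", "//")):
        url = "//" + url
    host = urlsplit(url).hostname or ""
    if host.startswith("www."):
        host = host[len("www."):]
    return host


# Hypothetical links; the order is preserved so rows still line up.
links = [
    "http://www.website.com/blog/post",
    "https://Forum.Example.org/thread/9",
    "website.com/about",
]

domains = [base_domain(link) for link in links]
print(domains)
```

Because the list comprehension keeps the input order, the output can be pasted straight back next to the links, just as with the worksheet approach.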
Now, go to the very last record in your list of links. Underneath it, paste the list of domains from Google.
Sort by Col A descending, remove duplicates, and you’ll be left with a list of domains that aren’t already represented in your list of links. Check these the same way you did the links in Steps Two and Three.
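For a scripted version of this comparison, a set difference does the same job as the paste, sort and remove-duplicates routine. Note that this swaps in a different technique from the spreadsheet one; the function name, the normalization rules and the sample domains are assumptions for illustration.

```python
def normalize(domain):
    """Lowercase a domain and drop whitespace and a leading www."""
    domain = domain.strip().lower()
    if domain.startswith("www."):
        domain = domain[len("www."):]
    return domain


def unrepresented_domains(link_domains, google_domains):
    """Return domains from the domain download that no checked link covers."""
    seen = {normalize(d) for d in link_domains}
    candidates = {normalize(d) for d in google_domains}
    return sorted(candidates - seen)


# Hypothetical inputs: the "Base Domain" column and the domain download.
link_domains = ["website.com", "forum.example.org", "website.com"]
google_domains = [
    "website.com",
    "directory.example.net",
    "www.forum.example.org",
    "blogroll.example.com",
]

remaining = unrepresented_domains(link_domains, google_domains)
print(remaining)
```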
When you finish this process, you should have four main lists:
There are many services out there that will help you get links removed, which is what Google says it wants you to do. But in most cases, reaching out to webmasters and asking them to remove the link is a fool’s errand. It’s extremely time-consuming and often unsuccessful. Most sites where a webmaster would actually respond to you are ones where you want to keep your link, perhaps just asking them to add a nofollow.
If you know of directory submissions you can remove, or paid links you can stop paying for or add nofollows to, you should absolutely do that. But most webmasters don’t have that option. In addition, most of those services make you pay by the link, so going through this effort first will save you money by reducing the number of links that need to be checked.
Next time, I’ll show you how to properly format a reconsideration request and a disavow report. There are bound to be more qualified link experts who take a different approach, but this is how a self-proclaimed techie approaches the problem. Best of luck, and leave your ideas and feedback in the comments!