Problem Links in Google

Associate
Joined
31 Jan 2010
Posts
283
Location
UK
I am having really strange links appearing in Google but I am not sure where they are coming from.

If you have a look here:

Google

You will see links like this:
Code:
http://www.chrislowthian.co.uk/tag/adobe-device-central-cs5/%3Cli%20class=/

Notice the %3Cli%20class=/ at the end.

I don't know where the links are coming from. I have checked the xml sitemap and there is nothing wrong there.

I have also looked through the template with no luck.

Does anyone have any ideas? I don't want it to start effecting SEO as it is bringing up 404's.
 
it looks like it's a malformed link that's pointing to your site (or even inside your site) somewhere.

If you un-encode the last part of the url it reads
Code:
http://www.chrislowthian.co.uk/tag/adobe-device-central-cs5/<li class=/

I'd guess you have a link in a menu somewhere that's missing the last </a>
 
Actually I just checked your site - the problem links are in the UL in DIV#headerright. That whole section of code looks pretty messed up tbh :)
 
I also seems that a lot of classes are declared with an ' I beleive you need to use double quotes instead.

And I see spaces in the class names I think that is illegal too.
 
Oh great :(

This is going to be a long night!

Here is the problem area causing the links in Google, anyone see any problems?

Code:
<div class="headerleft">
<a href="<?php echo get_option('home'); ?>/"><img src="<?php bloginfo('template_url'); ?>/images/logo.jpg" alt="<?php bloginfo('name'); ?>" /></a>
</div>

<div class="headerright">
<ul class="horiz">
<li><a href="<?php wp_list_pages('title_li=&depth=4&sort_column=menu_order'); ?>"</a></li>
</ul>
</div>
 
its thid line...
Code:
<li><a href="<?php wp_list_pages('title_li=&depth=4&sort_column=menu_order'); ?>"</a></li>

It should be something like...
Code:
<li><a href="<?php wp_list_pages('title_li=&depth=4&sort_column=menu_order'); ?>">Link text</a></li>
 
OK I think I have sorted the problem, I changed this:

Code:
<li><a href="<?php wp_list_pages('title_li=&depth=4&sort_column=menu_order'); ?>"</a></li>

To This:

Code:
<li><?php wp_list_pages('title_li=&depth=4&sort_column=menu_order'); ?></li>

Can someone just check the source code for me.

Now the next thing for me is to try and get the rubbish links out of google. Is it possible to setup a 301 redirect in the .htaccess so if someone visits:

Code:
http://www.chrislowthian.co.uk/tag/adobe-device-central-cs5/%3Cli%20class=/

They get redirected to:

Code:
http://www.chrislowthian.co.uk/tag/adobe-device-central-cs5/

Any help with that would be great :)
 
Yeah that's what I thought, just having trouble working out what to put in the .htaccess file.

Basically all I want to do is re-direct all urls that have %3Cli%20class=/ at the end to exactly the same url but with that part removed.

There must be a way of doing it by using one re-write command, instead of doing a re-direct for every page. It would take ages doing it like that as there are 100's of incorrect urls.

Also my site now validates fine (took over 3 hours to fix) :D

Any help with the .htaccess would be great.
 
OK if I add the below to the robots.txt file and remove them links in google webmaster tools will that be fine?

Code:
Disallow: /%3C!--QuoteEnd--%3E/page/2/
Disallow: /%3C!--QuoteEnd--%3E/page/3/
Disallow: /%3C!--QuoteEnd--%3E/page/4/
Disallow: /%3C!--QuoteEnd--%3E/page/5/
Disallow: /%3C!--QuoteEnd--%3E/page/6/
Disallow: /%3C!--QuoteEnd--%3E/page/7/
Disallow: /acer-aspire-revo-3610-review/915/comment-page-1/%3Cli%20class=
Disallow: /adobe-creative-suite-5-master-collection-final-cracked-already/1325/%3Cli%20class=
Disallow: /adobe-creative-suite-5-master-collection-final-cracked-already/1325/comment-page-1/%3Cli%20class=
Disallow: /category/computers-and-software/page/3/%3Cli%20class=
Disallow: /category/computers-and-software/page/4/%3Cli%20class=
Disallow: /category/computers-and-software/page/5/%3Cli%20class=
Disallow: /category/computers-and-software/page/6/%3Cli%20class=
Disallow: /category/internet-and-seo/li_class=%2F%3Cli+class%3D%2F/page/2/
Disallow: /category/nintendo-wii/page/2/%3Cli%20class=
Disallow: /changing-the-windows-7-start-orb-start-button/1035/%3Cli%20class=
Disallow: /darkcorp-v1-0-aka-cioscorp-v4-0-released/999/%3Cli%20class=
Disallow: /darkcorp-v1-0-aka-cioscorp-v4-0-released/999/comment-page-1/%3Cli%20class=
Disallow: /download-facebook-photo-albums-with-facepad/815/%3Cli%20class=
Disallow: /earn-cashback-on-your-online-purchases/91/%3Cli%20class=
Disallow: /google-wave-desktop-notifier/289/%3Cli%20class=
Disallow: /google-wave-desktop-notifier/289/comment-page-1/%3Cli%20class=
Disallow: /how-to-play-n64-games-on-your-nintendo-wii/1129/comment-page-1/%3Cli%20class=
Disallow: /installing-cioscorp-3-6-on-your-nintendo-wii/321/%3Cli%20class=
Disallow: /installing-the-homebrew-channel-on-your-nintendo-wii/299/%3Cli%20class=
Disallow: /new-nintendo-wii-4-2-softmodding-guide/985/%3Cli%20class=
Disallow: /new-nintendo-wii-4-2-softmodding-guide/985/comment-page-1/%3Cli%20class=
Disallow: /new-nintendo-wii-4-2-softmodding-guide/985/comment-page-2/%3Cli%20class=
Disallow: /nintendo-wii-4-2-safe-firmware-updater/793/%3Cli%20class=
Disallow: /page/10/%3Cli%20class=
Disallow: /page/3/%3Cli%20class=
Disallow: /page/4/%3Cli%20class=
Disallow: /playing-nintendo-wii-games-from-a-usb-hard-drive/389/%3Cli%20class=
Disallow: /playing-nintendo-wii-games-from-a-usb-hard-drive/389/comment-page-4/%3Cli%20class=
Disallow: /repair-your-epson-inkjet-printer/71/%3Cli%20class=
Disallow: /site-newsletter-may-2010/1309/%3Cli%20class=
Disallow: /sky-digital-set-top-box-secret-engineers-menu/42/%3Cli%20class=
Disallow: /tag/softmod/page/2/%3Cli%20class=
Disallow: /the-best-of-the-best-ecommerce-packages/465/comment-page-1/%3Cli%20class=
Disallow: /top-10-firefox-add-ons/64/%3Cli%20class=
Disallow: /top-wordpress-security-plugins/506/comment-page-1/%3Cli%20class=
Disallow: /ultimate-nintendo-wii-4-2-softmod-guide/622/comment-page-3/%3Cli%20class=
Disallow: /utorrent-2-0-release-candidate-1-released/17/%3Cli%20class=
Disallow: /vidahost-web-hosting-offer-extended/900/%3Cli%20class=
Disallow: /vidahost-web-hosting-offer-extended/900/comment-page-1/%3Cli%20class=
Disallow: /windows-7-sp1-service-pack-1-beta-leaked/1010/%3Cli%20class=
Disallow: /windows-7-student-offer/134/comment-page-1/%3Cli%20class=
Disallow: /windows-7-theme-for-windows-xp/98/%3Cli%20class=
Disallow: /wordpress-2-9-released/9/%3Cli%20class=
 
Just want to say thank you to everyone who helped.

It's all sorted now, just waiting for the last few problem URLs to be removed from google now :)
 
Back
Top Bottom