Problem Links in Google

Associate
Joined
31 Jan 2010
Posts
283
Location
UK
I am having really strange links appearing in Google but I am not sure where they are coming from.

If you have a look here:

Google

You will see links like this:
Code:
http://www.chrislowthian.co.uk/tag/adobe-device-central-cs5/%3Cli%20class=/

Notice the %3Cli%20class=/ at the end.

I don't know where the links are coming from. I have checked the xml sitemap and there is nothing wrong there.

I have also looked through the template with no luck.

Does anyone have any ideas? I don't want it to start effecting SEO as it is bringing up 404's.
 
Oh great :(

This is going to be a long night!

Here is the problem area causing the links in Google, anyone see any problems?

Code:
<div class="headerleft">
<a href="<?php echo get_option('home'); ?>/"><img src="<?php bloginfo('template_url'); ?>/images/logo.jpg" alt="<?php bloginfo('name'); ?>" /></a>
</div>

<div class="headerright">
<ul class="horiz">
<li><a href="<?php wp_list_pages('title_li=&depth=4&sort_column=menu_order'); ?>"</a></li>
</ul>
</div>
 
OK I think I have sorted the problem, I changed this:

Code:
<li><a href="<?php wp_list_pages('title_li=&depth=4&sort_column=menu_order'); ?>"</a></li>

To This:

Code:
<li><?php wp_list_pages('title_li=&depth=4&sort_column=menu_order'); ?></li>

Can someone just check the source code for me.

Now the next thing for me is to try and get the rubbish links out of google. Is it possible to setup a 301 redirect in the .htaccess so if someone visits:

Code:
http://www.chrislowthian.co.uk/tag/adobe-device-central-cs5/%3Cli%20class=/

They get redirected to:

Code:
http://www.chrislowthian.co.uk/tag/adobe-device-central-cs5/

Any help with that would be great :)
 
Yeah that's what I thought, just having trouble working out what to put in the .htaccess file.

Basically all I want to do is re-direct all urls that have %3Cli%20class=/ at the end to exactly the same url but with that part removed.

There must be a way of doing it by using one re-write command, instead of doing a re-direct for every page. It would take ages doing it like that as there are 100's of incorrect urls.

Also my site now validates fine (took over 3 hours to fix) :D

Any help with the .htaccess would be great.
 
OK if I add the below to the robots.txt file and remove them links in google webmaster tools will that be fine?

Code:
Disallow: /%3C!--QuoteEnd--%3E/page/2/
Disallow: /%3C!--QuoteEnd--%3E/page/3/
Disallow: /%3C!--QuoteEnd--%3E/page/4/
Disallow: /%3C!--QuoteEnd--%3E/page/5/
Disallow: /%3C!--QuoteEnd--%3E/page/6/
Disallow: /%3C!--QuoteEnd--%3E/page/7/
Disallow: /acer-aspire-revo-3610-review/915/comment-page-1/%3Cli%20class=
Disallow: /adobe-creative-suite-5-master-collection-final-cracked-already/1325/%3Cli%20class=
Disallow: /adobe-creative-suite-5-master-collection-final-cracked-already/1325/comment-page-1/%3Cli%20class=
Disallow: /category/computers-and-software/page/3/%3Cli%20class=
Disallow: /category/computers-and-software/page/4/%3Cli%20class=
Disallow: /category/computers-and-software/page/5/%3Cli%20class=
Disallow: /category/computers-and-software/page/6/%3Cli%20class=
Disallow: /category/internet-and-seo/li_class=%2F%3Cli+class%3D%2F/page/2/
Disallow: /category/nintendo-wii/page/2/%3Cli%20class=
Disallow: /changing-the-windows-7-start-orb-start-button/1035/%3Cli%20class=
Disallow: /darkcorp-v1-0-aka-cioscorp-v4-0-released/999/%3Cli%20class=
Disallow: /darkcorp-v1-0-aka-cioscorp-v4-0-released/999/comment-page-1/%3Cli%20class=
Disallow: /download-facebook-photo-albums-with-facepad/815/%3Cli%20class=
Disallow: /earn-cashback-on-your-online-purchases/91/%3Cli%20class=
Disallow: /google-wave-desktop-notifier/289/%3Cli%20class=
Disallow: /google-wave-desktop-notifier/289/comment-page-1/%3Cli%20class=
Disallow: /how-to-play-n64-games-on-your-nintendo-wii/1129/comment-page-1/%3Cli%20class=
Disallow: /installing-cioscorp-3-6-on-your-nintendo-wii/321/%3Cli%20class=
Disallow: /installing-the-homebrew-channel-on-your-nintendo-wii/299/%3Cli%20class=
Disallow: /new-nintendo-wii-4-2-softmodding-guide/985/%3Cli%20class=
Disallow: /new-nintendo-wii-4-2-softmodding-guide/985/comment-page-1/%3Cli%20class=
Disallow: /new-nintendo-wii-4-2-softmodding-guide/985/comment-page-2/%3Cli%20class=
Disallow: /nintendo-wii-4-2-safe-firmware-updater/793/%3Cli%20class=
Disallow: /page/10/%3Cli%20class=
Disallow: /page/3/%3Cli%20class=
Disallow: /page/4/%3Cli%20class=
Disallow: /playing-nintendo-wii-games-from-a-usb-hard-drive/389/%3Cli%20class=
Disallow: /playing-nintendo-wii-games-from-a-usb-hard-drive/389/comment-page-4/%3Cli%20class=
Disallow: /repair-your-epson-inkjet-printer/71/%3Cli%20class=
Disallow: /site-newsletter-may-2010/1309/%3Cli%20class=
Disallow: /sky-digital-set-top-box-secret-engineers-menu/42/%3Cli%20class=
Disallow: /tag/softmod/page/2/%3Cli%20class=
Disallow: /the-best-of-the-best-ecommerce-packages/465/comment-page-1/%3Cli%20class=
Disallow: /top-10-firefox-add-ons/64/%3Cli%20class=
Disallow: /top-wordpress-security-plugins/506/comment-page-1/%3Cli%20class=
Disallow: /ultimate-nintendo-wii-4-2-softmod-guide/622/comment-page-3/%3Cli%20class=
Disallow: /utorrent-2-0-release-candidate-1-released/17/%3Cli%20class=
Disallow: /vidahost-web-hosting-offer-extended/900/%3Cli%20class=
Disallow: /vidahost-web-hosting-offer-extended/900/comment-page-1/%3Cli%20class=
Disallow: /windows-7-sp1-service-pack-1-beta-leaked/1010/%3Cli%20class=
Disallow: /windows-7-student-offer/134/comment-page-1/%3Cli%20class=
Disallow: /windows-7-theme-for-windows-xp/98/%3Cli%20class=
Disallow: /wordpress-2-9-released/9/%3Cli%20class=
 
Just want to say thank you to everyone who helped.

It's all sorted now, just waiting for the last few problem URLs to be removed from google now :)
 
Back
Top Bottom