Redirecting /nodeID.aspx to /My-real-url.aspx Options
SoerenS
Posted: Monday, June 30, 2008 11:49:30 AM

Rank: Fanatic

Joined: 7/25/2006
Posts: 424
Location: Silkeborg, Denmark
For some weird reason, Google has started indexing a lot of pages on my site via /nodeid.aspx, instead of the real URL. Thus marking half of my pages as duplicate content :-)

Can anyone recommend a way for me to do a permanent redirect for all requests coming in via /nodeid.aspx, and redirect it to the real URL? Or might I risk breaking something in Umbraco by doing this?

/SoerenS

Brug for råd til hvordan du driver en god webshop? / Need advice on how to run an effective webshop?
SoerenS
Posted: Monday, June 30, 2008 11:58:45 AM

Rank: Fanatic

Joined: 7/25/2006
Posts: 424
Location: Silkeborg, Denmark
Ok, I've figured out why Google has suddenly started to visit those /nodeID.aspx URL's. It's that buggy RSS package's fault, which uses nodeid based url's as guid's.

And apparently Google favours to use guid in XML-feed, rather than link...

Still, inspiration for some kind of quick fix / hot fix would be nice!

/SoerenS

Brug for råd til hvordan du driver en god webshop? / Need advice on how to run an effective webshop?
Richard
Posted: Monday, June 30, 2008 4:39:10 PM
Rank: Enthusiast

Joined: 9/7/2007
Posts: 19
Location: UK
One thought is to add a robots.txt file to the /data/rss folder, and request search engines to not index this, as you say it is duplicate content.
kalpa
Posted: Monday, June 30, 2008 6:38:31 PM

Rank: Fanatic

Joined: 7/19/2006
Posts: 492
Location: Göteborg, Sweden
While it might be a bit ugly and create duplicates in Google's eyes I believe that the id link is more correct since it's written as permalink in the feed. If you change the title of your post due to a typo or anything else the link from Google's index will be broken...
I belive this is the reason for why Google prefers to index the permalinks...

So while I can agree with you that it not is the best looking url, I think that the rss package actually handles the permalink correct. An option to exclude the permalinks might be a nice alternative...

// ;) Kalle

" - Yeah I'd like to share your point of view, as long as it's my view too... (http://www.d-a-d.dk/lyrics/pointofview)
hartvig
Posted: Tuesday, July 01, 2008 6:27:58 AM

Rank: Addict

Joined: 3/17/2008
Posts: 952
Location: Nyborg, Denmark
it's not buggy and the rss package uses permalinks according to specs. I believe excluding the permalink would cause the rss to become invalid. I'd go for the robots.txt solution.

Jeeeez, did I really start this :-)
SoerenS
Posted: Thursday, August 21, 2008 3:20:22 PM

Rank: Fanatic

Joined: 7/25/2006
Posts: 424
Location: Silkeborg, Denmark
I understand and agree with all of your points. Problem is, it doesn't seem like Google follows the specs or even best practice when it comes to this. Looks like it is ignoring the isPermaLink option. Which is a problem if you're digging into the hardcore SEO details like me.

I'm currently using the robots.txt solution to deal with the problem at hand (along with permanent redirects of the nodes it has already indexed), and it seems to work. But it's not ideal, as I definitely WANT Google to index my feed!

My next step is to try and just put the node ID as the guid, and set isPermaLink to false. Ie. <guid isPermaLink="false">1564</guid>.

According to the specs, the guid doesn't have to be a URL, although all the examples given is in the form of a URL:
Quote:
There are no rules for the syntax of a guid. Aggregators must view them as a string. It's up to the source of the feed to establish the uniqueness of the string.

Unless isPermaLink=true, then it has to be an URL. Also, guid is an optional element according to the specs.

The ideal solution to me would be to provide an option in the RSS package to include the guid element or not. Unfortunately I don't have the l337 programming skillz to correct it, so I've created an issue on it. Go give it a vote if you think this is important enough to disturb any of the Umbraco programmers, I know they're probably busy right now with Umbraco v4.

Brug for råd til hvordan du driver en god webshop? / Need advice on how to run an effective webshop?
Users browsing this topic
Guest


You cannot post new topics in this forum.
You cannot reply to topics in this forum.
You cannot delete your posts in this forum.
You cannot edit your posts in this forum.
You cannot create polls in this forum.
You cannot vote in polls in this forum.