Welcome to Casino Affiliate Programs! If this is your first visit, be sure to check out the FAQ by clicking the link above. You may have to register before you can post: click the register link above to proceed. To start viewing messages, select the forum that you want to visit from the selection below.
Thread: Robots.txt Question
- 02-10-2012 06:40 AM #1Member
- Join Date
- Nov 2010
- Location
- Barnsley
- Posts
- 157
- Blog Entries
- 1
- Thanks
- 1
- Thanked 8 Times in 8 Posts
- Rep Power
- 2
Robots.txt Question
I am having the following issue:
site.com/article
site.com/home/article
I have a few pages where the "home" is making a duplicate page. Would adding
Disallow: /home/
In my Robot.txt file be safe or would it block everything as "home" maybe an important directory. Stupid Joomla, stuff like this should not really been an issue to start with.
Any other ideas would be great.
Thanks all - 02-10-2012 09:00 AM #2Put on a smile!
- Join Date
- Oct 2003
- Location
- USA
- Posts
- 1,282
- Blog Entries
- 1
- Thanks
- 8
- Thanked 25 Times in 22 Posts
- Rep Power
- 12
Yes adding Disallow: /home/ would block everything within that directory.
You might want to just block the files that are duplicating with "Disallow: /home/file.html" ... - 02-10-2012 12:20 PM #3Senior Member
- Join Date
- Sep 2005
- Posts
- 272
- Thanks
- 48
- Thanked 20 Times in 18 Posts
- Rep Power
- 8
There is a specific meta tag designed for this situation.
<link rel=”canonical” href=”http://www.site.com/article” /> <-- this tells the search engines which is the correct page to index but i am unsure if it stops the alternate versions from bleeding link juice or factoring in other ways
Regarding the robots.txt The problem is that the robots entry will stop the robots from visiting that page but it will still factor into things in a number of other ways.
IMO your best solution, if possible, 301 redirect the /home/article pages to the /article pages. In theory you pass any pagerank and keyword relevance to the correct page this way as well. -
The Following User Says Thank You to bingoadvantage For This Useful Post:
Rak (02-10-2012)
- 02-10-2012 03:51 PM #4Moderator
- Join Date
- Jun 2006
- Location
- Brisbane
- Posts
- 1,173
- Blog Entries
- 4
- Thanks
- 157
- Thanked 149 Times in 113 Posts
- Rep Power
- 19
Agree with bingo advantage here. Go the canonical URL route. It allows all your pages to be spidered - and allows your "main" page with the "duplicate" content to still be the only existence in google serps.
I ran into similar problems with wordpress a while back where my category page was ranking high then the actual article itself - canonical urls made sure that the article was the page that ranked in serps. - 02-10-2012 04:16 PM #5Put on a smile!
- Join Date
- Oct 2003
- Location
- USA
- Posts
- 1,282
- Blog Entries
- 1
- Thanks
- 8
- Thanked 25 Times in 22 Posts
- Rep Power
- 12
Had no clue how joomla worked so wasnt sure why it was creating duplicate content, and I have always used robots to disallow following of certain pages.
When you mention category page which gives an excerpt vs the actual artical I see where the issue might be the same with joomla. Started using WP in Dec 11 and never even thought of that! So how exactly did you work that out in WP?
EDIT >
Just checked and WP includes the canonical tag automatically on the post pages ... maybe they didnt in an older version?Last edited by arkyt; 02-10-2012 at 04:22 PM.
- 02-10-2012 07:14 PM #6Moderator
- Join Date
- Jun 2006
- Location
- Brisbane
- Posts
- 1,173
- Blog Entries
- 4
- Thanks
- 157
- Thanked 149 Times in 113 Posts
- Rep Power
- 19
The latest versions of WP actually handle the canonical URL as part of its install. So every post created is given a canonical URL. Back in the day though, it wasn't part of the package.. got to love WP.. helping everyone out!

LinkBack URL
About LinkBacks


