Duplicate Content in Joomla and Why it Matters
Monday, 05 February 2007

A few weeks ago we mentioned that there was some good news for those people who have duplicate content on their site. A Google staff member mentioned that there was no longer going to be an active penalty for websites that committed this particular mistake.
Some people were happy. Some were dubious and believed the penalty still existed. Some simply said, "What the beep is duplicate content and how does it affect my Joomla site?"
This post is for that last group of people.
Adding more components can produce even more URLs.
Comments (15)
written by ronn, July 03, 2007
Yeah, I agree that we must stick to one SEF component as long as we can. I completely messed up my site simply by changing from OpenSEF to SH404SEF.
Even though I was happy with OpenSEF, I read several good reviews of SH404SEF and felt the need to try the component, which I would definitely call a big mistake. It messed up my content as well as my PageRank. After several hours of headache I just installed OpenSEF again. Regards
written by David Towers, October 04, 2007
Is it really necessary to disable the print function? Does this not just make the browser load the same page linked to a different CSS?
In version 1.5 of Joomla, there is no need to disable PDFs, as they have the nofollow attribute built into the link; however, the print buttons do not have that attribute built in. So what's your advice: do you recommend disabling the print button on Joomla 1.5 sites? From an accessibility and usability point of view, it's really nice having a print option!
PDF and printing
written by Zorro, October 10, 2007
In Joomla 1.0.x, the best way to avoid having duplicate content is to disallow index2.php in the robots file. Both the print and PDF functions are run through index2.php which (from a search engine point of view) generates additional pages with the same content.
Printing through a "print" CSS is obviously the best and cleanest way - but Joomla doesn't support that out of the box. You have to switch off the print icon and make your own "print" CSS for this method to work.
In Joomla 1.5, disallowing index2.php doesn't work any more since they stopped using the index2.php method and now run everything through index.php. Steve's tip will remove the print pages, and with an additional disallow: /*pdf* you can remove the PDF pages as well if you don't trust the nofollow method.
Kind regards, Zorro
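To see which URLs a given set of Disallow rules would catch, the matching can be sketched in Python. This is only a minimal sketch, assuming Google-style pattern matching (`*` matches any run of characters, a trailing `$` anchors the end of the URL). The helper names `robots_pattern_to_regex` and `is_disallowed` are hypothetical, and the sample URLs are illustrative rather than taken from a real site.

```python
import re


def robots_pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: Google-style wildcards, where '*' matches any run of
    characters and a trailing '$' anchors the match at the end of the URL.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' where '*' stood.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(url_path, disallow_patterns):
    """Return True if any Disallow pattern matches the path plus query string."""
    return any(
        robots_pattern_to_regex(pattern).match(url_path)
        for pattern in disallow_patterns
    )


# Joomla 1.0.x: print and PDF views run through index2.php.
JOOMLA_10_RULES = ["/index2.php"]
# Joomla 1.5: everything runs through index.php, so match on the query string.
JOOMLA_15_RULES = ["/*pdf*", "/*print*"]

if __name__ == "__main__":
    # Illustrative URLs only; real Joomla URLs vary with configuration.
    samples = [
        ("/index2.php?option=com_content&task=view&id=5&pop=1", JOOMLA_10_RULES),
        ("/index.php?option=com_content&task=view&id=5", JOOMLA_10_RULES),
        ("/index.php?view=article&id=5&format=pdf", JOOMLA_15_RULES),
        ("/index.php?view=article&id=5", JOOMLA_15_RULES),
    ]
    for path, rules in samples:
        print(path, "blocked" if is_disallowed(path, rules) else "allowed")
```

One thing the sketch makes visible: a pattern as broad as `/*pdf*` matches any URL that contains "pdf" anywhere, including ordinary article URLs that merely have the word in their alias, so it is worth checking your own URLs against it before relying on it.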
...
written by Good Web Practices, October 18, 2007
It's great to be up to speed about this before it's too late. So basically, from what I've understood, on my Joomla 1.5 setup I can still leave the PDF and Print pages on (if I want to) and not have duplicate content problems by including the following two lines:
disallow: /*pdf* /*print*
Is that right?
written by Vacuum, November 21, 2007
If I understood Zorro correctly, if we disallow print and PDF in robots.txt, we can leave print and PDF switched on Joomla-wide and we won't have any printer or PDF duplicate content?
...
written by Zorro, November 22, 2007
Vacuum: Yes, that's what I'm saying. For Joomla 1.5, that is.
Kind regards, Zorro
written by brendan wilde, December 06, 2007
"Use JPromoter. Analyze your site and then go to "Optimize Your Site". Search by using "Group by Same Titles". Make sure you choose "No index" and "No follow" for all but one copy of each page. This means that Google should only index the pages you want indexed."
Can one set noindex and nofollow for all but one URL without JPromoter, i.e. simply using OpenSEF?
written by Micheas Herman, February 17, 2008
Google strongly hinted at Joomla Day West last year at the Googleplex that if they detect a site is a Joomla site, they drop into a special Joomla site mode. This is why some pages that Google says they will not index get indexed on Joomla sites. There are a huge number of Joomla 1.0.x and Mambo 4.x sites, and they all do things that Google hates, but there are so many Mambo and Joomla sites that Google has to cope.
Not that I don't use sh404sef for all of my Joomla 1.0 sites, but I suspect that the penalty for not being Google friendly is not that great at the moment. However, as sites slowly upgrade, the odds of all but a small number of pages not being indexed by the major search engines increase. Personally I suspect that it is a better use of resources to increase the organic links to your site than to optimize the site. Also, I find that sh404sef makes it much easier for people to link to your site, so the effect of sh404sef may be more that the URLs are human friendly and thus get reposted more.
Remember, SEO is mostly just the methodical work of posting good content and posting links to your content around the web. Shortcuts don't really exist. Just my musings on the subject.
written by Rafi Michael - Toronto Weddings, March 16, 2008
That was great info. I will make sure to work on my site videobabylon.ca so I have no problems.


As for JPromoter, contrary to what they're claiming on their website, there is the Joomla SEO Patch from joomlatwork.com which, at a fraction of the price, also gives full control over meta tags on non-com_content pages, besides doing other great things. (No I'm not affiliated.)
I'm with you again completely on item 3, and there is a strong reason for using a SEF URL component that you didn't even mention: It can rectify Joomla's inherent ItemID issues and make sure that a certain page is always reached via *ONE* URL no matter what ItemID Joomla thinks it should have. That's what I'm using OpenSEF for on almost all my sites.
OpenSEF also rewrites all internal links so you can leave them the standard way. I haven't tried SEF Advance but would imagine it operates much the same.
Thanks for the article and kind regards.
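The ItemID point above can be illustrated with a small Python sketch. It assumes non-SEF Joomla URLs where two addresses that differ only in their `Itemid` query parameter serve the same content. The helper names and the example.com URLs are hypothetical, and the sketch only detects such duplicates in a list of URLs; it does not reproduce what OpenSEF does internally.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def strip_itemid(url):
    """Return the URL with any Itemid query parameter removed."""
    parts = urlsplit(url)
    query = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() != "itemid"
    ]
    return urlunsplit(
        (parts.scheme, parts.netloc, parts.path, urlencode(query), parts.fragment)
    )


def group_itemid_duplicates(urls):
    """Group URLs that become identical once Itemid is ignored.

    Returns a dict mapping the Itemid-free URL to the list of original
    URLs, keeping only groups with more than one member.
    """
    groups = {}
    for url in urls:
        groups.setdefault(strip_itemid(url), []).append(url)
    return {key: members for key, members in groups.items() if len(members) > 1}


if __name__ == "__main__":
    # Illustrative URLs only.
    crawled = [
        "http://example.com/index.php?option=com_content&task=view&id=12&Itemid=27",
        "http://example.com/index.php?option=com_content&task=view&id=12&Itemid=45",
        "http://example.com/index.php?option=com_content&task=view&id=13&Itemid=27",
    ]
    for page, copies in group_itemid_duplicates(crawled).items():
        print(page, "is reachable via", len(copies), "URLs")
```

Running something like this over a list of crawled URLs gives a rough idea of how many pages are reachable under more than one address before and after a SEF component is installed.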