Printer Friendly Version
Email this thread to a friend
|
Featured Web Site Template |
|
Reflects user activity within the last 5 minutes
|
|
| Member |
Message |
pataya1
Joined: May 16, 2006
# Posts: 5
|
Posted: 2006-May-16 09:32
Hi everyone,
I have a data base driven joomla website and 772 categories, most of them still empty because the site is new...
I would like Google to only show content pages, for example, I would want this page to be crawled:
xuzo(dot) com/marketing/merchandising/link/promotional-merchandise.html
but not these pages:
xuzo(dot)com/marketing/merchandising/
xuzo(dot)com/marketing/
The above pages are empty, just a directory listing and links, would Google see that as duplicate content or will they just quickly crawl them until getting to my content items?
I put very important in the subject field because doing the wrong thing could easily get my banned...
Please use the contact form on xuzo(dot)com if you have a chance to tell me you have replied to this post, I don't always get notified of replies.
Thanks,
Bruno
|
 |
g1smd
Staff
Joined: Jul 28, 2002
# Posts: 10465
|
Posted: 2006-May-17 01:37
Use the robots noindex tag on just the pages that you don't want indexed. Keep it there until those pages do have content on them.
|
 |
pataya1
Joined: May 16, 2006
# Posts: 5
|
Posted: 2006-May-17 05:06
Thank you for the reply but handcoding no index no follow manually on nearly 700 pages is a lot of work...!
Isn't there and easier way?
Thanks again,
Bruno
|
 |
g1smd
Staff
Joined: Jul 28, 2002
# Posts: 10465
|
Posted: 2006-May-17 20:04
If it is database driven, then simply add one more field to the records: indexthis: yes/no, and then alter your script to add the meta robots noindex tag only where it is required.
|
 |
pataya1
Joined: May 16, 2006
# Posts: 5
|
Posted: 2006-May-18 03:28
The site is indeed database driven. I use Navicat to export and import my sql files but use MS Excell to edit them.
So let's see if I understand this right. In Excell, I should add a colum to my existing content table, name that column "indexthis: yes/no" and then input yes or no in the feilds next to the pages?
"meta robots noindex", where should I put this text? In the HTML file? THe index file? In every html page I don't want indexed?
Thank you so much, this information might save my bacon!...MSN is already starting to crawl my empty content...
Good thing I'm taking care of this issue with a 2 weeks old site than a 2 year old one!
|
 |
g1smd
Staff
Joined: Jul 28, 2002
# Posts: 10465
|
Posted: 2006-May-18 20:44
The tag that you want for the noindex operation is:
<meta name="robots" content="noindex">
and it goes in the <head> section of the page.
|
 |
You are not permitted to post messages in this forum or topic, because of one or more of the following reasons:
- You have not yet logged in, or registered properly as a member
- You are a member, but no longer have posting rights.
- This is a private forum, for which you do not have permissions.
If you are a recent member, it's possible that you simply have not yet confirmed your account. Please
check your email for a message entitled 'JimWorld Forums: Confirm Your Account' and follow the instructions
contained within.
If you cannot find this message, click here to Re-Send it.
|
If you are still experiencing problem, please read the
Login Assistance
Article for some advice on what may be causing your login not to work properly.
|
Switch to Advanced Editor and ...
Create a New Topic
or Reply to this Thread
|
|