    Moderator(s): Logan, WinningWays

    david68
    Joined: May 16, 2005
    # Posts: 144

    Posted: 2007-Aug-07 14:51

    I want better rankings and more traffic, but which bots actually help me, and which are just out there mining data to sell?

    If a bot ignores robots.txt (everyone is disallowed until I allow them), I block it. A new bot just showed up: it requested robots.txt and hasn't viewed a page since, so it's obeying it, but I googled it and some people call it a data miner. It's called BlogPulseLive. The hostname comes back as intelliseek dot com, which redirects to nielsenbuzzmetrics dot com, which openly admits it's a data mining company. So I assume it doesn't help me at all?

    Is there a list of good bots out there? Some blog bots help get traffic.
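
For reference, the "everyone is disallowed until I allow them" policy described above can be sketched and checked offline with Python's standard `urllib.robotparser`. This is only a rough illustration: the `ALLOWED_AGENTS` list and the helper names are assumptions made for the example, not a vetted list of good bots, and the parser matches on the short agent token (for example `Googlebot`), not on a full browser-style User-Agent string.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical allow-list for illustration; these are the crawlers named in this thread.
ALLOWED_AGENTS = ["Googlebot", "Slurp", "msnbot"]


def build_robots_txt(allowed_agents):
    """Return robots.txt text that admits the named agents and shuts out everyone else."""
    lines = []
    for agent in allowed_agents:
        # An empty Disallow value means nothing is off limits for this agent.
        lines += [f"User-agent: {agent}", "Disallow:", ""]
    # Catch-all record: any agent not named above is disallowed everywhere.
    lines += ["User-agent: *", "Disallow: /"]
    return "\n".join(lines) + "\n"


def is_allowed(robots_txt, user_agent, url="/index.html"):
    """Ask the standard-library parser whether user_agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


ROBOTS_TXT = build_robots_txt(ALLOWED_AGENTS)

if __name__ == "__main__":
    print(ROBOTS_TXT)
    for agent in ("Googlebot", "Slurp", "BlogPulseLive"):
        print(agent, "allowed" if is_allowed(ROBOTS_TXT, agent) else "disallowed")
```

A check like this only shows what a well-behaved bot would conclude from the file; it does nothing to stop a bot that ignores robots.txt.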



    Hampstead
    Joined: Feb 20, 2001
    # Posts: 2015

    Posted: 2007-Aug-07 17:09

    There is no easy way of banning data mining or "bad" bots. They simply won't adhere to robots.txt or the robots meta tag. Indeed, they could simply identify themselves as browsers.

    If you're really worried about it, you could try to find a list of IP addresses associated with the various bots and ban them, but there is very little point.
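
Since a User-Agent string can be faked, one common way to tell a real search engine crawler from an impostor is a reverse DNS lookup followed by a forward lookup that must point back to the same IP. Below is a minimal sketch of that check, under some assumptions: the hostname suffixes in `TRUSTED_SUFFIXES` are illustrative and should be checked against each operator's own documentation, the function names are made up for the example, and the default lookups only handle IPv4.

```python
import socket

# Assumed hostname suffixes for the major crawlers; verify against each operator's docs.
TRUSTED_SUFFIXES = (".googlebot.com", ".google.com", ".crawl.yahoo.net", ".search.msn.com")


def _reverse(ip):
    """Reverse (PTR) lookup: IP address to primary hostname."""
    return socket.gethostbyaddr(ip)[0]


def _forward(host):
    """Forward lookup: hostname to its list of IPv4 addresses."""
    return socket.gethostbyname_ex(host)[2]


def is_verified_crawler(ip, suffixes=TRUSTED_SUFFIXES, reverse=_reverse, forward=_forward):
    """Return True only if ip reverse-resolves to a trusted host that resolves back to ip."""
    try:
        host = reverse(ip).rstrip(".").lower()
    except OSError:
        # No PTR record or the lookup failed: treat as unverified.
        return False
    if not host.endswith(tuple(suffixes)):
        return False
    try:
        # Forward-confirm, because anyone can publish a PTR record naming someone else's domain.
        return ip in forward(host)
    except OSError:
        return False
```

The `reverse` and `forward` parameters exist so the logic can be exercised without touching the network; in normal use you would call `is_verified_crawler(ip)` with the visitor's address taken from your server logs.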

    By the way, all bots by definition are there for data mining purposes.



    david68
    Joined: May 16, 2005
    # Posts: 144

    Posted: 2007-Aug-07 17:20

    "By the way, all bots by definition are there for data mining purposes."


    Yes, but Google/Yahoo/MSN at least help the cause in the process.

    The question was: is there a list of GOOD bots that I could specifically "allow" in robots.txt? I guess I should have phrased it better.




    Hampstead
    Joined: Feb 20, 2001
    # Posts: 2015

    Posted: 2007-Aug-07 18:15

    It is quite normal to allow all bots.



    david68
    Joined: May 16, 2005
    # Posts: 144

    Posted: 2007-Aug-07 18:32

    Well, maybe I'm quite abnormal, as I'd rather not give access to companies that sell MY information without permission and without me gaining anything in return. :)





    g1smd
    Staff
    Joined: Jul 28, 2002
    # Posts: 10438

    Posted: 2007-Aug-07 23:44

    A Google search will find a number of sites that have compiled a list of what they consider to be "bad bots". I would use those as a start, making sure to review as many as possible to ensure that the list is still correct.



    Hampstead
    Joined: Feb 20, 2001
    # Posts: 2015

    Posted: 2007-Aug-08 07:05

    I understand your problem with these companies, but the "bad bots" will take no notice of your robots.txt or your robots noindex tag and will simply crawl your site anyway.



    david68
    Joined: May 16, 2005
    # Posts: 144

    Posted: 2007-Aug-08 13:36

    Hmph. I don't care about "bad bots" - I asked for a list of "GOOD" bots :) Bots which OBEY robots.txt that I should allow. BLOGPULSELIVE keeps asking for permission, and I was wondering whether I should allow it or whether it won't really help me get rankings. I did google, but honestly the search engines are getting pretty crappy - bad results or no results.

    I know most people allow everyone, but honestly, why waste my bandwidth on something that won't benefit me?




    dudibob
    Joined: Oct 13, 2005
    # Posts: 1462

    Posted: 2007-Aug-08 14:08

    Banning bad bots will require a fair bit of server-side scripting to ban their IPs and make sure they don't get in, and new bad bots appear every day, so you will have to update your script fairly often :s
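
As a rough idea of what such a script involves, here is a small sketch using Python's standard `ipaddress` module to test a visitor's address against a list of banned ranges. The ranges in `BLOCKED_RANGES` are placeholders taken from the reserved documentation address blocks, not real bot addresses, and the function names are invented for the example; a real list would have to be maintained by hand or pulled from a source you trust.

```python
import ipaddress

# Placeholder ranges from the reserved documentation blocks; not real bot addresses.
BLOCKED_RANGES = ["192.0.2.0/24", "198.51.100.17/32", "2001:db8::/32"]


def compile_blocklist(ranges):
    """Parse CIDR strings once so each request only pays for the membership test."""
    return [ipaddress.ip_network(r, strict=False) for r in ranges]


def is_blocked(ip, networks):
    """Return True if ip falls inside any banned network of the same IP version."""
    addr = ipaddress.ip_address(ip)
    return any(net.version == addr.version and addr in net for net in networks)


NETWORKS = compile_blocklist(BLOCKED_RANGES)

if __name__ == "__main__":
    for visitor in ("192.0.2.44", "203.0.113.9"):
        print(visitor, "blocked" if is_blocked(visitor, NETWORKS) else "ok")
```

The check itself is cheap; the ongoing cost is in keeping the list current.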

    Personally I'd say don't worry about them; just look out for MSNbot, Slurp (Yahoo), Googlebot and Ask's bot (I forget what it's called).



    david68
    Joined: May 16, 2005
    # Posts: 144

    Posted: 2007-Aug-08 14:53

    OMG - forgive the rant/flame - but I never asked about BAD BOTS.

    I asked: Is BlogPulseLive a GOOD bot?

    I asked: Which ones are considered GOOD (like Slurp, Google, etc.) that I should specifically allow in robots.txt?

    Sheesh.

    Moderator - please LOCK this topic :) Better yet, move it to the xVault.



    Hampstead
    Joined: Feb 20, 2001
    # Posts: 2015

    Posted: 2007-Aug-08 15:59

    Allow all bots by not using a robots.txt file. The "good bots" will then have access to your site without you having to worry about it.

    It's quite normal to do this.



    david68
    Joined: May 16, 2005
    # Posts: 144

    Posted: 2007-Aug-08 16:06

    Thanks for your comments - even though I don't agree with them and didn't find them useful.




    Hampstead
    Joined: Feb 20, 2001
    # Posts: 2015

    Posted: 2007-Aug-08 16:47

    :rolleyes:


    © 1995  ·  iWeb, Inc  ·  DBA JimWorld Productions