Welcome guest. Before posting on our computer help forum, you must register. Click here it's easy and free.

Author Topic: My Robots.txt file  (Read 12211 times)

0 Members and 1 Guest are viewing this topic.

Zylstra

    Topic Starter
  • Moderator


  • Hacker

  • The Techinator!
  • Thanked: 45
    • Yes
    • Technology News and Information
  • Certifications: List
  • Computer: Specs
  • Experience: Guru
  • OS: Windows 7
Re: My Robots.txt file
« Reply #15 on: August 09, 2006, 03:00:59 PM »
Quote
k..

i'll try to make one myself and have you guys check it over..

how do i find the robots names??
Find it here

unlovedwarrior



    Guru

  • someday this name will be known
  • Thanked: 13
    Re: My Robots.txt file
    « Reply #16 on: August 09, 2006, 03:03:54 PM »
    k thanks

    Rob Pomeroy



      Prodigy

    • Systems Architect
    • Thanked: 124
      • Me
    • Experience: Expert
    • OS: Other
    Re: My Robots.txt file
    « Reply #17 on: August 09, 2006, 08:21:09 PM »
    Quote
    if you have a forum, you will want to block the admin directories and the member directories. This will keep important information about your users from being searched if the search engines manage to do that.
    Still not convinced you've quite go this, by virtue of the fact you're using the word "block".  It's best not to have (or give) the impression that robots.txt files can actually block anything.  Consider them more as a polite request, which will sometimes be ignored.

    You need to protect your /admin tree in much more robust ways.
    Only able to visit the forums sporadically, sorry.

    Geek & Dummy - honest news, reviews and howtos

    Zylstra

      Topic Starter
    • Moderator


    • Hacker

    • The Techinator!
    • Thanked: 45
      • Yes
      • Technology News and Information
    • Certifications: List
    • Computer: Specs
    • Experience: Guru
    • OS: Windows 7
    Re: My Robots.txt file
    « Reply #18 on: August 09, 2006, 08:32:34 PM »
    Quote
    Quote
    if you have a forum, you will want to block the admin directories and the member directories. This will keep important information about your users from being searched if the search engines manage to do that.
    Still not convinced you've quite go this, by virtue of the fact you're using the word "block".  It's best not to have (or give) the impression that robots.txt files can actually block anything.  Consider them more as a polite request, which will sometimes be ignored.
    [highlight]
    You need to protect your /admin tree in much more robust ways.[/highlight]
    Mind explaining how?

    Dilbert

    • Moderator


    • Egghead

    • Welcome to ComputerHope!
    • Thanked: 44
      Re: My Robots.txt file
      « Reply #19 on: August 09, 2006, 09:08:12 PM »
      Yes, please elaborate, I can use this info. :)
      "The geek shall inherit the Earth."

      unlovedwarrior



        Guru

      • someday this name will be known
      • Thanked: 13
        Re: My Robots.txt file
        « Reply #20 on: August 10, 2006, 08:23:21 AM »
        im confused rob :-[ explain plz so do i really need robot.txt??

        Rob Pomeroy



          Prodigy

        • Systems Architect
        • Thanked: 124
          • Me
        • Experience: Expert
        • OS: Other
        Re: My Robots.txt file
        « Reply #21 on: August 11, 2006, 01:13:16 AM »
        Quote
        Mind explaining how?
        Quote
        Yes, please elaborate, I can use this info. :)
        Sure.  WIth Apache, you can use an .htaccess file as an added layer of protection.  I suggest using password and IP-based protection.  If you will only access /admin from your local subnet, then only allow access from that subnet.  If you are connecting to a remote server, but from a fixed IP address, only allow access from that IP address.

        You can do something similar with IIS and WIndows-based authentication.

        Quote
        im confused rob :-[ explain plz so do i really need robot.txt??
        Well this is my point.  The Computer Hope article explains it quite well.  You tell spiders not to index parts of the web tree that it would be pointless for them to access - e.g. recursive directories (not something you're likely to encounter for a while) or any thread on YaBB that contains a post by Mac...
        « Last Edit: August 11, 2006, 01:13:35 AM by robpomeroy »
        Only able to visit the forums sporadically, sorry.

        Geek & Dummy - honest news, reviews and howtos

        Zylstra

          Topic Starter
        • Moderator


        • Hacker

        • The Techinator!
        • Thanked: 45
          • Yes
          • Technology News and Information
        • Certifications: List
        • Computer: Specs
        • Experience: Guru
        • OS: Windows 7
        Re: My Robots.txt file
        « Reply #22 on: August 11, 2006, 01:15:13 AM »
        Hmm. I already have HTACCESS, but not on my YaBB directories...

        unlovedwarrior



          Guru

        • someday this name will be known
        • Thanked: 13
          Re: My Robots.txt file
          « Reply #23 on: August 11, 2006, 08:23:28 AM »
          i have the htaccess file also im using phpbb

          Dilbert

          • Moderator


          • Egghead

          • Welcome to ComputerHope!
          • Thanked: 44
            Re: My Robots.txt file
            « Reply #24 on: August 11, 2006, 09:32:59 AM »
            Quote
            or any thread on YaBB that contains a post by Mac...

            ROTFL ;D ;D ;D

            Thanks, Rob, I'll keep that in mind.

            So now I know what robots.txt is... thanks, Rob. :)
            "The geek shall inherit the Earth."

            Google



              Mentor

              Thanked: 2
              • Certifications: List
              • Experience: Experienced
              • OS: Windows 7
              Re: My Robots.txt file
              « Reply #25 on: August 11, 2006, 10:16:29 AM »
              You have reached the point of a "VERY HOT TOPIC" at [timestamp=1155312980]!
              [highlight]CONGRATULATIONS![/highlight]