Click here to get back home

blocking robots.txt from non-robots

 HomeNewsGroups | Search | About
 alt.internet.search-engines    Post an article   get this group's latest topics as an RSS feed add this group's latest topics to your My MSN content add this group's latest topics to your My Yahoo content
Subject Author Date
blocking robots.txt from non-robots Joe Fox 02-20-2008
Posted by Paul on February 21, 2008, 11:54 pm
Please log in for more thread options

>
>> You have email John.
>
>Thanks Paul, looking into it (got the Gecko one as well, haven't had time
>to check it out, thanks).

nps John,
i'll hear from you when you are ready.
plh
paul

----== Posted via Newsfeeds.Com - Unlimited-Unrestricted-Secure Usenet News==----
http://www.newsfeeds.com The #1 Newsgroup Service in the World! 120,000+
Newsgroups
----= East and West-Coast Server Farms - Total Privacy via Encryption =----

Posted by Big Bill on February 21, 2008, 5:12 pm
Please log in for more thread options

>
>> I don't want certain humans (only a few hundred in number but all on
>> dynamic IPs in several countries) to be able to read the robots.txt that
>> I'm giving search engines because I don't want them to know what pages I
>> am telling SE's "disallow"
>
>Maybe it helps if you explain the why. Which is not: I don't want them to
>read. What do you want to achieve? Why should those people not be able to
>read your robots.txt
>
>> What's so wrong with this?
>
>To me it sounds pointless. I see no gain in it, but maybe you can explain
>better the *why*?
>
>BTW: I disagree with others that Google et al should have a problem with
>this. Although it's cloaking, it's not something (in this case) Google
>should care about. It's like getting upset about a site that shows a flag
>on its page based on the country you connect from, and feeds Google a flag
>of the USA :-)

I never said they'd care about it, just that it was cloaking.

BB
--

http://www.kruse.co.uk/
http://www.fat-odin.com/
http://www.here-be-posters.co.uk/

Posted by Don on February 21, 2008, 8:58 pm
Please log in for more thread options
b348-49a9cacc2453@n77g2000hse.googlegroups.com:

>> Perhaps I didn't say it right.  I'm wanting to block the robots.txt that
>
>> I'm feeding search engines from being given to anybody else.
>
> If Google catch you they will exclude you from the index.
>
> 'Don't deceive your users or present different content to search
> engines than you display to users, which is commonly referred to as
> "cloaking." '
>

It's done ALL the time.
What matters is that it's done for an appropiate reason and is accomplished
server side.

Posted by John Bokma on February 21, 2008, 10:10 pm
Please log in for more thread options

> It's done ALL the time.
> What matters is that it's done for an appropiate reason and is
> accomplished server side.

To me, what matters, is that a user doesn't click on a search result, and
comes on a page that doesn't make the expected data available.

And yes, even that does seem to be allowed by Google. There is a forum
that uses JS cloaking. Can't find a quick example, will post when I bump
into it again. The solution is to turn either JS off, or click on cached
link. But it's a sad practice and even sadder that Google allows for it.

webmasterworld (IIRC) did use cloaking, maybe still does. I've reported
this several times to Google, but no use (unless it has been fixed)

--
John Bokma http://johnbokma.com/

Posted by Don on February 21, 2008, 10:37 pm
Please log in for more thread options

>
>> It's done ALL the time.
>> What matters is that it's done for an appropiate reason and is
>> accomplished server side.
>
> To me, what matters, is that a user doesn't click on a search result,
> and comes on a page that doesn't make the expected data available.
>

John,
Believe this is the goal of most webnasters, however there is specific
traffic that individual webasters simply have no desire for.
That's their own decision and each webamster must determine what is
benefical or detrimental their own site (s).

>
> webmasterworld (IIRC) did use cloaking, maybe still does. I've
> reported this several times to Google, but no use (unless it has been
> fixed)
>

Webmaster World has had many problems in getting their extensive forums
and pages sipdered propperly, without allowing harvesting by other forums.
Brett does a superb job at providing mutiple forums for participants the
world over.
Here's a 2005 explanation:
http://www.webmasterworld.com/forum9/9618.htm


Similar ThreadsPosted
whitehouse.gov is blocking " February 2, 2007, 11:51 am
Semi-OT :How Do I Know If My ISP Is Blocking Pages? August 14, 2007, 10:15 pm
Question about testing for page blocking January 9, 2005, 1:30 pm
Google blocking our Web Position Software March 7, 2005, 10:18 am
Yahoo has been blocking SeoElite's queries January 8, 2006, 8:52 pm
Yahoo has been blocking SeoElite's queries January 8, 2006, 8:55 pm
robots.txt January 12, 2005, 11:56 pm
Robots txt March 20, 2006, 8:19 am
robots.txt April 12, 2006, 8:48 am
Robots.txt April 17, 2006, 6:43 pm

Our other projects:

Art Dolls, Fairies and Mermaids - Sunnyfaces.net

Roy's Linux, Programming and Search Engines messages

1-Script XML SitemapXML Sitemap