|
Posted by Nathan Keel on May 19, 2009, 5:38 pm
Please log in for more thread options Sparticle wrote:
show/hide quoted text
>
>>Sparticle wrote:
>>> Is there anything I can do to stop google indexing https links on a
>>> site.
>>>
>>> It seems that if you get an http:// and https:// link for EVERY link
>>> when SSL is applied to the domain.
>>>
>>> i.e. Can I put some code in the robots.txt file to disallow
>>> spidering the secure links??
>>>
>>> Thank you
>>> Sparticle
>>Maybe set it so any HTTPS access read by an .htaccess file enacts the
>>robots.txt file to deny spidering, or just reject google's user agent
>>bot if it's port 443. It would be ideal if you could just have
>>separate documentroots or directories for secure access, so you could
>>have a separate robots.txt file for that area.
>
> Thank you for the info.
>
> The next site I build I'll apply the SSL to a subdomain and disallow
> the directory. It's too late for the existing one.
>
> At the moment I'm using ASP on windows 2003. A lot of stuff I see
> online about manipulating .htaccess seems to be with mod_rewrite.
> Isn't this an Apache thing?
>
> I'm not sure how to apply this to a win 2003 server.
>
> Thank you
> Sparticle
Yeah, it's just common for people to be using Apache. I don't know the
solution for ASP, other than maybe suggesting putting a check in the
scripts themselves, but there's probably a better way.
|
> site.
>
> It seems that if you get an http:// and https:// link for EVERY link
> when SSL is applied to the domain.
>
> i.e. Can I put some code in the robots.txt file to disallow spidering
> the secure links??
>
> Thank you
> Sparticle