Block Google et al. from searching the site

gwlco · 2013-03-12 04:45:42

As I remember, the .htaccess file will do this. What do I need to store in it?
I don't want the site indexed or searched until I am ready.

rod barbee · 2013-03-12 05:23:11

You access your .htaccess file from the root of your site using your FTP program or your host's admin panel. There's nowhere in the TTG plugins to store that.

gwlco · 2013-03-12 05:28:28

I understand that, but what do I store in the .htaccess file for this purpose?

rod barbee · 2013-03-12 05:31:56

Google is your friend

gwlco · 2013-03-13 00:24:49

Thank you, Rod. I will be more careful in the future to do my own work. But you continue to be a good source. I want to put this in this post in case other people want the same thing.

It's good programming policy. Pros have a robots.txt. Amateurs don't. What group do you want your site to be in? This is more of an ego/image thing than a "real" reason, but in competitive areas or when applying for a job it can make a difference. Some employers may consider not hiring a webmaster who didn't know how to use one, on the assumption that they may not know other, more critical things as well. Many feel it's sloppy and unprofessional not to use one.

Instructions for creating the robots.txt file can be found here: http://www.mcanerin.com/EN/search-engine/robots-txt.asp

or here: http://searchengineland.com/google-offe … ator-13653

Here are the contents of the robots.txt file I created:

# Disallows all robots
User-agent: *
Disallow: /
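
For anyone who wants to check rules like these before uploading them, here is a minimal sketch using Python's standard `urllib.robotparser` module. The helper name `is_allowed`, the user-agent strings, and the `example.com` URL are placeholders chosen for illustration, not anything from this thread. It only shows how a crawler that honours robots.txt would read the file.

```python
from urllib.robotparser import RobotFileParser

# The rules exactly as posted above.
ROBOTS_TXT = """\
# Disallows all robots
User-agent: *
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if a compliant crawler with this user agent may fetch url.

    Hypothetical helper for illustration: it parses the robots.txt text
    locally instead of downloading it from a live site.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Placeholder agents and URL; substitute your own site and paths.
    for agent in ("Googlebot", "SomeOtherBot"):
        verdict = is_allowed(ROBOTS_TXT, agent, "https://example.com/galleries/")
        print(f"{agent}: allowed={verdict}")
```

With the rules above, every path should come back as not allowed for every user agent. The file must sit at the root of the site (for example `/robots.txt`) for crawlers to find it.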

Last edited by gwlco (2013-03-13 00:29:05)

rod barbee · 2013-03-13 00:35:30

Good info. Thanks for posting it.

Kris · 2013-03-13 04:59:11

robots.txt is the standard way.

Unfortunately, it is entirely voluntary on the part of the web crawler whether to honour it or not.

serowe · 2013-03-18 16:49:25

I run a number of different domains, one of which is a genealogy site. I use exclusions in robots.txt files and, so far (touch wood here!), none of this site can be found using any of the search engines I have used or experimented with.

Whilst it is true I see some activity from search engines, it is usually only in the order of around 30-100k of data a month from each one (as a comparison, one other site I don't restrict currently shows Google accessing around 10Mb of data a month - it all starts to build up).

Other sites to consider not allowing are archive.org and sitedossier.com, amongst others - these sites archive your site (without permission), and it becomes increasingly frustrating when people find older versions of your site when you don't want them found!
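
Blocking only particular crawlers means giving each one its own `User-agent` group in robots.txt. The sketch below is one way to generate such a file with Python. The function name `build_robots_txt` and the agent name `ExampleArchiveBot` are made up for illustration. The real user-agent strings for archive.org, sitedossier.com, or any other crawler have to be looked up in that service's own documentation.

```python
from urllib.robotparser import RobotFileParser


def build_robots_txt(blocked_agents, blocked_paths_for_all=()):
    """Build robots.txt text that shuts out the named crawlers entirely.

    Hypothetical helper: all other crawlers are only kept away from
    blocked_paths_for_all (or from nothing, if that is empty).
    """
    lines = []
    for agent in blocked_agents:
        # One group per blocked crawler, separated by a blank line.
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    lines.append("User-agent: *")
    if blocked_paths_for_all:
        lines += [f"Disallow: {path}" for path in blocked_paths_for_all]
    else:
        # An empty Disallow value means nothing is disallowed.
        lines.append("Disallow:")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # "ExampleArchiveBot" is a placeholder, not a real crawler name.
    text = build_robots_txt(["ExampleArchiveBot"], ["/private/"])
    print(text)

    # Read the result back the way a compliant crawler would.
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    print(parser.can_fetch("ExampleArchiveBot", "https://example.com/"))
    print(parser.can_fetch("Googlebot", "https://example.com/"))
```

As with any robots.txt rule, this only affects crawlers that choose to honour the file.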

lofty · 2013-03-18 21:55:50

Could I ask a real dumb newbie question?

Why would I want to block crawlers from indexing my site?

--Lofty

Kris · 2013-03-18 22:55:50

Not all crawlers are benign.

Not all galleries (e.g. boudoir, wedding, etc.) are of public interest, even though they are held on a web server.

lofty · 2013-03-19 00:27:56

OK Kris,

What are the consequences of stopping the crawlers, presumably no one would know about your site unless you told them?

--Lofty

Matthew · 2013-03-19 02:06:26

Right. The intent is to prevent your site -- or sections thereof -- from being found via search engine. The opposite of SEO.

lofty · 2013-03-19 02:22:57

OK, thanks guys, asked and answered.

serowe · 2013-03-19 05:55:06

It's not just to stop your site from being 'searched'; it's also to prevent these crawlers from using massive amounts of bandwidth, which they can, and often do!

gwlco · 2013-03-27 23:04:31

I wanted to block the 'crawlers' from my site while in development since many things are changing almost on a daily basis. I will wait until my site is more stable.

Community @ The Turning Gate
