Google is tough to trick. But what if you wanted to trick it? Well, even if
you could trick it, this would probably be unwise, because your site could
end up in the "Trash Pile" list. This largely frees webmasters from spending
time on tricks and page hacks designed to get higher rankings.

There are several reasons your pages may not show up, including the way they
are generated, their availability, and their content.

According to information at Google's site, Google limits the number of
dynamically generated pages that it indexes at any site. Site crawlers can
generate a tremendous amount of site traffic in a short amount of time, so
Google limits the number of dynamic pages it requests to avoid monopolizing
a server's resources.

Frames can also cause problems. This is because it's difficult for an
automated system to know what should be indexed: the frame, the individual
panes, the entire page, or the alternate content. If your site uses frames,
it's very important that your frames include "noframes" content. This is
alternate content for browsers that can't handle frames. It is also used by
search engines.

If your site doesn't provide an internal path to all of your pages, search
robots may not find them. These pages may also not be indexed if they are
only linked to by pages that are not in the index for other reasons, such as
being dynamically generated. This problem can be avoided by including an
index page that provides direct links to pages that have convoluted
navigation paths.

Pages may also be left out if they are not available when search crawlers
visit. Many sites do periodic maintenance at off hours, or cycle servers
during the night. If your site is not fully redundant, it may be unavailable
at these times, which can result in your site being left out of the index.
This will normally be corrected the next time the crawler visits your site,
but it should be avoided because it makes it difficult for users to find
your site.

In rare cases, page content can cause your pages to be left out. If Google
identifies elements on your pages that it thinks are inappropriate for
general audiences, the pages will get filtered out. The content will be
indexed, but may not show up in search results, depending on the searcher's
SafeSearch settings.

Pages are also filtered out if they do not meet Google's "quality
standards". Google doesn't detail what its quality standards are, but things
like page cloaking, invisible text, or other techniques designed to fool
search engines can cause pages to be excluded. Pages are also occasionally
excluded for copyright reasons, or because the page content is inflammatory.
Fortunately, it's rare for any of these problems to affect typical sites.
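
The advice about giving search robots an internal path to every page can be checked mechanically. The following is a minimal sketch, not a description of how Google's crawler works: it assumes you already have a map of which pages link to which (the SITE_LINKS dictionary, its page paths, and all three function names are hypothetical, made up for illustration). It follows links from the home page the way a robot might, lists the pages it never reaches, and builds a simple list of direct links that could go on an index page.

```python
from collections import deque
from html import escape

# Hypothetical link map for illustration: each page maps to the pages it links to.
SITE_LINKS = {
    "/index.html": ["/products.html", "/about.html"],
    "/products.html": ["/index.html"],
    "/about.html": [],
    "/specs/widget.html": ["/index.html"],        # no page links here
    "/archive/2001.html": ["/archive/2000.html"],  # no page links here either
}


def reachable_pages(links, start):
    """Return the set of pages reachable from start by following links."""
    seen = {start}
    queue = deque([start])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return seen


def unreachable_pages(links, start):
    """Return a sorted list of known pages with no link path from start."""
    all_pages = set(links)
    for targets in links.values():
        all_pages.update(targets)
    return sorted(all_pages - reachable_pages(links, start))


def build_index_page(paths):
    """Return an HTML list with one direct link per path."""
    items = "\n".join(
        f'  <li><a href="{escape(p, quote=True)}">{escape(p)}</a></li>'
        for p in paths
    )
    return f"<ul>\n{items}\n</ul>"


if __name__ == "__main__":
    missing = unreachable_pages(SITE_LINKS, "/index.html")
    print("Pages with no internal path:", missing)
    print(build_index_page(missing))
```

Linking the generated list from a page that robots already reach gives every listed page a direct internal path. A real site would build the link map from its own files or server logs rather than typing it in by hand.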