Saturday 18 May 2013

Google Crawling and indexed methods in website

=============   Popularity & Importance   =============

Now - if you actually bothered reading the linked to topics ... you should have seen this comming.

Google only has a certain amount of resources.
It has to allocate those carefully.
It will rate sites, and treat them differently, depending on how it rates them.
It seems to rate them on Popularity/Importance.
Another factor is likely to be Size of the site (number of pages, size of the pages).

This means that if your site is not popular with other sites,
if your page is not well linked to from other pages on your site,
if your page is not linked to from other popular pages on your site,
... then it will not be processed as fast as sites that are more popular, have better internal link structure etc.

And I did mention Size, yes?
Well - the major player here is that larger sites tend to have "deeper" pages (longer URLs, multiple Directories ... and less internal links from Prominent/Important/Popular pages).

There's also the aspect that Popularity only stretches so far.
Your Homepage may havea PR of 5 .... but does that really reach down to http://www.example.com/something/somethingelse/anotherthing/thisheresection/somepage.html ?

Then there may even be the aspect of "Change Rate" ... how often you have been adding/updating your site previously ... G may pre-allocate based on past behaviour?


Now - bearing all that in mind ... the simplest way to look at it is a "tier" system, or levels.
You have X number of pages, with X popularity, and X importance = Y Crawling and Z Indexing times.

Thus;
 - if you are not that popular, you get crawled less often, less pages are crawled and it takes longer to get what is Crawled appearing in the SERPs.
 - if you suddenly end up with a larger amount of content, but your popularity has not gone up at the same sort of rate, then the chances are that you will not be crawled as often/fast as you were, nor indexed as fast.

And - as it seems so many people seem to struggle with this concept ....
(NOTE: The figures/values used are completely made up!)

------------+-------------------+-------------------------|-------|--------------------|-------------------|----------------
   Size   |   Popularity   |   Link Structure  |   =  |   Frequenct   |   Quantity   |   Indexed
------------+-------------------+-------------------------|-------|--------------------|-------------------|----------------
  Small  |       Low        |         Weak        |   =  |    14 days    |         3         |    8 days
------------+-------------------+-------------------------|-------|--------------------|-------------------|----------------
  Small  |       Low        |      Moderate      |   =  |    14 days    |         3         |    7 days
------------+-------------------+-------------------------|-------|--------------------|-------------------|----------------
  Small  |   Moderate    |         Weak        |   =  |    10 days     |         4         |    7 days
------------+-------------------+-------------------------|-------|--------------------|-------------------|----------------
  Small  |   Moderate    |      Moderate      |   =  |      7 days    |         4         |    6 days
------------+-------------------+-------------------------|-------|--------------------|-------------------|----------------
  Small  |       High       |      Moderate      |   =  |      5 days    |         5         |    4 days
------------+-------------------+-------------------------|-------|--------------------|-------------------|----------------
Medium |      High        |      Moderate      |   =  |      7 days    |         4         |    5 days
------------+-------------------+-------------------------|-------|--------------------|-------------------|----------------

Okay - does that help you see what I'm on about?
(NOTE: Again - just to make flamming sure - the figures/values are complete gibberish!!!)
(NOTE: I have Not included Change Frequency, nor accounted for Sitemaps, nor Depth!)

And - here's the reall killer part about the Crawling ....
Google doesn't only crawl New things does it ... it also recrawls old things!
So though you may get 8 pages crawled - how many are likely to be "new" ones?

1 comment: