|
Improve Google Page Rank
Is Page Rank Calculation still following the
“BackRub” Algorithm? An examination of the
PR algorithm, with examples from real world sites.
Many a time and oft, we hear the speculation
that the Page Rank Calculation has changed. Disappointingly,
it is voiced by people who should know better. Perhaps
the green monster of jealousy at seeing a spammy (competitor)
site or a “google bomb” taking a site to
PR-7 gets the better of their judgement. We will proceed
to shatter a few urban legends.
- Home page is necessarily a higher PR than the rest
of the site.
- Older pages get highest PR.
- Google considers “on page” factors or
contents to judge PR.
- Pages with most external link backs get the highest
PR.
To justify our assertions more rigorously, let’s
first quickly introduce the Page Rank Calculation.
Page rank measures the importance of a page using the
Google
page rank formula:
PR(A) = (1-d) + d (PR(T1)/C(T1) + ... +
PR(Tn)/C(Tn))
The page rank shown on a tool bar is actually a scaled
number between 0-10. The scaling is logarithmic to accommodate
the widely different number of IBL’s, where a
linear scale would not convey the information appropriately.
While we are not privy to the log factor “lf”
that Google uses for it’s Page Rank tool bar normalization,
we take a stab at the number by making a few assumptions.
We assume that Google is the highest ranking PR10 page.
We use the number of links incoming to Google reported
by Google itself. On March 29, 2006 it shows it to be
3,750,000 links. We further assume that the average
Page rank of the links pointing to the Google home page
is PR 1.
Therefore, to obtain the upper bound of the log factor(“lf”),
we just take the appropriate root. (e.g. ((Incoming
Links to Google)^(1/(Google PR-Average_Incoming_PR))
) ).
This boils down to (3.75*10^6)^(1/(9)) or lf=5.38.
Naturally, as the size of the web increases, or the
number of back links that Google exports to outside
world increases, “lf” is likely to increase.
Our tool allows you to set a different “lf”.
We use the following formula to scale numbers to the
“RPR”(real non normalized PR) and obtain
the lower and upper bounds.
((lf)^(n)) =< RPRn < ((lf)^(n+1))
As an example when n=5, 4507 < PR-5 < 24,248
Naturally, some pages are a very strong PR-5(i.e. almost
a PR-6), or a very weak PR-6. Therefore, we allow a
user to set the strength of the page. The default is
0.5(i.e. it’s the equivalent of a PR5.5 page).Now,
if the above holds, then “1”, “2”
or “3” are irrelevant. Our contention is
that Google has not changed the Page Rank calculation
in any significant way, aside from tweaking the constants.The
proof is obviously in the pudding. Let’s take
three sample sites and see how the update of April 6,
2006 has affected them.
We turn to our Event Tickets Website and examine two
pages on the site:
- Tickets
Website's Home Page , PR-4
- New
Tickets Events Page , PR-6
The comparison is indeed telling. A page that was added
a few weeks ago attained the highest PR in the site.
While the home page with most of the back links stayed
at 4. The reason is quite obvious if you examine the
actual pages. The home page is donating it’s PR
to a lot of pages, while the new events page has only
3 outgoing links(in addition to the common links). This
behavior is predictable, if we apply the “backrub”
algorithm.
Happenstance, you say? Let’s look at another
site, with less contents and a different “offer”/e-commerce
pricing model. The common feature is that the navigation
structure is exactly the same.
- Tickets
Website's Home page , PR 3
- New
Tickets Event Page , PR-5
Coincidence, you say? Let’s look at a third site
- Click
Fraud Website's Home Page —PR-4
- Click
Fraud Website's Internal Page —internal
pages like click fraud with no link to Zaralyzer but
lots of links from external sites, PR-5
In conclusion, even the most trivial changes in site
navigation structure can cause major changes in how
the search engine view your pages. For a free
Google Page Rank Improvement Consultation
on how you can improve your site structure, please contact
Sofizar.net
|