October 30, 2007
Just came form holidays. Well, obviously not. Sorry. Just needed to invent some excuse for not writing for such a long time. My holidays ended weeks ago but as it always happens you come back to work only to discover such an enormous backlog that it takes twice as long as my holidays lasted to sort it out. So, going through the pile of things amassed over time I came about this draft that I started sometime in midsummer. I double-checked the factual side of things and all seems perfectly actual even now. So, after a little polishing I decided to publish this post in a pretty much the same shape it would have appeared in August (better late than never).
Doing SEO myself for quite some time and having a good number of friends and acquaintances of the same trade, naturally I had plenty of opportunities to observe and analyse different SEO practices and eventually I came to a rather strange conclusion: it seems there are probably only two types of SEOs in whole world. All of us have to deal with the same set of circumstances, namely we have to feed something to Search Engines in order to get a desired output from them and we have not a slightest idea of what is happening in the process. Search engines are perfect black boxes for us as we know what comes in and we can see what comes out but we are not allowed to see inside the box. There may be two ways to approach this rather tricky situation. One type of SEOs is practicing an approach that is very much akin to ancient magic. They take their client, perform some strange divination rituals on them and utter their prophecies that are just bound to be fulfilled. It isn’t a big deal if they don’t – there are so many taboos in their recommendations that it’s simply impossible not to break at least one of them. And of course this sin of breaking taboo is the only reason the magic didn’t work, that is of course if it didn’t, because if it did it is only because it is what it is – magic. Of course I know some from this bunch whose magic really works no matter what the circumstances are, but these are rare exceptions. Remarkably, SEO magicians are almost exclusively found among the troops of so-called white-hat SEO. The other party that is quite distinct from the former one consists of those who are mostly fascinated by the workings of the engine itself. The greatest joy for them is to catch a glimpse of those gear wheels, triggers, levers and latches all in motion. The pleasure, I confess very much akin to that of a Peeping Tom watching naked Lady Godiva who incidentally happened to be a patron of all engineers. The grand idea is in this case is ultimately to perform some sort of a reverse engineering upon Search Engine mechanism thus achieving a thrilling sensation of superiority. But this could never be perfected to the end perhaps not only because these black boxes are so carefully guarded by their owners, but also because even for the developers of Search Engines they are now exactly this - perfect black boxes. Otherwise why Matt Cutts would be using the word heuristics to describe the working of the Google engine so many times?
So, the article here is probably naye good for divination purposes but might be of some interest to those among us who belong as I do to the voyeur type.
This whole thing was triggered by a strange example that was discovered quite by chance: once I’ve been checking Google output for a rather unusual search: art power. Don’t ask me why, as it is just my little hobby to check out weird searches. This search phrase is a strange fruit indeed as it belongs to a class of generic searches, those that are likely not to produce large volume of searches but naturally have extremely high competition. This one quite as expected returns 321,000,000 results (at the time of writing). And now allow me just a small side note on the way Google deals with generic searches: the results tend to have much larger than average proportion of pages coming from large media resources, often newspapers like NYT and so on and media portals like BBC etc (you got the idea). Hence, as I observed, if you are able to obtain a link from a resource of this kind it will directly boost the responsiveness of your site to generic searches. But now let’s come back to the subject. Somewhere within results 351 - 360 for this search you will find this strange page. Mind you it isn’t such a bad position considering the competition – we can insist that at least 320999640 pages (321,000,000 - 360) are less relevant in Google’s opinion that the page in question. By now I expect you couldn’t resist the temptation to click on the link above to examine the actual page. Now, what do you see? Correct, a perfectly blank white page, no content whatsoever. Shocking enough as such isn’t it? And now note an even more distressing detail: Google thinks there is the word art on this empty page! What the hell is going on you ask. Although you will not get a definitive answer even if you open cached text version although it will get you much closer. I hope you in no time realised that what we are dealing here with is a frameset. Remarkable on its own and even allows us to formulate our first conclusion: it seems likely that Google assigned the content of frame to the frameset URL. But we’ll pass this remarkable fact without much discussion here as it is only indirectly relates to our question. Suffice to say that the preference to the frameset instead of nested pages is most likely defined by the overall site structure rather than by the framed page construction. Just to remind ourselves that our main question is: how come such a page ranks so high in Google? The only way to learn this is to go to the page itself and examine what it is made of. First of all of course goes our indispensable virtual currency - the Google PR and the page itself has moderate PR 3. Remarkably the word art is present only three times in one tight paragraph somewhere towards the very end of the page plus these three letters can be met as parts of bigger words about a dozen of times. Surely, it’s not enough to achieve such a high position. And what about the second word, power, as apparently it is nowhere to be seen on the page itself? And as always View Page Source is at our disposal to help in our predicament. Now drum roll, please! What do you see? Shock! Horror! We see the old number sign link trick inadvertently used on this page about 30 times. For those who are not aware of this trick here is a brief story. Just to remind you, the number sign as a partial value of the href attribute of the a tag was initially introduced to be used together with the name attribute to provide an easy on-page navigation. Basically it creates a relative link to another part of the same document. Subsequently with introduction of JavaScript this became a tool to create a JavaScript OnClick event without actually sending the browser away from the page and this is exactly how it is used in our example. The SEO part of it mainly consisted of a superstition that circulated since times immemorial that stated that placing a number sign link would somehow magically increase the potency and power of a page. I must admit I used this trick indiscriminately so many times in my youth without even bothering to think what it actually does, just relying on its magic powers. When I started paying more attention to what I am actually doing it turned out to be not so difficult to figure out what is happening with a number sign link. Basically Search Engine for some obscure reason takes these links seriously and assumes they are perhaps as valid as any other internal link. The only difference in this case is that a page links to itself which supposedly creates some kind of a loop back in PR or other link ranking calculations and as a result it should double or at least amplify the importance of the page. Presumably the effect of an outside link pointing to a page with this kind of loop back should also be amplified compare to an ordinary page thus allowing to create powerful pages on purpose. And all this is just because the number sign link somehow temporarily short-circuits Search Engine algorithms. Not bad, isn’t it? But it is surely black hat, isn’t it? Well, as usually it depend on the definition. I assume there is no algorithmically doable way to distinguish between a normal use of a number sign in link and its SEO-related application. Although in theory it is a punishable offence, in practice the prosecution can not be enforced as it remains untraceable. So in effect Search Engines algorithms could not penalise its usage, but only compensate for its effect. Being such an old trick I thought it had been compensated for years ago by all main Search Engines. In any way in my recent memory only MSN had been observed to be still prone to this, but who takes MSN for something other than a joke in terms of their algorithms? And now I am proven to be wrong again. As you can see the mighty Google itself is swallowing the same old bait. What in the world is going on here? We’ve observed hoe this page might have received a preferential treatment but still have no answer to our main question: why it ranks for art power having no mention of power on the page itself? Now let’s take a closer look at those loop back links we discovered. Mon Dieu! (Pardon my French) Yet another shocking revelation awaits us here. What we see an OnClick JavaScript alert that simply throws up a popup message that has the word POWER twice in each text element. If we count all instances this word is repeated altogether 62 times on this page! If this was a part of the page text I would not have been surprised at all. It reminds me so much of a classical keyword stuffing for a two words search where the first word has a very low density whilst the second one is overstuffed. This combination had been known to work since times immemorial and is working even now inasmuch as keyword density still matters. The preference for this unbalanced pair type of pages for two word search phrases perhaps comes from the fact that pages with even density of both search words en even more so with both search words coming together in sequence get penalised for fear of keyword stuffing and those pages that look the least similar to suspicious pages get favoured. All in all it would have worked out perfectly if the text of this JavaScript alert was counted as a part of page text, but we all remember the textbook SEO axiom: Search Engine Robots don’t read JavaScript. Or do they? OK, what we really know with a high degree of probability is that they don’t execute JavaScript. And this is only based on our assumption that it is much too much in terms of computing resources to attempt to run each and every section of JavaScript code on the net. But again, it is only our assumption, albeit confirmed by similar declarations coming from Search Engine spokespeople, but do we always have to trust their statements? Naturally we can not assume on this basis that Search Engines don’t read JavaScript code and don’t use it as a part of their relevancy calculation algorithms. Particularly attractive this idea looks when applied to our case where JavaScript alert message can be seen under certain circumstances as part of the overall page text since human agent can read it after all. Admittedly there are some arguments to counter this hypothesis. Formally the JavaScript alert is not counted as a page content proper which can be easily confirmed by the absence of our page in this search. Surprisingly enough we don’t find our page even if we search in anchor like this.
Equally however we know that Google never tells us the whole truth. Google insists that the word power is present in links leading to our page but at the same time displays no results for link operator search for this URL. Clearly something is missing here. After all Google allows us to see only what it wants us to see. Hence I’ll stick to my guns and offer the only explanation I think is plausible for this example? However strange and unorthodox it might look: Google actually reads JavaScript and hence
- Google counts the word POWER in JavaScript alert towards the total in keyword density thus taking the page for the desired unbalanced pair type
- Google counts the word POWER in JavaScript as anchor text too (actually I have reasons to believe that virtually any text contained within a tag will be counted as an anchor text)
- Google is still prone to the number sign link loop back
This is so far the only possible explanation I can find for the phenomena we just observed but I am myself very far from being completely satisfied by it. The flaw in my analysis is apparent and should I mention that I made no effort to hide it? Having no better explanation myself I’ll be ready to accept the challenge and if you are ready to come up with an alternative I will only be happy to get engaged in a fruitful discussion. To aid those prepared to offer an alternative analysis of this example I may throw in another curious fact, namely the root of irational.org domain itself ranks rather high for our art power search too.
If the line of reasoning drawn above holds true, the consequences are quite intriguing. Not only one may extract some practical tricks from these features, but it also confirms that some statements coming from Google itself were at least premature, particularly that the possibility of effective Google Bowling is still alive and kicking.
For those who expected to discover some sort of magic potion in this article I may only say: I don’t sell snake oil here.
tags: godiva, google, javascript, lady, peeping, seo, tom, anchor text, art, holidays, linking, magic, onclick, power, trick, voyeur
Posted by LZZR under SEO Tricks, Google | Comments (3)
July 4, 2007
We are all hungry for inbound links these days and even more so since the introduction of nofollow which I personally hate as it did not solve a bit the problem of comment spam it was supposed to solve but had a detrimental effect on social networks and blogsphere in particular. It turned all of us bloggers into a particularly greedy bunch. Before placing a link we started to think of how much we would lose if it’s an open link without nofollow. Although my attempt to mock the whole nofollow concept by inventing a rel=follow attribute proved to be popular enough it nevertheless is too limited to change the picture in any significant way. The only solution to this seems to be generosity. But generosity itself isn’t enough in a world governed by the principle of self-interest. Attempts to persuade an average webmaster that outgoing links from his/her site are beneficial since eventually they will bring a hub status to the linking resource were not too convincing. It needed more than just empty promises of some benefits in the future. Linking should be mutually beneficial for both the linking site and the linked one and more these benefits must be obvious and immediate. For quite some time I’ve been thinking about a strategy that would realize these objectives and finally I think I came up with one.
Initially the idea was triggered by observing rather old-fashioned SEO attempts to pull out their own sites from Search Engine results using site: operator. I am sure you are all familiar with this dumb trick and if not just follow this link to see what it does. You see? This is an example of pure greed and selfishness. You pull out your own site and only your own site. Now let’s try a different search this time using not site: but link: operator (example). These are URLs of resources linking to lzzr.com. And what’s the point, you might ask? Surely if you are greedy it does not make any sense to promote someone else’s URL but if you are clever you’ll quickly realize that it’s in your own best interests to have inbound links from strong pages. Think about the amount of time and effort (and very often money) an average SEO spends on acquiring links from pages with good Page Rank. Now, the same goal can be achieved simply by making some effort promoting pages that already have natural links to you! They are already out there, just give them some link love. Of course this way you push up resources that are not owned or operated by you and hence you loose something but the gain which in my opinion outweighs the hypothetical loss you may suffer is that all of those pages link back to you!
From strictly SEO point of view you create a perfect 3-way linking structure where Search Engine Results Page (SERP) serves as an intermediary between you and pages linking to you. In other words you link to results page and results page links to a number of sites each having a link back to you. This 3-way linking is known to be much more effective than a usual reciprocal linking and is not punishable like the outdated link farm method. On a side note: I strongly insist that the structure I just described can not be considered search engine spamming as it makes a perfect sense from user’s point of view. Let me explain: I as a webmaster certainly hold a pretty high opinion about myself and my website. Equally I should value those who appreciated my website by placing a link to it. Thus promoting websites linking to you is just a simple sign of respect and appreciation. I also assume that sites linking to my content are relevant to this content hence linking to them enhances user experience!
However the potential of this structure is limited since search engines are rarely indexing their own pages (Google results page will not be indexed and processed by Google but Netscape results page for example will be seen in Google) and even if they did these result pages are still orphan i. e. have no internal links coming from within the search engine itself. A perfect solution would be to have a page like this to be a part of your own resource – an objective that was horribly difficult to achieve until very recently in purely technical terms.
Now here is where Web 2.0 comes to the rescue. Let’s check this example from Google BlogSearch (similarly you can pull your results from Technorati but Technorati is lame). True it only displays links from blogs but look at the bottom of the Google page – you see it can be pulled out as Feed! Now, do I need to continue? Almost every single Web 2.0 resource or service comes with a built-in ability to pull and display feeds. Now all you need to do to promote blogs linking to you is to set up such feeds at every possible place and watch how pages linking to you grow in reputation and PageRank pushing up your own resource on the way.
This is exactly what I do for my blog. The easiest way is to create Yahoo Pipes page combining Google and Technorati feeds. But the easiest doesn’t mean the best since Yahoo Pipes are notoriously slow and are usually badly indexed by Google. Surely I must have more SEO Tricks up my sleeve. I obviously keep and maintain LZZR accounts on all major social networking services to beef up both LZZR online presence and exposure which is a bon ton thing to do these days. At every opportunity I include a feed with links to sites linking to LZZR.com. Of course different services have different options and consequently different SEO efficiency. On major blog systems such as Blogger, MSN Live, Wordpress.com etc. you are only allowed to display a limited number of feed entries in your external feeds and as a result sites that are at the top of Google Blog Search and the most recent Technorati entries will be favoured. In addition these link blocks are set up as separate page areas different from content and links due to their small number will have a substantial link churn. Not so good but at least something. Squidoo is a better place as they allow including a substantially large number of feed entries and besides they are displayed within the main content area of the page. But the best opportunities can be realised at Tumblr and Jaiku blog services. These two are really godsend as each new feed entry in those systems is published as a separate blog posting thus permanently affixing each link. Another interesting option is to propagate you back links feed by setting up special groups at various services like Bumpzee or the same Squidoo.
Note: I purposly avoided placing such link feed blocks on lzzr.com itself to prevent Link Farm structure.
So here is the deal: you link to any page at LZZR.com and get a bunch of backlinks to your site from places I manage and be sure do my best to promote. Your site must be a blog and it should be already indexed by Technorati and Google. I suggest you write a blog post about this very idea and link back to this very article rather than to the root. This will be more beneficial to you as the article itself already contains a bunch of outgoing links to places where I set up link feeds. However if you don’t want to link direct to the article any link to any page within lzzr.com will do including a root link. I became so excited about this idea that I even designed some buttons for the occasion. I think they visually represent the idea of LZZR linking where a LZZR link reflects from an intermediary.


Feel free to copy them and don’t be afraid of hotlinking as hotlinking is good!
Now if you are still thinking that this whole idea is just a shameless self-promotion – go back to the beginning and read again. I am sure you’ll find a rational core in all this. There is a good deal of self-interest in all this of course. I get inbound links from you. But at the same time you get a number of links pointing to you in return and this number is going to grow for as long as I’ll be finding new places to insert my link feed into. I will also be promoting pages with back links by all means available as this is in my own best interest. Thus there comes a perfect win-win combination!
PS:
Of course the temptation to copyright this idea and call it LZZR linkingTM is huge but in the spirit of true generosity I will refrain from doing so. You are free to replicate this structure for your site as this would be a really effective way to fight the nofollow greed. The only thing I would object to is if you decide to pass this idea as yours without proper attribution in a form of a backlink to this article :-). And should it be mentioned that linking back is in your best interests anyway… It’s good to link LZZR.com!
tags: 3 way linking, feed, google, lzzr, lzzrseo, pr, page rank, pagerank, rss, seo, seo trick, back link, backlink, bon ton, generosity, link churn, link farm, link love, web 2.0, win-win
Posted by LZZR under eSEO, SEO Tricks, Search Engines, Blog | Comments (27)
June 19, 2007
As you might have noticed from my post about EXIF I am certainly fascinated by the opportunity to squeese out a bit more of SEO not only from the traditional textual content but also from visuals. Every little bit counts and if you can enhance visibility of your site by just another grain, however small it might be it’s nevertheless worth it.
Ever since images appeared on the web webmasters tended to see it as a problem, rather than an advantage for one simple reason - unlike predominantly textual pages which are rarely over 100K images used to be relatively heavy. Not only they slowed down pageloads but also consumed plenty of bandwidth. And this approach seemed to be reasonable enough at the time since even relatively small images could weight several times more whilst large images dozens of times more than the page itself. And you remember the webmaster’s nightmare of not so distant past called hotlinking? Weren’t you just going berserk when some 12 y. o. idiot puts a hotlink to your image from somewhere in Xanga and it starts leeching like crazy eating up megabytes of your daily traffic allowance? Every single trick from referrer check to MOD rewrite we used to fight leechers, it used to be the hottest subject for all webmaster’s forums just a couple of years ago. SEOs too hated images considering them to be useless since search engines can not read images anyways. This negative attitude outlived by far the reasons causing it. Even at the wake of introduction of image search by all major search engines SEOs constantly deranged image search traffic as being not relevant and hurting bandwidth. Some even suggested SEO tricks designed to hide images from Image Search spiders. I am afraid that this is still the predominant attitude in SEO community I am myself guilty of sharing it for quite some time.
Only the recent experience with some of the sites I manage convinced me that this attitude is in most part completely wrong. Let’s look at the most widespread negative arguments and see how relevant they are indeed being applied for sites you own or manage.
1) Images consume too much bandwidth - Dismissed as being completely outdated. Coming from the stone age of the Internet it’s a complete anachronism in our age of Youtube and podcasting. You can’t save a penny economizing on image bandwidth.
2) Hotlinking is bad as another site eats up your bandwidth - Dismissed as greedy and unsociable. Can’t you just allow someone else to use something that belongs to you but costs you nothing? (also see above)
3) Hotlinking is bad because it constitutes copyright infringement - Plausible, but only in part. Most copyright complaints about image theft are completely groundless as most of those complaining just don’t understand that merely resizing an image in Photoshop or other image editor does not produce any copy rights. As for those really holding copy rights over leeched images using proper complaint procedure (see DMCA for details) is the most straightforward and by far the most effective way of dealing with this problem.
4) Image Search Traffic is useless since those searching for images are not interested in products and hence are not likely to become your customers - Plausible but only if you own or manage an online shop. Now, when the Internet is turning from some kind of gigantic department store to a wide arena for social interaction this argument becomes less and less relevant. If your site is based on image content you are more than anybody interested in attracting Image Search traffic since if someone came to your site after seeing one image it’s likely that he or she will enjoy other images from your site too, thus joining your fan base which is a clear blessing. Of course if you are managing a shopping site or most of your monetization is per sale or per action this blessing turns into a curse as logically the type of user coming from Image Search is most likely doing a window shopping rather than the real one. But it flips back to blessing again if you base your strategy on per click monetization like AdSense for example. It’s true that an Image Search surfer is much less focused and less likely to buy online but precisely for this lack of focus this surfer is more likely to click on ads thus increasing your site CTR and overall conversion. I’ve seen it myself - Image Search visitors are a blessing for per click conversion rates. Shall I say that Image Search enhances your site visibility too?
I don’t know if my arguments sound convincing to you, but as far as I am concerned image search traffic is as good as any other kind of traffic if not better in some respects. So now I am going to share some ideas on how to make your site more visible for image search. If you don’t believe your site can benefit from image search traffic, read on anyway, just don’t follow the recommendations I am about to list and do the opposite to this small checklist.
So here is how to make your resource Image Search friendly:
1) The basics available in any SEO textbook for dummies. Give your image a meaningful file name that contains your desired keywords. Put your keywords in alt (which you should do anyway to validate your code accordingly) and more importantly into your image title. Surround your image with keyword-rich textual content.
2) Don’t be afraid of hotlinking! Let every adolescent boy and girl from Facebook or MySpace hotlink to your images. This will do for images the same as normal linking does for text! Don’t worry that these not so SEO-literate young individuals will not put appropriate title or alt while hotlinking: if your image name is filled with keywords anyway this will give Image Robots enough clue. If understandably you don’t want to leave this important issue at the mercy of ruthless teenagers - hotlink yourself! Put it in relevant textual content with keywords in alt and title. The advantage is obvious - it looks inconspicuous, i. e. it does not look like link spam! Spread your hotlinks as wide as possible and be prolific - quick while you can, while hotlinks are not yet as heavily policed as text links with nofollow.
3) You can further enhance searchability of your images by placing a thumbnail that links to your full-size image (NB! not to a page with a full-size image but to the image itself).
4) As a last remedy I may suggest opting in for Google Enhanced image search from your webmaster console - theoretically it should act as a trigger for Google’s Image Search but the only problem I see is that it is not controlled by you.
Although with a benefit of hindsight I can see that my ideas about EXIF are a bit silly but I certainly know about one service that does read EXIF for sure and it is (you guessed it) the beloved Flickr by Yahoo! But this is altogether another story and I promise to write about SEO on Flickr soon.
tags: adsense, dmca, exif, facebook, flickr, myspace, seo, seo on flickr, berserk, copyright, dummies, hot, hot link, hotlinking, image, image search, image theft, keywords, leeching, link, mod rewrite, monetization, nofollow, per action, per click, per sale, searchability, spam, text, textbook, traffic, webmaster, xanga
Posted by LZZR under Search Engines | Comments (3)
May 30, 2007
Internet censorship seems to be a rather far-fetched issue until it actually affects you directly and personally.
I admit I do things that make a lot of people angry and this time it seems Yahoo! Inc itself decided to get angry with me! I perfectly well know why as my post on Yahoo handcrafting describing exactly how Yahoo! employees fiddle with your seach results clearly annoyed some of them as it was their own stupidity that allowed this information to leak out to the publc domain.
Now they are making some effort attempting to stop it from spreading - all feeds from my website lzzr.com had been blocked in Yahoo! Pipes and effectively censored out. No more Yahoo Pipes for LZZR.com!
Of course they are in their own right as it is their service and they are to decide which feeds to allow to Yahoo! Pipes and which to block out. I am not in a position to deny them this right. Neither I am in a position to deny them their right to act stupid. It’s been prooven too many times that repressive measures like censoring information resources you don’t like always results in an opposite effect! It gets spread by other channels and tends to snowball at each revolution.
Hopefully this is not the official policy of Yahoo! Inc as I don’t believe their top management is stupid enough to ruin their new project by introducing selective censorship. Most likely it is a personal revenge of a certain Yahoo! employee whose actions resulted in leakage of sensitive information. I have reasons to believe this person had to go through some hard times as a result of this blunder. Now this individual seems to be making another mistake.
On the other hand I may be wrong and somewhere at the top level of Yahoo! there is a clear policy of cutting out websits publishing information that is not exactly favoured by Yahoo. Not surprising and equally stupid considering the fact that Yahoo Inc attitudes towards censorship once already resulted in an enormous PR disaster when in 2005 Yahoo collaborated with Communist China helping Chinese authorities to imprison a journalist for 10 years. Shi Tao, a reporter who worked for the Contemporary Business News nwespaper in China was found guilty of sending foreign-based websites the text of an internal Communist Party message. At this time Yahoo! was accused by Western media of providing Chinese investigating organs with information that helped link Shi Tao’s personal e-mail account and the text of the message to his computer.
Allow me a quote from BBC News website:
“We already knew that Yahoo! collaborates enthusiastically with the Chinese regime in questions of censorship, and now we know it is a Chinese police informant as well,” Reporters Without Borders said in a statement.
So, it may well be that Yahoo! Inc simply likes censorship policies since Yahoo! record on sensorship looks rather consistent - from acting as a Chinese police informant to censoring their owns search results and individual websites.
tags: shi tao, yahoo, yahoo pipes, blunder, censorship, internet, policy, scandal, search results
Posted by LZZR under SEO Ethics, Yahoo | Comments (5)
May 23, 2007
Google AdSense continues to surprise and since my last post on this subject I happened to bump into yet another Ad sense thing that bothers me so much that I just can’t help sharing it with you.
If you are like me do most of your Pay Per Click monetization via Google AdSense you just have to be watching what it does very closely and attentively as good part of your income depends on it. And if you do, you might have noticed as I did some time ago that from time to time AdSense statistics spit out some funny figures. Almost every other day I see some funny irregularities like a channel that had no clicks but some impressions suddenly shows greater than zero eCPM. I am sure I am not the only one who had noticed this strange irregularity. For a long time I tended to write this off thinking that big numbers maths is something way beyond my understanding and all this would have been forgotten if there was not another more intriguing figure that tended to pop up less often but still often enough for me to take a note of. Occasionally in the column directly adjusted to the eCPM one and inconspicuously called Earnings I could see something like US$0.01. Nothing unusual, you say. Well, it would have been so if a corresponding value of a column titled Clicks was not displaying a big round zero. Needless to say that Page CTR cell also quite logically had zero percent in it.
Now, here is the question: how come I earned a cent from a Pay Per Click Network without having any clicks? Must be a miracle - IN GOD WE TRUST - indeed…
Can’t grasp the logic of all this. Setting aside the possibility if divine intervention as being highly unlikely in my case, let’s look for possible materialistic explanations. No big number theory can explain this phenomehae as from the course of elementary maths I remember that whatever you do with zero either you divide or multiply by zero you should always get zero. So, how was I given this one cent out of nothing as The Universal Laws of Science and The Law of Creation and Preservation of Matter and Energy in particular tell us that one cent coins don’t come from nothing under normal circumstances (and even less so do dollar bills). My most immediate thought was that Google rewards not only clicks but impressions as well (hence the title of this post). However strange it might seem if you think about it in fact it is only logical to reward sites performing well in terms of impressions but having users dumb enough not to click like crazy on those highly attractive and clickable AdSense blocks. Now, when temptation to look for AdSense alternative seems to be on the increase it does make sense to add a bit of per impression to the traditional per click concept to stimulate publishers thus saving your Ad network from collapsing. The only problem here is how you do it? The easiest way seems to introduce some bonus per view coefficient. But here another problem arises: where do you get funds to pay per view premiums if you are a per click network? Well, the answer is not so difficult to guess– of course from the part advertisers are paying into your ad network as per click rates (it would have been plain stupid to deduct it from your own profits, wouldn’t it?). But now you are facing another difficult task – how to calculate this whole thing as simple arithmetic i. e. 100% per click fee split between a publisher and a network at a known proportion will be of no help. For this the notion of heuristics widely popularized by GoogleGuy AKA Matt Cutts comes to the rescue.
Here is how Wikipedia defines Heuristics in Computer Science (I really like this one
):
In computer science, a heuristic is a technique designed to solve a problem that ignores whether the solution can be proven to be correct, but which usually produces a good solution or solves a simpler problem that contains or intersects with the solution of the more complex problem.
Heuristics are intended to gain computational performance or conceptual simplicity, potentially at the cost of accuracy or precision.
Not only this method is something of a black box - we know what comes in and we see what comes out and how it does this we don’t really know and don’t care (see Black Box schematics diagram showing how BlackBox works for details). It also outputs results that are not quite accurate and not exactly precise. A method pretty much the same as Throw Shit At a Wall technique in SEO by Dax. Now, I think we’ve got an explanation!
I can not be sure that Google AdSense does heuristics for the purpose of rewarding per view sites (or should I say only for this purpose?) but I am pretty sure it placed a big black box ful