Excellent SEO - Guide of SEO, Common SEO Mistakes, SEO Process, Automatic SEO, Death to SEO

Thursday 13 December 2007

Common SEO Mistakes

Graphic Header

Sites are very often designed with a graphic header, typically an image of the company logo occupying the full page width. Do not do this. The upper part of a page is a very valuable place where you should insert your most important keywords for SEO. With a graphic image, that prime position is wasted because search engines cannot make use of images. Sometimes you come across completely absurd situations: the header contains text information but, to make it look more attractive, it is created as an image. The text in it cannot be indexed by search engines, so it will not contribute to the page's ranking. If you must present a logo, the best way is a hybrid approach: place the graphic logo at the top of each page and size it so that it does not occupy the entire width, then use a text header to make up the rest of the width.

Graphic Navigation Menu

The situation is similar to the previous one: internal links on your site should contain keywords, which gives an additional advantage in SEO ranking. If your navigation menu consists of graphic elements to make it more attractive, search engines will not be able to index the text of its links. If you cannot avoid a graphic menu, at least remember to specify correct ALT attributes for all images.

Script Navigation

Sometimes scripts are used for site navigation. As an SEO practitioner, you should understand that search engines cannot read or execute scripts. A link specified by a script is therefore not available to the search engine: the search robot will not follow it, and parts of your site will not be indexed. If you use navigation scripts, you must provide regular HTML duplicates of the links so that they are visible to everyone, both human visitors and search robots.

Session Identifier

Some sites use session identifiers. This means that each visitor gets a unique parameter (&session_id=) when he or she arrives at the site. This ID is added to the address of each page visited on the site. Session IDs help site owners to collect useful statistics, including information about visitors' behavior. However, from the point of view of a search robot, a page with a new address is a brand new page. This means that, each time the search robot comes to such a site, it will get a new session identifier and will consider the pages as new ones whenever it visits them.

Search engines do have algorithms for consolidating mirrors and pages with the same content, so sites with session IDs should be recognized and indexed correctly. However, such sites are difficult to index and are sometimes indexed incorrectly, which has an adverse effect on page ranking. If SEO matters for your site, I recommend that you avoid session identifiers if possible.
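To see why session IDs make one page look like many, here is a minimal Python sketch that strips the session parameter from a URL so that two visits compare equal. The function name strip_session_id and the example.com addresses are hypothetical; the parameter name session_id is taken from the text above.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode


def strip_session_id(url, param="session_id"):
    """Return the URL with the session parameter removed from its query string."""
    parts = urlsplit(url)
    # Keep every query parameter except the session identifier.
    kept = [(key, value)
            for key, value in parse_qsl(parts.query, keep_blank_values=True)
            if key != param]
    return urlunsplit((parts.scheme, parts.netloc, parts.path,
                       urlencode(kept), parts.fragment))


# Two visits to the same page receive different session IDs (hypothetical URLs).
first_visit = "http://www.example.com/page.php?id=7&session_id=abc123"
second_visit = "http://www.example.com/page.php?id=7&session_id=xyz789"

# As raw strings the addresses differ; without the session ID they are the same page.
print(first_visit == second_visit)
print(strip_session_id(first_visit) == strip_session_id(second_visit))
```

A search robot has no reliable way of knowing which parameter is a session ID, which is why it is safer not to put one in the address at all.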

Redirects

Redirects make site analysis more difficult for search robots, with resulting adverse effects on SEO. Do not use redirects unless there is a clear reason for doing so.

Hidden Text, A Deceptive SEO Method

The final two items in this list are not really mistakes but deliberate attempts to deceive search engines using illicit SEO methods. Hidden text (for example, text whose color matches the background color) allows site owners to cram a page with their desired keywords without affecting page logic or visual layout. Such text is invisible to human visitors but is seen by search robots. The use of such deceptive optimization methods may result in the site being banned, that is, excluded from the index (database) of the search engine.

One-Pixel Links, SEO Deception

This is another deceptive SEO technique. Search engines regard tiny, almost invisible graphic image links, just one pixel wide and high, as an attempt at deception, which may lead to a site ban.

External Ranking Factors

Why Inbound Links To Sites Are Taken Into Account

As you can see from the previous section, many factors influencing the ranking process are under the control of webmasters. If these were the only factors then it would be impossible for search engines to distinguish between a genuine high-quality document and a page created specifically to achieve high search ranking but containing no useful information. For this reason, an analysis of inbound links to the page being evaluated is one of the key factors in page ranking. This is the only factor that is not controlled by the site owner.

It makes sense to assume that interesting sites will have more inbound links, because owners of other sites on the Internet tend to publish links to a site if they think it is a worthwhile resource. The search engine uses this inbound link criterion in its evaluation of document significance.

Therefore, two main factors influence how pages are stored by the search engine and sorted for display in search results:

* Relevance, as described in the previous section on internal ranking factors.

* Number and quality of inbound links, also known as link citation, link popularity or citation index. This will be described in the next section.

Link Importance (Citation Index, Link Popularity)

You can easily see that simply counting the number of inbound links does not give us enough information to evaluate a site. It is obvious that a link from www.microsoft.com should mean much more than a link from some homepage like www.hostingcompany.com/~myhomepage.html. You have to take into account link importance as well as number of links.

Search engines use the notion of citation index to evaluate the number and quality of inbound links to a site. Citation index is a numeric estimate of the popularity of a resource expressed as an absolute value representing page importance. Each search engine uses its own algorithms to estimate a page citation index. As a rule, these values are not published.

As well as the absolute citation index value, a scaled citation index is sometimes used. This relative value indicates the popularity of a page relative to the popularity of other pages on the Internet. You will find a detailed description of citation indexes and the algorithms used for their estimation in the next sections.

Link Text (Anchor Text)

The link text of any inbound site link is vitally important in search result ranking. The anchor (or link) text is the text between the HTML tags <a> and </a> and is displayed as the text that you click in a browser to go to a new page. If the link text contains appropriate keywords, the search engine regards it as an additional and highly significant recommendation that the site actually contains valuable information relevant to the search query.

Relevance Of Referring Pages

As well as link text, search engines also take into account the overall information content of each referring page.

Example: Suppose we are using SEO to promote a car sales resource. In this case a link from a site about car repairs will have much more importance than a similar link from a site about gardening. The first link is published on a resource with a similar topic, so search engines will give it more weight.

Google PageRank: Theoretical Basics

Google was the first company to patent a system that takes inbound links into account. The algorithm was named PageRank. In this section, we describe this algorithm and how it can influence search result ranking.

PageRank is estimated separately for each web page and is determined by the PageRank (citation) of the other pages referring to it. It is a kind of “virtuous circle.” The main task is to find the criterion that determines page importance. In the case of PageRank, it is the likely frequency of visits to a page.

I shall now describe how a user's behavior when following links to surf the network is modeled. It is assumed that the user starts viewing sites from some random page and then follows links to other web resources. There is always a possibility that the user leaves a site without following any outbound link and starts viewing documents from another random page. The PageRank algorithm estimates the probability of this event as 0.15 at each step. The probability that the user continues surfing by following one of the links on the current page is therefore 0.85, with all links on the page assumed equally likely to be followed. If he or she continues surfing indefinitely, popular pages will be visited many more times than less popular ones.

The PageRank of a given web page is thus defined as the probability that the user is visiting that page. It follows that the sum of the probabilities for all existing web pages is exactly one, because the user is assumed to be visiting exactly one page at any given moment.

Since it is not always convenient to work with these probabilities, PageRank can be mathematically transformed into a more easily understood number for display. For instance, we are used to seeing a PageRank number between zero and ten on the Google Toolbar.

According to the ranking model described above:

* Each page on the Net (even one with no inbound links) initially has a PageRank greater than zero, although it will be very small. There is a tiny chance that a user may accidentally navigate to it.

* Each page that has outbound links distributes part of its PageRank to the pages it references. The PageRank passed to each linked-to page is inversely proportional to the total number of links on the linked-from page: the more links it has, the lower the PageRank allocated to each linked-to page.

* A “damping factor” is applied to this process so that the total distributed PageRank is reduced by 15%. This is equivalent to the probability, described above, that the user will not visit any of the linked-to pages but will navigate to an unrelated website.
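The random-surfer model above can be sketched in a few lines of Python. This is only an illustration of the textbook model with the 0.85 and 0.15 probabilities quoted above, not Google's actual implementation; the four-page web and the function name pagerank are made up for the example, and a page with no outbound links is assumed to send the surfer to a random page.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Estimate visit probabilities for the random-surfer model.

    links maps each page to the list of pages it links to. Every linked-to
    page must also appear as a key.
    """
    pages = list(links)
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}  # start from a uniform guess
    for _ in range(iterations):
        # With probability 0.15 the surfer jumps to a random page.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page, outbound in links.items():
            # Assumption: a page without links behaves as if it linked everywhere.
            targets = outbound if outbound else pages
            share = damping * rank[page] / len(targets)
            for target in targets:
                new_rank[target] += share
        rank = new_rank
    return rank


# A hypothetical four-page web: nobody links to D, everybody ends up at C.
web = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(web)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 4))
```

Page D has no inbound links yet still keeps a small, non-zero value, and the values over all pages add up to one, as the model requires.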

Let us now see how PageRank might influence the ranking of search results. We say “might” because the pure PageRank algorithm just described has not been used in the Google algorithm for quite a while now. We will discuss a more current and sophisticated version shortly. There is nothing difficult about the influence of PageRank: after the search engine finds a number of relevant documents (using internal text criteria), they can be sorted according to PageRank, since it is logical to suppose that a document with a larger number of high-quality inbound links contains the most valuable information.

Thus, the PageRank algorithm "pushes up" those documents that are also the most popular outside the search engine.

Google PageRank: Practical Use

Currently, PageRank is not used directly in the Google algorithm. This is to be expected, since pure PageRank characterizes only the number and quality of inbound links to a site and completely ignores the text of links and the information content of referring pages. These factors are important in page ranking and are taken into account in later versions of the algorithm. It is thought that the current Google ranking algorithm ranks pages according to thematic PageRank. In other words, it emphasizes the importance of links from pages whose content covers similar topics or themes. The exact details of this algorithm are known only to Google developers.

You can determine the PageRank value for any web page with the help of the Google Toolbar, which shows a PageRank value in the range from 0 to 10. Note that the Google Toolbar does not show the exact PageRank probability value, only the PageRank range a particular site is in. Each range (from 0 to 10) is defined according to a logarithmic scale.

Here is an example: each page has a real PageRank value known only to Google. To derive the PageRank range displayed on the Toolbar, they use a logarithmic scale as shown in this table:

Real PR          Toolbar PR
1-10             1
10-100           2
100-1,000        3
1,000-10,000     4

Etc.

This shows that the PageRank ranges displayed on the Google Toolbar are not all equal. It is easy, for example, to increase PageRank from one to two, while it is much more difficult to increase it from six to seven.
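The table can be expressed as a small Python function. The base of 10 and the "real PR" numbers come from the illustrative table above and are assumptions; the real scale is known only to Google. The function name toolbar_pr is hypothetical, and a value that falls exactly on a boundary is placed in the higher range.

```python
def toolbar_pr(real_pr, base=10, top=10):
    """Map a 'real' PageRank value onto the 0-10 Toolbar scale of the table."""
    if real_pr < 1:
        return 0
    bucket = 1
    upper = base  # upper bound of the current range
    # Each step up the scale needs `base` times more real PageRank.
    while real_pr >= upper and bucket < top:
        upper *= base
        bucket += 1
    return bucket


for value in (5, 50, 500, 5000):
    print(value, "->", toolbar_pr(value))
```

Going from Toolbar PR 1 to 2 takes a real value of 10, while going from 6 to 7 takes a real value of 1,000,000 on this scale, which is why each further step is so much harder.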

In practice, PageRank is mainly used for two purposes:

1. Quick check of a site's popularity. PageRank does not give exact information about referring pages, but it allows you to quickly and easily get a feel for a site's popularity level and to follow trends that may result from your SEO work. You can use the following rule-of-thumb measures for English-language sites: PR 4-5 is typical for most sites with average popularity. PR 6 indicates a very popular site, while PR 7 is almost unreachable for a regular webmaster. You should congratulate yourself if you manage to achieve it. PR 8, 9 and 10 can only be achieved by the sites of large companies such as Microsoft, Google, etc. PageRank is also useful when exchanging links and in similar situations. You can compare the quality of the pages offered in the exchange with pages from your own site to decide whether the exchange should be accepted.

2. Evaluation of the competitiveness level for a search query, which is a vital part of SEO work. Although PageRank is not used directly in the ranking algorithms, it allows you to indirectly evaluate relative site competitiveness for a particular query. For example, if the search engine displays sites with PageRank 6-7 in the top search results, a site with PageRank 4 is not likely to get to the top of the results list for the same search query.

It is important to recognize that the PageRank values displayed on the Google Toolbar are recalculated only occasionally (every few months), so the Toolbar displays somewhat outdated information. The Google search engine itself tracks changes in inbound links much faster than these changes are reflected on the Toolbar.

AltaVista

At a time when Google is pre-eminent in the search business, it is difficult to imagine that it was ever otherwise. Once upon a time AltaVista was the 800 lb gorilla in the search engine jungle. AltaVista was originally conceived to showcase Digital Equipment Corporation’s technology. In the spring of 1995 DEC launched the Alpha 8400, a high performance database server. The AltaVista spider first started indexing the Web on the 4th of July, 1995. A team led by Dr Louis Monier unveiled AltaVista on the 15th of December 1995. This search engine used several hundred robots running in parallel to index a much larger portion of the Web than its predecessors. The system was also fast. Monier’s team had growth in mind, and the system could be expanded to cope with increasing popularity. More than 300,000 people used AltaVista on the first day, and within 12 months the system was handling 19 million requests per day. For the unsophisticated Web of the mid-1990s it also delivered pretty good results based largely on on-page factors, especially for those searchers who mastered the advanced query interface. AltaVista added more services, in particular Babelfish. Named after a creature in the book The Hitchhiker’s Guide to the Galaxy, the Babelfish could automatically translate Web pages into a myriad of languages.

AltaVista’s subsequent decline, caused by a mixture of ambition and hubris, should serve as a lesson for anyone who bases their business around the results delivered by a single search engine.

At the start of 1998 Compaq, which had grown from a maker of luggable IBM PC clones to the world’s largest personal computer manufacturer, swallowed the once mighty DEC. It was the 2nd wave of the dot.com boom. Compaq spun out AltaVista with the idea of an Initial Public Offering (IPO). Other search engines such as Yahoo! and Excite had already gone down the same road and had brought their founders and investors vast wealth. However, the window of IPO opportunity was fast closing.

By 1999 search engines were viewed as passé. Portals were all the rage. A portal would act as a focus for a surfer’s activity on the Web and would provide the owner with multiple channels to market products. AltaVista recast itself as a portal and even started to offer Internet access. In the United Kingdom it went as far as to announce unmetered access, at a time when AOL, the biggest online provider, charged by the hour. Unfortunately for AltaVista the telecommunications market wasn’t ready. The botched announcement cost the UK boss, Andy Mitchell, his job and damaged AltaVista’s reputation.

The move to a portal also detracted from the core search business. Users had to cut through the cruft to get to search, then found the results cluttered with sponsored links. It then emerged that, with the notable exception of paid inclusion, the index hadn’t been updated in months. By the end of 1999 MSN Search dropped AltaVista as its provider. With stale content and untrustworthy results, users began to desert in droves to the simple, search-focussed interface of the new kid: Google. In February 2003 Overture acquired AltaVista for $140 million, a fraction of its $2.3 billion valuation at the height of the dot.com boom. Although it had survived the dot.bomb, this led some wags to dub the search engine AltaBusta.

AltaVista holds a number of search-related US patents, including methods for identifying duplicate content in indexes (5,970,497 and 6,138,113) and a method for spidering and indexing the Web (6,021,409).

Canonical URLs

The term canonical is derived from mathematics and, applied to a URL, means its simplest or standard form. It is widely used within SEO circles. For example, a home page could have multiple URLs:
• http://www.mysite.com/
• http://mysite.com/
• http://www.mysite.com/index.php

These are different URLs as far as a search engine is concerned, because technically a web server could return different content for each. However, many web servers are configured to return exactly the same content:

index.html

In this case we should pick one version of the URL, the canonical form, and use it both internally and externally. All other forms should use an HTTP 301 permanent redirect to send search engine robots (and users) to the correct version.
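As a rough sketch of that rule in Python, the functions below decide where a requested address should be redirected. The choice of www.mysite.com as the canonical host and the list of index file names are assumptions based on the example URLs above; on a real site this is normally done in the web server configuration rather than in application code.

```python
from urllib.parse import urlsplit, urlunsplit

CANONICAL_HOST = "www.mysite.com"          # assumed preferred host name
ALIAS_HOSTS = ("mysite.com",)              # other host names serving the same site
INDEX_NAMES = ("index.html", "index.php")  # assumed default document names


def canonical_url(url):
    """Return the canonical form of a URL on the example site."""
    parts = urlsplit(url)
    host = parts.netloc.lower()
    if host in ALIAS_HOSTS:
        host = CANONICAL_HOST
    path = parts.path or "/"
    for name in INDEX_NAMES:
        if path.endswith("/" + name):
            path = path[:-len(name)]  # drop the file name, keep the trailing slash
            break
    return urlunsplit((parts.scheme, host, path, parts.query, parts.fragment))


def redirect_for(url):
    """Return (301, target) if the URL is not canonical, otherwise None."""
    target = canonical_url(url)
    return None if target == url else (301, target)


for address in ("http://www.mysite.com/",
                "http://mysite.com/",
                "http://www.mysite.com/index.php"):
    print(address, "->", redirect_for(address))
```

Only the first address is served directly; the other two answer with a permanent redirect to it.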

Concept-Based Search

Concept-based search identifies and suggests alternative search queries that are closely related to the user’s search query. The idea is to focus a user’s search activity from the general, where lots of results are returned to the more specific with fewer, better matching results.

One way for search engines to implement concept-based search is to examine how closely the results match those obtained from other searches. If there is a close match, it is likely that the two search queries are related and the second query can be suggested as an alternative. Analyzing clicks can also reveal relationships. If two different queries both result in a large number of clicks on the same link, the queries may be considered related.

The popularity of searches can also be used to match independent queries. Microsoft has filed a patent application (Method for finding semantically related search engine queries; 20060248068; 2nd November, 2006) based on this concept. As an example, the change in popularity for searches about the “winter olympics” might match those for “curling” or “Bode Miller” (a downhill skier). Microsoft’s invention analyzes the density of a given query at various points in time, that is, how many searches there are for “winter olympics” compared to the overall number of searches. This removes global effects, such as a rise in the overall popularity of the search engine, from the results. A mathematical process called Fourier analysis can be used to make rapid comparisons between the various results.
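The density idea can be illustrated with a short Python sketch. Note that this is not the method in the patent application: it swaps the Fourier analysis for a plain correlation coefficient, and the weekly search counts are invented purely for the example.

```python
import math


def density(query_counts, total_counts):
    """Share of all searches that a query accounts for in each period."""
    return [q / t for q, t in zip(query_counts, total_counts)]


def correlation(xs, ys):
    """Pearson correlation between two equally long series."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a flat series carries no trend to compare
    return cov / math.sqrt(var_x * var_y)


# Invented weekly counts; total searches grow as the engine gets more popular.
total = [1000, 2000, 4000, 8000]
winter_olympics = [10, 40, 160, 80]
curling = [5, 22, 76, 44]
tax_forms = [40, 40, 40, 320]

olympics_density = density(winter_olympics, total)
print(round(correlation(olympics_density, density(curling, total)), 3))
print(round(correlation(olympics_density, density(tax_forms, total)), 3))
```

Dividing by the total first matters: raw counts for almost every query rise as the search engine grows, so only the densities show which queries peak together.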

Domain Parking

It is possible to make money from domain names without even setting up a website. Services such as http://domainspa.com/, http://namedrive.com/ and http://trafficparking.com/ let you park domains. If type-in traffic arrives at the parked domain, it is redirected to a template page. Advertisers that are part of the domain parking service’s network bid for keywords. If any of these match keywords in the parked domain, the advertisers are displayed on the template page in the form of either adverts or links. Advertisers are charged for click-through traffic and a percentage goes to the domain owner. Ads are usually geo-targeted by language, country or even city. At the same time it is usually possible to advertise the domain as “for sale”.

Although not an SEO technique in itself, domain parking can be an option for domains prior to building a website. With domains costing relatively little to register, domain parking can be a viable option for well-researched domain names. However, you share a lot of your revenue with the domain parking service and you are very unlikely to get repeat visits or links, unless the domain was previously owned. If the parked domain is getting significant traffic, you should consider developing a mini site.

Duplicate Content

A great deal of the Web is duplicate or near-duplicate content. Documents may be served in different formats (HTML, PDF, text) for different audiences. Documents may get mirrored to avoid delays or to provide fault tolerance. Content is syndicated and re-branded for different audiences and markets. Some websites aggregate or incorporate content from other sources on the Web, the most common example being RSS news feeds. Affiliate websites present identical storefronts with only cosmetic changes. Press releases are often duplicated by many media outlets. Businesses wishing to protect their trademarks often register different versions of their domain name, which all point to the same content but look like different websites from the point of view of a search engine. Content management systems, forums and blogs are often designed to let the same content be accessed through alternative URLs.

Finally there is a problem of plagiarism and copying from public domain sources, such as Wikipedia, the Open Directory Project and Project Gutenberg. This is often done to create large, content rich sites in order to manipulate rankings and generate revenue based on content targeted advertising.

When users submit queries to search engines, they do not want the results pages stuffed with many duplicate or near-duplicate pages. Indexing and filtering near-duplicate content also puts a load on search engines in terms of storage and computational resources. Algorithms already exist for efficiently classifying duplicate content. For example, a hash function can generate a numeric fingerprint representing a page’s content. Pages with identical fingerprints can be dropped from search results and excluded by robots when they next index pages.
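A minimal Python sketch of the fingerprint idea follows. The choice of SHA-1 and the normalization step (lower-casing and collapsing whitespace) are assumptions made for the example, not a description of what any particular search engine does; the page addresses are hypothetical.

```python
import hashlib


def fingerprint(text):
    """Return a numeric fingerprint of a page's text content."""
    # Assumed normalization: ignore case and differences in whitespace.
    normalized = " ".join(text.lower().split())
    digest = hashlib.sha1(normalized.encode("utf-8")).hexdigest()
    return int(digest, 16)


def drop_duplicates(pages):
    """Keep the first URL seen for each distinct fingerprint."""
    seen = set()
    unique = []
    for url, text in pages:
        mark = fingerprint(text)
        if mark not in seen:
            seen.add(mark)
            unique.append(url)
    return unique


pages = [
    ("http://www.mysite.com/press.html", "Acme launches a new widget"),
    ("http://mirror.example.com/press.html", "Acme  launches a new   WIDGET"),
    ("http://www.mysite.com/about.html", "About Acme"),
]
print(drop_duplicates(pages))
```

Changing a single word produces a completely different fingerprint, which is why this approach only catches exact copies and near duplicates need the methods described next.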

Near-duplicate pages are more complicated. Both AltaVista (now owned by Yahoo!; patents 5,970,497 and 6,138,113) and Google (6,615,209 and 6,658,423) have been awarded US patents that improve on existing methods for classifying duplicate content. The secret is to make comparisons quickly without doing some kind of word-by-word matching. One of AltaVista’s patents looks for similarities in the outbound links on a page. Google’s patents focus on generating hashes or fingerprints for parts of a page rather than the whole page. To you and me, neither of these ideas would seem that novel, and they probably took less than a wet Sunday afternoon in Menlo Park to conceive, but you have to remember that the US patent office also granted a patent for how to use a garden swing (US Patent No. 6,368,227). The patent land-grab is also about having some bargaining chips with other companies. Many of these patents would stand up about as well as a beach condo in a Florida hurricane if tested in court. However, they do have the effect of discouraging new entrants to the market.

Microsoft has also gotten into the game with a patent application (20060248066) for a “system and method for optimizing search results through equivalent results collapsing”. This application is based on a method known as shingleprints, which is the subject of a previous patent application (20050210043). A shingleprint reduces a document to a set of features that are representative of the document. For example, this could be all the proper nouns in the document. The number of features two documents have in common, divided by the total number of features, gives a number between 0 and 1. Essentially similar documents will score closer to 1.
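The ratio described above can be sketched in Python as follows. This is a simplified illustration rather than the method in the patent application: the set of lower-cased words stands in for whatever features a real system would extract, and the function names are hypothetical.

```python
def word_features(text):
    """Crude stand-in for feature extraction: the set of lower-cased words."""
    words = (word.strip(".,;:!?\"'()").lower() for word in text.split())
    return {word for word in words if word}


def similarity(doc_a, doc_b, extract=word_features):
    """Common features divided by total distinct features, from 0 to 1."""
    a, b = extract(doc_a), extract(doc_b)
    if not a and not b:
        return 1.0  # two empty documents are treated as identical
    return len(a & b) / len(a | b)


original = "The quick brown fox"
near_copy = "The quick red fox"
unrelated = "Lazy dogs sleep"

print(similarity(original, original))
print(similarity(original, near_copy))
print(similarity(original, unrelated))
```

A search engine could then treat any pair scoring above some chosen threshold as near duplicates; where to set that threshold is a judgment call that this sketch does not address.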

The patents from both Microsoft and Google are capable of identifying duplicate content that is either a subset of another document or substantially similar to it. Google suggests that the most relevant document is returned in the results pages. This could be the most recent (although to my mind most recent would imply a copy) or the document with the highest PageRank. Microsoft says that user clicks could be used to select the most popular version to return in future queries. Probably the biggest target in Google’s sights at the moment is the many duplicates of public domain content such as Wikipedia. Some webmasters have found that their original pages were dropped in favor of mirrors, so the system is not without flaws. The system should also foil domain spammers who register many different domain names under different keywords, all pointing to the same website. Google keeps many of what it considers duplicate pages in its secondary, supplemental results index.

The implication of all this from an optimization perspective is that search engines are getting increasingly sophisticated in identifying duplicate content. Building a site using duplicate content to inflate rankings will become increasingly difficult.

Mini site

A mini site is a website that is focussed on a single topic. The aim is not to build relationships with visitors or to provide wide coverage of a subject but to get users to take an action such as buying a product (typically an eBook), clicking on an affiliate link or signing up to a newsletter, or all three. This should be achieved without burning up a lot of resources such as bandwidth. You could also use a mini site to provide information about a current trend. For example, on 11 November 2006 Hershey announced a recall of their chocolate bars due to Salmonella. I checked, and the domains
hershey-recall.com
hersheyrecall.com
were available. According to Wordtracker this was one of the top searches in November. I would not expect much type-in traffic for this subject, but you could register the hyphenated version, slap up a mini site with facts you find on the recall, add some good inbound links so the site gets spidered quickly and hope to make some money from advertising before the trend dies. However, before you dash off to your nearest registrar: I checked on Google AdWords, and advertisers were only bidding around 20 cents for clicks on Salmonella, although they were paying a dollar on Hershey.

Because the site is focussed, visitor choice is streamlined, and product, affiliate and content-targeted advertising can be extremely relevant. Mini sites range from one to a number of pages; there is no hard and fast rule except that they cover a single topic. Concentrating on one subject opens up possibilities for search engine optimization and type-in traffic in terms of keywords in domains, URLs and on-page elements, as well as inter- and cross-linking. The generally small number of pages makes it easier to experiment with structure and layout. Mini sites will usually have a shallow structure, the famous “2-clicks” rule, which makes it easier for search engine robots to find and index the content.

Although some mini sites are extremely popular and earn a lot of money, most will bring in much more modest revenue. The aim should be to build the site rapidly and then do very little in the way of updates. A network of mini sites could earn more than a single site covering a lot of bases and be much lower maintenance. This obviously has an effect on the choice of subject matter. Spreading effort over a number of sites also spreads the risk if one of the sites suffers a drop in popularity due to increased competition or a change in the market.

Building a network of mini sites is not simply a case of taking your old macro site and splitting it up by subject area. The SEO benefit will be minimal, and there will be no increase in PageRank because you have the same amount of content. However, there is an argument for spinning off sections of large sites as mini sites. You can benefit from a keyword-rich domain name. If the site has useful content, people will link directly to the site using this domain, which has PageRank and anchor text benefits.

Because revenue, at least initially, can be very low, hosting costs need to be kept to a minimum. Some have had success building mini sites on free hosting packages, either using the free host’s domain name or by a redirect. However, hosting services frequently park their own advertising on these sites. The other solution is to run your own web server or virtual web server. Packages are not that expensive. This lets you direct as many domains as you like (and your web server can cope with) to a single Internet address. There is a caveat. Having a number of sites on a single address is not unusual, since this is how many hosting packages work, but a deeply interlinked network of sites on a single address may look like a link farm to a search engine. The aim of your mini sites is to garner inbound links.

Misspellings

To err is human, and all too common it would seem. For example Google ran a project to analyze misspellings of the first name of Britney Spears, a singer, over a three month period from information provided by their spelling correction system:
http://www.google.com/jobs/britney.html
Over 20% of the queries were incorrectly spelt, with the two most common errors, brittany and brittney, covering around 16% of searches. Assuming people don’t accept the correction suggested by Google or Yahoo!, that is an awful lot of searches going somewhere. Britney Spears may not be the easiest name to spell. There is an urban legend that her parents named her after the province of Brittany in Western France, where they had once taken a vacation, but didn’t know how to spell the name correctly.

Not all errors are misspellings. Some are good old fashioned typos; these commonly involve forgotten letters and reversed letter pairs in a word. Examples are traslation instead of translation and eihgt instead of eight.

Domain Names

There are two ways we can use misspellings. We can register domains for misspellings of popular keywords, brands and existing domain names in the hope of piggy-backing off other people’s SEO efforts. This borders on cyber-squatting and can have legal ramifications if the name is a trademark. Sometimes you can find that a link from an authority or high PageRank site will use the misspelled domain name buying you some instant credibility with search engines, until the webmaster notices his error. This also has the corollary that if we are going to invest money establishing a domain name we should also consider registering common misspellings in addition to registering in different countries to protect our brand. This can begin to cost quite a bit of money in registration fees and may only be worthwhile for well funded sites.

As an example, common misspellings of the domain: google.com are gogle.com, googel.com and goolge.com. All of these redirect to Google’s home page. Google missed a few though. Googl.com, gpogle.com and goolgle.com redirect to sites totally unrelated to Google and would appear to exist simply to profit from the Google brand.

Misspellings can also be incorporated into the content of pages. I recently noticed that a lot of searches leading to a site I manage used a common misspelling. The misspelling occurred in a couple of places in the text, and because it was a proprietary term it was not caught by the spell checker built into some search engines. I was about to fix the page when I decided to check the Wordtracker and Overture databases to see how many people searched on the correct and incorrect spellings. I was surprised to find that the misspelling was actually more popular. Checking Google and Yahoo!, it was clear that most websites spelt the term correctly, so my page had risen to number one in the search results because it was well optimized and there was very little competition.

Common Misspellings

It is fairly easy to come up with misspellings for your target keywords. Try reversing letter combinations, dropping letters or using letters close to each other on the keyboard. As an example, the letters R and T may get substituted. Phonetic spellings are also common, as Ms Spears demonstrates. Before creating pages full of misspellings you should check whether anyone actually searches for the terms; Overture and Wordtracker are your friends. See how much competition there is from orthographically challenged webmasters. Wikipedia, amongst other resources, has data on frequently misspelled words:
http://en.wikipedia.org/wiki/List_of_common_misspellings
Forums are also a source of common misspellings for specialized areas. For example in skiing many Anglophones spell the French winter resort of Courchevel as Courcheval.
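
The three typo patterns described above (missing letters, reversed letter pairs and neighbouring keys) can be sketched in a few lines of Python. This is only an illustration: the function name and the tiny keyboard-neighbour map are assumptions made for the example, and the candidates it produces still need checking against real search data before you use them.

```python
# Hypothetical helper: generate candidate misspellings of a keyword.
# ADJACENT_KEYS is a tiny illustrative subset, not a full keyboard layout.
ADJACENT_KEYS = {
    "r": "et",
    "t": "ry",
    "o": "ip",
    "e": "wr",
}


def misspellings(word):
    """Return a sorted list of simple typo variants of `word`."""
    variants = set()
    for i in range(len(word)):
        # Missing letter: drop the character at position i.
        variants.add(word[:i] + word[i + 1:])
        # Reversed letter pair: swap characters i and i + 1.
        if i + 1 < len(word):
            variants.add(word[:i] + word[i + 1] + word[i] + word[i + 2:])
        # Adjacent key: replace the character with a keyboard neighbour.
        for neighbour in ADJACENT_KEYS.get(word[i], ""):
            variants.add(word[:i] + neighbour + word[i + 1:])
    # Swapping two identical letters reproduces the original word.
    variants.discard(word)
    return sorted(variants)


if __name__ == "__main__":
    print(misspellings("google"))
```

Run on google, the list includes gogle, googel, goolge and gpogle, all of which appear in the domain name examples earlier.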

Incorporating misspellings into web pages is more challenging. Having a site full of spelling mistakes won’t impress visitors and potential advertisers much. It would be possible to use techniques such as entry pages which send the user to the correct version of the page. Search engines may see this as a Black Hat technique, although if the content of the real and entry pages is identical it is really providing a service to the end user. If your website is database driven you could take a list of common misspellings, such as effect spelt as affect, and automatically generate duplicate content pages substituting misspellings for the real words. With the introduction of spell checking of queries by search engines the effects will be somewhat diluted. Google for one also seems to be aware of common misspellings and language differences (e.g. color and colour) and indexes pages with alternate versions of words.

You could also use inbound-links with misspellings in the anchor text. Given the value of good inbound-links this technique should be used sparingly, although some kind of internal, search engine friendly site map with major misspellings could be an idea.

Automatic Correction of URLs

Not directly related to search engine optimization, but still worth handling, are mistyped Uniform Resource Locators (URLs): users mistype them directly into the browser address bar and webmasters make errors in links. A typo will generate an error on the web server, commonly known as a 404 Not Found error after its HTTP (HyperText Transfer Protocol) status code. It is a good idea to trap these errors and redirect the user either to the home page or to a site map so they can try to find the right link. As part of this process it is also possible to spell check the URL to try to locate the correct resource name. Filters such as mod_speling for the Apache webserver and URLSpellCheck for Microsoft’s IIS can provide a simple "did you mean X?" type of correction.
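
To give a feel for what a "did you mean X?" correction involves, here is a minimal sketch using the difflib module from Python's standard library. It is not how mod_speling or URLSpellCheck work internally; the list of known paths, the function name and the 0.8 similarity cutoff are all assumptions made for the example.

```python
import difflib

# Hypothetical list of valid paths on the site.
KNOWN_PATHS = ["/index.html", "/sitemap.html", "/products.html", "/contact.html"]


def suggest_path(requested, known_paths=KNOWN_PATHS, cutoff=0.8):
    """Return the known path closest to `requested`, or None if nothing is close."""
    matches = difflib.get_close_matches(requested, known_paths, n=1, cutoff=cutoff)
    return matches[0] if matches else None


if __name__ == "__main__":
    # A 404 handler could offer this suggestion, or fall back to the site map.
    print(suggest_path("/prodcuts.html"))
```

A 404 handler could call suggest_path with the requested path and fall back to the site map when it returns None.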

Spam

The term Spam comes from a Monty Python comedy sketch set in a trucker’s café. All the dishes on the menu come with spam - a type of tinned spiced ham. In the computer world spam is used to denote excessive repetition: multiple posts, usually commercial, to forums and unsolicited email are the two most frequent examples. For SEOers the term includes the excessive use of keywords, duplicate content, unnatural link structures and the posting of links to guestbooks and membership lists.

Blog comment, guestbook and member list spam

The blog or weblog phenomenon has done a great deal to revitalize interest in the Internet following the dot.com bust. By using a pre-packaged content management system (CMS), blogs enable even technical neophytes (aka newbies) to publish their words. Blogs range from personal diaries right through to online newspapers written by professional writers and journalists who enjoy the editorial freedom the medium offers.

Blogs also have two features which attract high search engine rankings. Bloggers link freely to other sites, creating dense inter-linking between highly themed content. Bloggers are also prolific, creating large quantities of fresh content. Blogs were designed from the start to be interactive. Readers can post comments and usually include links to other sources. These features mean that the most popular blogs can have PageRanks of 7.

The popularity of blogs was quickly spotted by people wishing to manipulate search engine results. They could boost the rankings of their own sites by using the comment, guestbook or member list features that are part of most blog software. Typing blog, weblog or guestbook into Google will bring up many high-ranking targets, especially when the query is combined with the inurl operator. Usually a spammer’s comment is completely irrelevant and is posted to multiple blogs as part of the same campaign:

Great article about global warming, why don’t you cool off a bit check out this page on hot babes?

Spammers even run automated scripts known as spambots. These attempt to post comment spam to sites running well known blog software. The aim is quantity rather than quality but it can mean that a single site gets hit by huge numbers of comments, often posted at the same time. Spammers are hard to trace as the spambots are frequently run on hijacked machines, collectively referred to as botnets.

Blog spam has the advantage of keyword rich anchor text coupled with highly ranked pages. The aim is not just to get click through traffic but to subvert the ranking algorithms used by search engines. The fresh content offered by blogs means they get frequent visits from search engines. A day spent spamming the most popular blogs can rapidly boost a website to the top of the search engine results pages. As is often the case on the Web, some of the most virulent spammers are pushing adult content sites and cover their tracks using anonymous proxies and compromised zombie hosts.

The popularity of this technique has spread rapidly and blog spammers have soon found themselves in an arms race. They have to visit the best blogs on an ever more frequent basis as other messages soon push their links off the coveted and highly ranked home page into search engine oblivion.

Needless to say blog owners are none too happy with this state of affairs. Some have removed comment pages or disabled the capability to post links. Others, wishing to preserve the spirit of the medium, spend hours moderating and removing a veritable tidal wave of spam. Technical solutions have been adopted, disguising outbound-links using JavaScript or rerouting links via a hidden page to stop anchor text and PageRank benefits from being transferred. Automated systems block links to known spammers or links using popular spam anchor text words.
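
As a rough illustration of blocking links by their anchor text, the sketch below flags a comment when the text of any link in it contains a word from a block list. The word list, the function name and the use of a regular expression instead of a proper HTML parser are simplifying assumptions for the example; real blog software is considerably more thorough.

```python
import re

# Illustrative block list; a real one would be much longer and kept up to date.
SPAM_ANCHOR_WORDS = {"pills", "poker", "babes"}

# Crude pattern for the text between <a ...> and </a>.
ANCHOR_RE = re.compile(r"<a\b[^>]*>(.*?)</a>", re.IGNORECASE | re.DOTALL)


def has_spam_link(comment_html):
    """Return True if any link in the comment uses a blocked anchor text word."""
    for anchor_text in ANCHOR_RE.findall(comment_html):
        words = set(re.findall(r"[a-z]+", anchor_text.lower()))
        if words & SPAM_ANCHOR_WORDS:
            return True
    return False


if __name__ == "__main__":
    print(has_spam_link('WOW! <a href="http://spam.example/">enlargement pills</a>'))
```

Only the anchor text is inspected, so a comment that merely mentions a blocked word without linking is left alone.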


Comments: I just wanted to say WOW! your site is really good and im proud to be one of your perm. surfers, be sure to my penis enlargement pills project site, dont laugh! here is my penis enlargement pills site: penis enlargement pills

Spam protection may have the effect of intensifying spam as spambots may take an ever more scattergun approach to posting. One theory on why spammers are so poor at grammar and spelling is that it helps trick automatic (Bayesian) spam filters. I suspect that after typing in 500 spam messages in a session they just get lazy.

Referrer spam

Referrer spam shows just how ingenious people can be in finding ways to manipulate search engine rankings. When someone clicks on a hyperlink their browser opens up the new web page. As part of the communications process (called HTTP, which stands for HyperText Transfer Protocol) their browser sends the web address (URL) of the page that contained the hyperlink. This address is called the Referrer. The web server hosting the new page will log this address, which is useful for traffic analysis, for example to judge the effectiveness of inbound-links.

Referrer spam has become an increasing problem. The spammer sends requests to a site with a forged Referrer naming the spammer’s own pages, so that any publicly viewable log report ends up linking back to them. Spammers have armies of zombie hosts or botnets at their command ready to launch a campaign. These zombies are computers on the Internet where the spammer has installed software by exploiting some security flaw in the Operating System, usually Windows. Often a scattergun approach is adopted: the spammer doesn’t know if the log file is indexed by search engines or not and hopes that at least a percentage of the spam will make it through. Webmasters running Apache can look at the mod_security package as a way to combat this kind of spam by blocking popular keywords in referrer pages; examples would be poker, Viagra and loans.
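
The keyword blocking idea can be sketched in Python as follows. This is not mod_security rule syntax, just an illustration of the check itself; the function name is hypothetical and the keyword list uses only the three examples mentioned above.

```python
from urllib.parse import urlparse

# Example keywords from the text; a real block list would be longer.
SPAM_KEYWORDS = ("poker", "viagra", "loans")


def is_spam_referrer(referrer):
    """Return True if the referrer's host or path contains a blocked keyword."""
    parsed = urlparse(referrer.lower())
    haystack = parsed.netloc + parsed.path
    return any(keyword in haystack for keyword in SPAM_KEYWORDS)


if __name__ == "__main__":
    print(is_spam_referrer("http://cheap-Poker-chips.example.com/"))
```

Requests that fail the check could be rejected or simply left out of the log reports. Simple substring matching will occasionally block a legitimate referrer, so the list needs some care.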

The technique is definitely frowned upon by search engines and can get you banned from their index. It manipulates search engine rankings by creating what are in effect fake inbound links. It subverts the HTTP Referrer mechanism. It clogs log files with bogus information and it consumes resources on the target web server.

Spammers may counter that it is up to server administrators to protect against this form of manipulation but that is like saying that homeowners must lock their doors or risk being robbed. There is usually no good reason to have log files publicly viewable. The log files should be password protected and preferably not visible to the Internet. Webmasters can also use a robots.txt file to tell search engines not to index the directory containing their logs and can turn off the referrer feature in their CMS. Log reports have many outbound-links on a single page so the overall benefit of each link is limited.
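
To see how a robots.txt rule keeps well-behaved spiders away from a log directory, the sketch below feeds a two-line robots.txt to the parser in Python's standard library and asks which paths may be fetched. The /logs/ directory name and the ExampleBot user agent are assumptions for the example. Note that robots.txt is only a request to crawlers; it does not replace password protection.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; the /logs/ directory name is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /logs/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def may_fetch(path, agent="ExampleBot"):
    """Return True if a robot honouring robots.txt may fetch `path`."""
    return parser.can_fetch(agent, path)


if __name__ == "__main__":
    print(may_fetch("/logs/referrers.html"), may_fetch("/index.html"))
```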

Keyword Spam

Keyword spam is the excessive repetition of keywords on a page. It is usually done using HTML elements that are indexed by search engines but are not visible to users, including Title, Meta, and Alt text. Spammers have found that they can disguise keywords in the contents of the page by making the text the same color as the background and tucking it away at the bottom of the page. However this still takes up space so may be noticed by competitors, particularly if they type CTRL-A to highlight all the text on a page. It is possible for search engines to detect text which is the same color as the background and this could flag that the page is using spammy techniques. Microsoft Search claims to automatically penalize such pages.

An extension of the hidden text idea is to hide the keyword spam using style-sheets (CSS). This gives the spammer great scope for stuffing keywords into important elements such as Headings without them being noticed. For example, a style rule can format all Heading 1 text as 1pt high white text.

Search Engines and Spam


Tackling spam in results has been one of the major efforts of search engines over the last couple of years. For example in November 2006 Microsoft filed patent application 20060248072 outlining a system and method for spam identification. The method takes a multi-pronged approach including identifying pages that look like spam and incorporating user feedback into search results. Microsoft says that its user base of searchers is the best way of identifying whether results are spam. It suggests that something as simple as a toolbar button could be used to flag a page as spam. To prevent a spammer marking competitor pages as spam the user would be tracked via their IP address or network to identify the type and quantity of sites being marked as spam and to compare this with other user input from different queries. An obvious weakness is that a botnet could be used to generate a large amount of feedback from random IP addresses.

Microsoft’s patent also suggests that user feedback would be combined with other algorithmic techniques. For example they could examine the percentage of content that is advertising (the so called MFA or Made for AdSense sites), whether there is keyword stuffing or if the site is part of a bad neighbourhood of spam related sites. It may also use intelligence from its content targeted advertising to identify the value of query terms, so called money words. These are terms where advertisers bid high rates such as “hotel” or “viagra”. Pages that satisfy these terms would have more aggressive spam filtering than non-commercial websites. Less aggressive filtering may also apply to sites that a user visits regularly and sites that they link to, so called authority sites. This data could be gathered through the user’s tool bar.

Stem Words

Stemming is the ability to automatically search for different forms of a keyword. If the word computers is queried, the search engine may also return pages containing computing, computed, computer, computation etc. All of these share the stem comput, from the root word compute.

Yahoo! and Google support stemming by default. Google introduced stemming around the time of the Florida update, leading some pundits to suggest this was the cause of the major upheavals that some highly optimized commercial sites suffered. Google’s stemming algorithm provides a wider choice of results where the keywords used are too restrictive. You will notice it most on queries with three or more keywords.

Stemming means that it is no longer necessary to target different forms of a single word in optimizations. However the specific keywords will rank better than their stemmed variants.
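
The idea can be illustrated with a deliberately naive suffix stripper. This is a sketch only: the suffix list and function name are assumptions, it is far cruder than published stemmers, and it makes no claim about how Google or Yahoo! implement stemming.

```python
# Suffixes are tried in order, so longer ones come before their shorter tails.
SUFFIXES = ("ations", "ation", "ing", "ers", "er", "ed", "es", "s", "e")


def naive_stem(word):
    """Strip the first matching suffix, keeping at least three letters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


if __name__ == "__main__":
    for form in ("computers", "computing", "computed", "computer", "computation"):
        print(form, "->", naive_stem(form))
```

Every form in the example reduces to the same string, which is what lets a search engine treat a query for one form as a match for the others.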

Stop Words

Stop Words are words that are so common that they have little relevance to the context of a web page. Examples would be adverbs, conjunctions and prepositions. Excluding stop words saves resources on search engines with little effect on the quality of results.

Common stop words include:
about, an, and, are, as, at, be, by, for, from, how,
in, is, it, of, or, that, the, this, to, was, what, when,
which, who, why, will, with
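
A minimal sketch of how a search engine, or an optimizer checking a key phrase, might drop these words. The function name is hypothetical and the set contains only the words listed above; real engines use their own lists.

```python
# The stop words listed above.
STOP_WORDS = {
    "about", "an", "and", "are", "as", "at", "be", "by", "for", "from", "how",
    "in", "is", "it", "of", "or", "that", "the", "this", "to", "was", "what",
    "when", "which", "who", "why", "will", "with",
}


def strip_stop_words(phrase):
    """Return the words of `phrase` that are not stop words, in order."""
    return [word for word in phrase.lower().split() if word not in STOP_WORDS]


if __name__ == "__main__":
    print(strip_stop_words("hotels in the French Alps"))
```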

Searchers can ask search engines to include stop words by using the ‘+’ symbol before the stop word or by putting the entire search phrase in quotes but such searches are the exception rather than the norm. They are often used where the searcher knows an exact phrase from a page. A good example is the start of Hamlet’s soliloquy, “To be or not to be, that is the question”. On a search engine that ignores stop words the results will be very different. Google started indexing stop words in 2005.

Except for these specific cases stop words may be avoided in phrases that target keywords. Examples would be in Anchor Text, Title elements and ALT (alternative) text in image links. This should not be taken to extremes; for example, headings should still include stop words where it helps the readability of the content.

Supplemental Results

Supplemental results were introduced to Google searches during 2003. Google says that the results are part of an auxiliary index with fewer constraints placed on pages. For example, pages may be orphans or doorway pages with no inbound links, empty pages, or pages whose content Google cannot index (the results relying on meta data). Results from the supplemental index are shown only where there are very few matches from the main index. It is like a final throw of the search dice to turn up some useful information. Supplemental cache results are frozen at the time they were indexed and will often be stale and may show information you no longer want to be public. Supplemental updates are infrequent and results can stick around for up to a year.

Cloaking

Cloaking describes the process of returning different content depending on whether the visitor is a search engine spider or end user. The content seen by the robot indexing the site can be highly optimized for that search engine and may even be completely different from the page the user will see. Search engines do not like this kind of manipulation of their results and cloaked pages can result in a ban. A software business selling spyware was kicked off both Google and Yahoo! when, it claims, its SEO company used cloaking to optimize its site. All the more reason to understand the techniques any paid SEO outfit may be thinking of using.

Competition

It is important to remember that you are not just trying to second guess how search engines work but are competing with thousands of other websites to get into the top ten of search engine results pages. In the SEO game there are only a few winners for a given set of keywords. Beating the competition is not a question of luck or chance but strategy. A campaign should be planned by selecting keywords, then studying the competition, analysing their strategy and then either doing it better or targeting points of weakness.

Content Management Systems (CMS)

Content Management Systems (CMS) are becoming increasingly popular for managing today's large and complex web sites. The actual content of the website is held in a database; MySQL is a very popular relational database choice as it is free and is often supplied as part of a web hosting package. The content is retrieved from the database and packaged into web pages by a software system running on the web server. The format of the pages can be highly customized by using templates and style sheets (CSS). From the user viewpoint the site looks like normal web pages.

Content Management Systems let website owners concentrate on the information in the site without worrying about details such as creating pages in the Hyper Text Markup Language (HTML). Many large websites, particularly anything interactive such as news sites, blogs and forums, are driven by CMS. Complex sites that have specific requirements will write their own software but many off-the-shelf packages, both free and commercial, are available. These are often written in the PHP or Perl programming languages. As with MySQL, these two computer languages are free and frequently come as an integral part of web hosting packages. Search engines can spider pages generated by Perl, PHP, ASP.Net, ColdFusion, Python and Java, amongst other languages, provided the pages are reachable. Movable Type and pMachine are examples of popular Content Management Systems.
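
The database plus template arrangement can be sketched in a few lines of Python. To keep the example self-contained it uses the sqlite3 module and an in-memory database in place of MySQL. The table layout, template and function names are all hypothetical and not taken from any particular CMS.

```python
import sqlite3
from html import escape
from string import Template

# Hypothetical page template; a real CMS would load this from a file.
PAGE_TEMPLATE = Template(
    "<html><head><title>$title</title></head>"
    "<body><h1>$title</h1><p>$body</p></body></html>"
)


def build_demo_db():
    """Create an in-memory database holding one example article."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE articles (slug TEXT PRIMARY KEY, title TEXT, body TEXT)"
    )
    conn.execute(
        "INSERT INTO articles VALUES (?, ?, ?)",
        ("stop-words", "Stop Words", "Words so common they carry little meaning."),
    )
    return conn


def render_page(conn, slug):
    """Fetch an article by slug and package it into an HTML page."""
    row = conn.execute(
        "SELECT title, body FROM articles WHERE slug = ?", (slug,)
    ).fetchone()
    if row is None:
        return None  # the web server would answer 404 here
    title, body = row
    return PAGE_TEMPLATE.substitute(title=escape(title), body=escape(body))


if __name__ == "__main__":
    print(render_page(build_demo_db(), "stop-words"))
```

Because the title is filled in by the template, the template is also the natural place to apply search engine friendly customizations once, for every page on the site.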

Just as the standard look and feel of a CMS will not suit most websites, most CMS packages are also poorly optimized for search engines straight out of the box. The focus of CMS designers is information delivery to human users, not search engine robots. There are a number of customizations that make Content Management Systems more search engine friendly.

Tuesday 4 December 2007

Improve Link Popularity in 10 Easy Steps

You've spent the last few months optimizing your web site. You did your homework and learned all about optimizing techniques for your web page. Your relevant keywords are prominently placed in all the right places on your pages. Yet your site still isn't ranking the way you want. What do you do?
It's time to improve your link popularity!

Why bother with link building? Link popularity and link quality are very important because every major search engine now considers them as a part of their ranking algorithms. If you don't have links, you won't rank well for competitive keywords.
If your page includes all the important on-the-page criteria and scores well with Page Primer, it's time to focus on your links. Good inbound links can move your page up the ranking ladder and act as new entry points to your site. But how does your site get those coveted inbound links we hear so much about?

First off, let's make sure we understand the basics. Link popularity is the measure of inbound links to your web site. Link analysis evaluates which sites are linking to you and the link text itself.

Fortunately, there are a lot of ways to improve your link quality and popularity, which will give you a boost in the rankings. Here are some guidelines to help you set up your own linking campaign:

1. Prepare your site first

Before you start your link building campaign, take time to get your site in shape. Make sure your site looks professional, has good content and is easy to navigate. Validate your HTML code and check your links with a tool like HTML Toolbox. If a potential linker goes to your site and finds broken pages, they are not going to want to link to you.

In addition, directories have gone on record saying they may exclude sites with broken links and page errors. Directories want only professional looking sites in their databases, so do your homework on your site before you start promoting it and your linking campaign will be more effective.

2. Budget time for link building

Don't expect to grow your link popularity overnight. Budget time every week to work on link building. If you force yourself to spend a couple hours a week on link building, it will become part of your routine. Pick one day a week and set aside time as your "link building time." If you don't make it a priority, it won't get done.

Link building is an incremental activity. Over time these one or two new links start adding up until they are hundreds or even thousands of links.


3. Establish realistic link goals


Don't expect to see instant results. Link building is difficult, frustrating and time intensive. Convincing another web site to link to you can be exasperating. If you get one good quality link a month you're doing better than the majority of sites out there.

Patience and creativity are key to link building. Track your progress so you know who you've asked already. It could be embarrassing to ask a site for a link if they've already given you one.

If a company initially declines your link request, wait a while and then ask again. Their company focus may change over time. A "no" today may change into a "yes" 6 - 9 months later.

4. Develop internal management support

If you're link building in-house, build support from your company's internal management for your link building. This usually means educating management about the benefits of link building.

Link popularity is unique to the search engine industry - it's not taught in graduate schools (not yet, anyway). Sit down with your management and explain the concept behind link building - don't assume they understand it or have even heard of the term. In fact, most won't have a clue what you're talking about.

Explain link building in terms they will understand and in ways that will get their attention, such as describing the relationship of link building and increased revenue. Talking about making more money usually gets management's attention.

Why worry about management support? You will need it to provide the time and money you need to get into search engines or directories.

5. Link popularity is all about quality

Be selective about the sites from which you request links. Search engines use sophisticated rules when judging the importance of a link, and the popularity of the site linking to you is a key criterion. One link from CNet is worth far more than a link from a personal web site.

And don't even think of using a link farm! Link farms are sites that exist solely to link to other web sites. Link farms are a blatant attempt to inflate your link popularity, and search engines take a dim view of them. Google in particular has been known to ban sites found using a link farm.

Try to identify non-competitive sites in the same field as your site. Links from sites that are related to your area carry more weight than links from Aunt Sue's favorite horse site. That doesn't mean you should refuse a link from Aunt Sue, just be aware it won't help you much in link quality terms. On the other hand, links from sites within your industry are strong endorsements for your site.


6. Develop a relationship with a site


Before you ask for the link, get to know the web site. Establish yourself as a real human first. That way, when you ask for a link, it's harder for them to say no.
Impersonal broadcast emails asking for links are spam. Sure, it's easier, but it will only result in making another company mad at you. Spam link requests do not work and waste everyone's time. Don't do it!

7. Provide the linking code

Make it easy for other sites to link to you. Send the prospective linker the exact HTML code you want in the link and suggest which page you want the link from. This ensures the right words are used in the link and reduces the burden in setting up the link. Everybody on the Internet is pressed for time and if you don't make it "drop-in simple" by giving them the exact HTML, you've made their job too hard. Make it easy and your success rate will go up.
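
If you send out many link requests, a tiny helper can produce the "drop-in simple" HTML for you. The sketch below is an illustration only; the function name, URL and anchor text are made up, and the escaping simply guards against characters such as & breaking the markup.

```python
from html import escape


def link_snippet(url, anchor_text):
    """Return ready-to-paste HTML for a text link."""
    return '<a href="{}">{}</a>'.format(escape(url, quote=True), escape(anchor_text))


if __name__ == "__main__":
    print(link_snippet("http://www.example.com/", "Ski holidays & chalets"))
```

Choose the anchor text with the same care as your page titles, since the words used in the link are part of what link analysis evaluates.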

8. Get directory listings

Jumpstart your link campaign by getting directory links first. This is especially important if you have a new site or a site with no inbound links. A shortage of inbound links puts your site at a severe disadvantage because link analysis is an important part of every search engine's ranking algorithm.
The way to overcome this disadvantage is to get a few quality links. A good way to start is to get listed in as many directories as you can. There are many directories out there, and the more you can get into the better.
A few to target include:

• Open Directory
• Yahoo
• LookSmart
• Zeal.com
• Joeant.com
• Business.com

Be aware that most of these directories require you to pay for a listing. It's worth the expense.

9. Consider bartering for links

It's a good idea to have something to offer in return for a link. Many sites won't link to you unless you link back to them or otherwise make it worth their while. Create a Resources or Partner page that allows you to have a place from which you can easily link to them.

You might also offer to work a barter arrangement with them. If you have a popular site with their target market, they might consider free advertisements in exchange for a link. If the link is of great value to you, be prepared to give something back.

10. Link building alternatives

If time constraints keep you from link building, consider outsourcing your link popularity work. Link building is undoubtedly the most time consuming part of search engine optimization. You may find it is not cost effective to do it in house. That doesn't mean you shouldn't do it, it just means you hire someone else to do it for you.

Many top SEO firms have turned to outsourcing this function. For example: Jill Whalen of highrankings.com uses Debra O'Neil-Mastaler's link building firm.
Outsourcing to a reputable link building firm ensures good links and could be a more efficient model for you if you are already time limited.

One word of caution if you do choose to hire a company specializing in link building: make sure any firm you hire follows good link building practices. Ask them to describe the process they use to request links. Make sure they follow a personalized approach, and don't simply spam sites with requests for links.

If they refuse to discuss their link building methods you can assume they use impersonal widespread email drops or link farms - that's spam. They may call it a fancy name, but if the process involves sending out large numbers of form emails, it's still spam and will only set your campaign backwards and injure your company's professional reputation. Go find a different company or develop your links in house.

Just do it!

Link popularity is important and the link building process needs to be given high priority. Link analysis is only going to get more important to search engines, not less. Search engines have found it highly resistant to manipulation and a legitimate way to measure the importance of a site. Since link building takes time, the sooner you start the better.

So think of link building as a long-term investment in your site. Put in a little time now to improve your linking and ensure a good search engine ranking in the future.

Guide Of SEO

Search Engine Optimization

Search Engine Optimization (SEO) as a subset of search engine marketing seeks to improve the number and quality of visitors to a web site from "natural" ("organic" or "algorithmic") search results. The quality of visitor traffic can be measured by how often a visitor using a specific keyword leads to a desired conversion action, such as making a purchase or requesting further information. In effect, SEO is marketing by appealing first to machine algorithms to increase search engine relevance and secondly to human visitors. The term SEO can also refer to "search engine optimizers", an industry of consultants who carry out optimization projects on behalf of clients.

Search engine optimization is available as a stand-alone service or as a part of a larger marketing campaign. Because SEO often requires making changes to the source code of a site, it is often most effective when incorporated into the initial development and design of a site, leading to the use of the term "Search Engine Friendly" to describe designs, menus, Content management systems and shopping carts that can be optimized easily and effectively.

A range of strategies and techniques are employed in SEO, including changes to a site's code (referred to as "on page factors") and getting links from other sites (referred to as "off page factors"). These techniques include two broad categories: techniques that search engines recommend as part of good design, and those techniques that search engines do not approve of and attempt to minimize the effect of, referred to as spamdexing. Some industry commentators classify these methods, and the practitioners who utilize them, as either "white hat SEO", or "black hat SEO".[1] Other SEOs reject the black and white hat dichotomy as an over-simplification.

History Of Search Engines

In the early days of Internet development, its users were a privileged minority and the amount of available information was relatively small. Access was mainly restricted to employees of various universities and laboratories who used it to access scientific information. In those days, the problem of finding information on the Internet was not nearly as critical as it is now.

Site directories were one of the first methods used to facilitate access to information resources on the network. Links to these resources were grouped by topic. Yahoo, opened in April 1994, was the first project of this kind. As the number of sites in the Yahoo directory inexorably increased, the developers of Yahoo made the directory searchable. Of course, it was not a search engine in its true form because searching was limited to those resources whose listings were put into the directory. It did not actively seek out resources and the concept of seo was yet to arrive.

Such link directories have been used extensively in the past, but nowadays they have lost much of their popularity. The reason is simple – even modern directories with lots of resources only provide information on a tiny fraction of the Internet. For example, the largest directory on the network is currently DMOZ (or Open Directory Project). It contains information on about five million resources. Compare this with the Google search engine database containing more than eight billion documents.

The WebCrawler project started in 1994 and was the first full-featured search engine. The Lycos and AltaVista search engines appeared in 1995 and for many years AltaVista was the major player in this field.

In 1997 Sergey Brin and Larry Page created Google as a research project at Stanford University. Google is now the most popular search engine in the world.

Currently, there are three leading international search engines – Google, Yahoo and MSN Search. They each have their own databases and search algorithms. Many other search engines use results originating from these three major search engines and the same seo expertise can be applied to all of them. For example, the AOL search engine (search.aol.com) uses the Google database while AltaVista, Lycos and AllTheWeb all use the Yahoo database.

Common Search Engine Principles

To understand seo you need to be aware of the architecture of search engines. They all contain the following main components:

Spider - a browser-like program that downloads web pages.

Crawler – a program that automatically follows all of the links on each web page.

Indexer - a program that analyzes web pages downloaded by the spider and the crawler.

Database – storage for downloaded and processed pages.

Results engine – extracts search results from the database.

Web server – a server that is responsible for interaction between the user and other search engine components.

Specific implementations of search mechanisms may differ. For example, the Spider+Crawler+Indexer component group might be implemented as a single program that downloads web pages, analyzes them and then uses their links to find new resources. However, the components listed are inherent to all search engines and the seo principles are the same.

Spider - This program downloads web pages just like a web browser. The difference is that a browser displays the information presented on each page (text, graphics, etc.) while a spider does not have any visual components and works directly with the underlying HTML code of the page. You may already know that there is an option in standard web browsers to view source HTML code.

Crawler - This program finds all links on each page. Its task is to determine where the spider should go either by evaluating the links or according to a predefined list of addresses. The crawler follows these links and tries to find documents not already known to the search engine.

Indexer - This component parses each page and analyzes the various elements, such as text, headers, structural or stylistic features, special HTML tags, etc.

Database - This is the storage area for the data that the search engine downloads and analyzes. Sometimes it is called the index of the search engine.

Results Engine - The results engine ranks pages. It determines which pages best match a user's query and in what order the pages should be listed. This is done according to the ranking algorithms of the search engine. It follows that page rank is a valuable and interesting property, and it is what any seo specialist is most interested in when trying to improve a site's search results. In this article, we will discuss the seo factors that influence page rank in some detail.

Web server - The search engine web server usually contains an HTML page with an input field where the user can specify the search query he or she is interested in. The web server is also responsible for displaying search results to the user in the form of an HTML page.
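As a rough illustration of how the spider, crawler and indexer fit together, here is a minimal Python sketch. It is not how any real search engine is implemented. The PAGES dictionary is a hypothetical in-memory stand-in for the web (so nothing is downloaded), and the names PageParser, crawl and search are invented for this example.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin

# Hypothetical in-memory "web" so the sketch runs without network access.
PAGES = {
    "http://example.com/": '<h1>Seo guide</h1><a href="/tools.html">seo software</a>',
    "http://example.com/tools.html": '<p>Seo software list</p><a href="/">home</a>',
}


class PageParser(HTMLParser):
    """Collects link targets and visible words from one HTML page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self.words = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_data(self, data):
        self.words.extend(data.lower().split())


def crawl(start_url):
    """Spider, crawler and indexer in one loop; returns word -> set of URLs."""
    index = {}
    queue = deque([start_url])
    seen = {start_url}
    while queue:
        url = queue.popleft()
        html = PAGES.get(url)  # spider: "download" the page source
        if html is None:
            continue
        parser = PageParser()
        parser.feed(html)
        parser.close()
        for word in parser.words:  # indexer: record which page uses which word
            index.setdefault(word, set()).add(url)
        for href in parser.links:  # crawler: queue pages not seen before
            target = urljoin(url, href)
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return index


def search(index, word):
    """Results engine, without any ranking: pages that contain the word."""
    return sorted(index.get(word.lower(), set()))
```

Calling crawl("http://example.com/") and then search(index, "software") lists both sample pages. A real results engine would also rank them, which this sketch deliberately leaves out.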

Internal Ranking Factors

Several factors influence the position of a site in the search results. They can be divided into external and internal ranking factors. Internal ranking factors are those that are controlled by seo-aware website owners (text, layout, etc.) and will be described next.

Web page layout factors relevant to seo

Amount Of Text On A Page

A page consisting of just a few sentences is less likely to get to the top of a search engine list. Search engines favor sites that have a high information content. Generally, you should try to increase the text content of your site in the interest of seo. The optimum page size is 500-3000 words (or 2000 to 20,000 characters).

Search engine visibility also grows with the amount of page text, because a longer page is more likely to match occasional and accidental search queries. This factor sometimes brings in a large number of visitors.

Number Of Keywords On A Page

Keywords must be used at least three to four times in the page text. The upper limit depends on the overall page size – the larger the page, the more keyword repetitions can be made. Keyword phrases (word combinations consisting of several keywords) are worth a separate mention. The best seo results are observed when a keyword phrase is used several times in the text with all keywords in the phrase arranged in exactly the same order. In addition, all of the words from the phrase should be used separately several times in the remaining text. There should also be some difference (dispersion) in the number of entries for each of these repeated words.

Let us take an example. Suppose we optimize a page for the phrase “seo software” (one of our seo keywords for this site). It would be good to use the phrase “seo software” in the text 10 times, the word “seo” 7 times elsewhere in the text and the word “software” 5 times. The numbers here are for illustration only, but they show the general seo idea quite well.
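To check a page against numbers like these, you could count the exact phrase and the separate uses of each word. The following Python sketch is only one way to do it. The function name count_keyword_usage and the simple tokenizer (lowercase letters, digits and apostrophes) are assumptions made for this example.

```python
import re


def count_keyword_usage(text, phrase):
    """Count exact phrase matches and uses of each phrase word elsewhere."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    phrase_words = phrase.lower().split()
    if not phrase_words:
        raise ValueError("phrase must contain at least one word")
    n = len(phrase_words)
    phrase_count = 0
    standalone = {word: 0 for word in phrase_words}
    i = 0
    while i < len(words):
        if words[i:i + n] == phrase_words:
            # Whole phrase in the same order: count it once and skip past it.
            phrase_count += 1
            i += n
        else:
            if words[i] in standalone:
                standalone[words[i]] += 1
            i += 1
    return phrase_count, standalone
```

Words that are part of a full phrase match are not counted again as separate uses, which matches the "elsewhere in the text" wording of the example.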

Keyword Density And Seo

Keyword page density is a measure of the relative frequency of the word in the text expressed as a percentage. For example, if a specific word is used 5 times on a page containing 100 words, the keyword density is 5%. If the density of a keyword is too low, the search engine will not pay much attention to it. If the density is too high, the search engine may activate its spam filter. If this happens, the page will be penalized and its position in search listings will be deliberately lowered.

The optimum value for keyword density is 5-7%. In the case of keyword phrases, you should calculate the total density of each of the individual keywords comprising the phrases to make sure it is within the specified limits. In practice, a keyword density of more than 7-8% does not seem to have any negative seo consequences. However, it is not necessary and can reduce the legibility of the content from a user’s viewpoint.
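The density calculation described above is easy to script. This is a minimal Python sketch. The function names and the tokenizer are assumptions for illustration, and the 5-7% bounds are simply the values quoted in this article, not limits published by any search engine.

```python
import re


def keyword_density(text, keyword):
    """Return how often keyword occurs, as a percentage of all words."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    if not words:
        return 0.0
    return 100.0 * words.count(keyword.lower()) / len(words)


def classify_density(density, low=5.0, high=7.0):
    """Compare a density with the 5-7% range suggested in the text."""
    if density < low:
        return "low"
    if density > high:
        return "high"
    return "ok"
```

For a keyword phrase, call keyword_density once per word in the phrase and add the results to get the total density mentioned above.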

Location Of Keywords On A Page

A very short rule for seo experts – the closer a keyword or keyword phrase is to the beginning of a document, the more significant it becomes for the search engine.

Text Format And Seo

Search engines pay special attention to page text that is highlighted or given special formatting. We recommend:

* Use keywords in headings. Headings are text highlighted with the «H» HTML tags. The «h1» and «h2» tags are most effective. Currently, the use of CSS allows you to redefine the appearance of text highlighted with these tags. This means that «H» tags are used less than they once were, but they are still very important in seo work.

* Highlight keywords with bold fonts. Do not highlight the entire text! Just highlight each keyword two or three times on the page. Use the «strong» tag for highlighting instead of the more traditional «B» bold tag.

TITLE Tag

This is one of the most important tags for search engines. Make use of this fact in your seo work. Keywords must be used in the TITLE tag. The link to your site that is normally displayed in search results will contain text derived from the TITLE tag. It functions as a sort of virtual business card for your pages. Often, the TITLE tag text is the first information about your website that the user sees. This is why it should not only contain keywords, but also be informative and attractive. You want the searcher to be tempted to click on your listed link and navigate to your website. As a rule, 50-80 characters from the TITLE tag are displayed in search results and so you should limit the size of the title to this length.
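A quick way to audit this is to pull the TITLE text out of a page and check its length and keywords. The Python sketch below uses only the standard library. The names TitleParser and check_title are made up for this example, and the 80-character limit is the upper figure quoted above, not a documented search engine limit.

```python
from html.parser import HTMLParser


class TitleParser(HTMLParser):
    """Collects the text found inside the TITLE tag."""

    def __init__(self):
        super().__init__()
        self.in_title = False
        self.title = ""

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self.in_title = True

    def handle_endtag(self, tag):
        if tag == "title":
            self.in_title = False

    def handle_data(self, data):
        if self.in_title:
            self.title += data


def check_title(html, keyword, max_len=80):
    """Report the title text, its length and whether it has the keyword."""
    parser = TitleParser()
    parser.feed(html)
    parser.close()
    title = " ".join(parser.title.split())  # collapse runs of whitespace
    return {
        "title": title,
        "length": len(title),
        "fits": len(title) <= max_len,
        "has_keyword": keyword.lower() in title.lower(),
    }
```

The keyword test is a plain substring match, so it says nothing about how attractive the title is to a human reader.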

Keywords In Links

A simple seo rule – use keywords in the text of page links that refer to other pages on your site and to any external Internet resources. Keywords in such links can slightly enhance page rank.

Alt Attributes In Images

Any page image has a special optional attribute known as “alternative text.” It is specified using the ALT attribute of the HTML IMG tag. This text will be displayed if the browser fails to download the image or if the browser image display is disabled. Search engines save the value of image ALT attributes when they parse (index) pages, but do not use it to rank search results.

Currently, the Google search engine takes into account text in the ALT attributes of those images that are links to other pages. The ALT attributes of other images are ignored. There is no information regarding other search engines, but we can assume that the situation is similar. We consider that keywords can and should be used in ALT attributes, but this practice is not vital for seo purposes.
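If you want to find images that lack ALT text, and in particular images that act as links, a small script can list them. This Python sketch is one possible approach. ImageAltParser and images_missing_alt are hypothetical names, and an empty ALT value is treated the same as a missing one.

```python
from html.parser import HTMLParser


class ImageAltParser(HTMLParser):
    """Records every image with its ALT text and whether it sits in a link."""

    def __init__(self):
        super().__init__()
        self.link_depth = 0
        self.images = []  # tuples of (src, alt, is_link)

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self.link_depth += 1
        elif tag == "img":
            attributes = dict(attrs)
            self.images.append(
                (attributes.get("src"), attributes.get("alt"), self.link_depth > 0)
            )

    def handle_endtag(self, tag):
        if tag == "a" and self.link_depth > 0:
            self.link_depth -= 1


def images_missing_alt(html):
    """Return (src, is_link) for each image with no usable ALT text."""
    parser = ImageAltParser()
    parser.feed(html)
    parser.close()
    return [
        (src, is_link)
        for src, alt, is_link in parser.images
        if not (alt or "").strip()
    ]
```

Entries where is_link is True are the ones to fix first, since those are the images whose ALT text is said above to be taken into account.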

Description Meta Tag

This is used to specify page descriptions. It does not influence the seo ranking process but it is very important. A lot of search engines (including the largest one – Google) display information from this tag in their search results if this tag is present on a page and if its content matches the content of the page and the search query.

Experience has shown that a high position in search results does not always guarantee large numbers of visitors. For example, if your competitors' search result description is more attractive than the one for your site then search engine users may choose their resource instead of yours. That is why it is important that your Description Meta tag text be brief, but informative and attractive. It must also contain keywords appropriate to the page.

Keywords Meta Tag

This Meta tag was initially used to specify keywords for pages but it is hardly ever used by search engines now. It is often ignored in seo projects. However, it would be advisable to specify this tag just in case there is a revival in its use. The following rule must be observed for this tag: only keywords actually used in the page text must be added to it.
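The rule that Meta keywords must also appear in the page text can be checked automatically. Here is a minimal Python sketch under some stated assumptions: keywords are comma-separated, the text inside TITLE, SCRIPT and STYLE is not counted as page text, and matching is a simple case-insensitive substring test. The names MetaKeywordsParser and unused_meta_keywords are invented for the example.

```python
from html.parser import HTMLParser

# Tags whose content is not treated as visible page text in this sketch.
SKIPPED_TAGS = {"script", "style", "title"}


class MetaKeywordsParser(HTMLParser):
    """Collects the Keywords Meta tag values and the visible page text."""

    def __init__(self):
        super().__init__()
        self.keywords = []
        self.text_parts = []
        self.skip_depth = 0

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if tag == "meta" and (attributes.get("name") or "").lower() == "keywords":
            content = attributes.get("content") or ""
            self.keywords = [k.strip().lower() for k in content.split(",") if k.strip()]
        elif tag in SKIPPED_TAGS:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIPPED_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        if self.skip_depth == 0:
            self.text_parts.append(data)


def unused_meta_keywords(html):
    """Return Meta keywords that never occur in the page text."""
    parser = MetaKeywordsParser()
    parser.feed(html)
    parser.close()
    text = " ".join(" ".join(parser.text_parts).lower().split())
    return [keyword for keyword in parser.keywords if keyword not in text]
```

Because of the substring test, a short keyword can match inside a longer word, so treat the result as a hint rather than a verdict.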

Site Structure

Number Of Pages

The general seo rule is: the more, the better. Increasing the number of pages on your website increases the visibility of the site to search engines. Also, if new information is being constantly added to the site, search engines consider this as development and expansion of the site. This may give additional advantages in ranking. You should periodically publish more information on your site – news, press releases, articles, useful tips, etc.

Navigation Menu

As a rule, any site has a navigation menu. Use keywords in menu links; this will give additional seo significance to the pages to which the links refer.

Keywords In Page Names

Some seo experts consider that using keywords in the name of an HTML page file may have a positive effect on its search result position.

Avoid Subdirectories

If there are not too many pages on your site (up to a couple of dozen), it is best to place them all in the root directory of your site. Search engines consider such pages to be more important than ones in subdirectories.

One Page – One Keyword Phrase

For maximum seo effect, try to optimize each page for its own keyword phrase. Sometimes you can choose two or three related phrases, but you should certainly not try to optimize a page for 5-10 phrases at once. Such phrases would probably produce no effect on page rank.

Seo And The Main Page

Optimize the main page of your site (domain name, index.html) for word combinations that are most important. This page is most likely to get to the top of search engine lists. My seo observations suggest that the main page may account for up to 30-40% of the total search traffic for some sites.

Search Engines Work

Search engines are the key to finding specific information on the vast expanse of the World Wide Web. Without sophisticated search engines, it would be virtually impossible to locate anything on the Web without knowing a specific URL. But do you know how search engines work? And do you know what makes some search engines more effective than others?

When people use the term search engine in relation to the Web, they are usually referring to the actual search forms that search through databases of HTML documents, initially gathered by a robot.

There are basically three types of search engines: those that are powered by robots (called crawlers, ants or spiders), those that are powered by human submissions, and those that are a hybrid of the two.

Crawler-based search engines are those that use automated software agents (called crawlers) that visit a Web site, read the information on the actual site, read the site's meta tags and also follow the links that the site connects to, performing indexing on all linked Web sites as well. The crawler returns all that information back to a central depository, where the data is indexed. The crawler will periodically return to the sites to check for any information that has changed. The frequency with which this happens is determined by the administrators of the search engine.

Human-powered search engines rely on humans to submit information that is subsequently indexed and cataloged. Only information that is submitted is put into the index.

SEO steps and Tools used

1. Analyze our web site

2. Find keywords relevant to our website

3. Find best keywords

Tool: (https://adwords.google.com/select/KeywordToolExternal )
http://www.digitalpoint.com/tools/suggestion/?keywords=visit+panama+city&country=us
Keyword gold
Good Keyword for Key phrase
http://www.websitepromotionsoft.com/bestpromotionkeyword.html

4. Use the keyword in Head tag, Meta tag, Meta description, alt etc.

5. Mockup validation OR HTML validation

Tool: (http://validator.w3.org/ )

6. Correct the errors

7. Put keyword in the URLs and folders

8. Find the Indirect competitors

Tool: Arelis – Affiliates, Find new Link partners, and Automated Link exchange

9. Link Exchange with Indirect competitors

Tool: Arelis, Promosoft

10. Check for broken links

Tool: Arelis, www.webmaster-toolkit.com, www.seochat.com broken link checker sites

11. PAD file creation and Submission to Directories (Optional)

Tool: PADgen,
Promosoft - PAD file submission Tool

12. Submit the web site to all major search engines (only once a month or every 48 days)

Tool:

i. Google - http://www.google.com/addurl/?continue=/addurl

ii. Yahoo - http://search.yahoo.com/info/submit.html

iii. MSN - http://search.msn.com/docs/submit.aspx

iv. Altavista - http://addurl.altavista.com/addurl/default

v. Alltheweb- http://addurl.alltheweb.com/help/webmaster/submit_site


13. PPC management

Tool:

i. Google - https://adwords.google.com/select/Login

ii. Yahoo - https://secure.overture.com/s/dtc/center/?mkt=us

iii. MSN - https://adcenter.microsoft.com/Default.aspx

iv. Miva - https://adcenter.us.miva.com/login.aspx

14. Analyze and generate the report

Tool: http://www.google.com/analytics

www.indextools.com
http://freestats.com/
http://www.webstat.com/
http://os.hitslink.com/
http://www.websidestory.com/
http://www.opentracker.net/index.jsp

15. Work on the weaker areas of our website

Tool: http://www.seochat.com/ and similar tools available on the web

Axandra’s IBP – Full SEO lifecycle management tool

http://www.googlerankings.com/

http://www.yahoosearchrankings.com/index.php
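Step 10 (checking for broken links) can also be done with a short script instead of a dedicated tool. The Python sketch below is only an illustration and is not how the tools listed above work. LinkCollector, http_status and find_broken_links are hypothetical names, and the status_of parameter is an assumption added so the check can be tried with canned results before running it against a live site.

```python
from html.parser import HTMLParser
from urllib.error import HTTPError
from urllib.parse import urljoin
from urllib.request import Request, urlopen


class LinkCollector(HTMLParser):
    """Collects the href value of every link on a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def http_status(url, timeout=10):
    """Return the HTTP status code for url, or None if it is unreachable."""
    try:
        with urlopen(Request(url, method="HEAD"), timeout=timeout) as response:
            return response.status
    except HTTPError as error:
        return error.code
    except OSError:  # covers URLError, connection errors and timeouts
        return None


def find_broken_links(page_url, html, status_of=http_status):
    """Return (url, status) for links that fail or answer with 4xx/5xx."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    broken = []
    for href in collector.hrefs:
        target = urljoin(page_url, href)
        if not target.startswith(("http://", "https://")):
            continue  # skip mailto:, javascript: and similar links
        status = status_of(target)
        if status is None or status >= 400:
            broken.append((target, status))
    return broken
```

Some servers reject HEAD requests, so a link reported here may still work in a browser; check doubtful results by hand.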

Link Building Guide

Introduction

I have provided many thousands of links to my SEO clients, established a number of PR6 sites myself, and almost won the biggest SEO contest ever by means of sheer link power.

The purpose of this article is to help you in your link building using 5 steps.

1. Study and understand the guidelines for a natural simulation.
2. Study and understand your link profile.
3. Study and understand the effective ways to get links.
4. Work out your own link building program.
5. Get the links!

This article is based on my opinion and what I believe (use at your own risk).

Guidelines for a Natural Simulation

The majority of the backlinks you acquire should:

• Have a varied anchor text

Target your keywords but make very sure to vary the anchor a lot. Even a few “click here” anchors are good to get.

• Come from related pages

This is probably the most important, especially for English web sites. It is commonly believed that links from related pages carry more “weight” (not PR).

• Come from different locations on the linking pages.

Don’t have all the backlinks from footers or any other specific place. Have them on the top of the page, inside the body text, navigation, footer etc.

• Be placed with a gradual natural increase

Don’t place hundreds or thousands of links to a new site in the first days … I have tested that and gotten the site banned in Google. You need to place the links gradually, and the quantity should relate to how many links the site is usually getting. So if a site usually gets 10 new links per week then don’t place 100 suddenly one day.

• Be placed on pages with a varied PR value

In my opinion not more than 15-20% of your backlinks to a site should be from PR 5+ pages.

• Come from a good neighbourhood

Don’t have a majority of your backlinks from adult, pharmacy, and poker/casino sites. Such sites are known to spam and I am pretty sure Google frowns upon them.

• Come from different C-class IPs

Google sees this. Have your links coming from totally different sites.

• Come from both old trusted as well as new sites

It is also good to mix this up.

• Come from reciprocal linking

Reciprocal links are still good but don’t have them as a majority of your backlinks. 3-way-linking is a good alternative but don’t use any pattern.

• Point to internal pages as well as your home page

Never underestimate backlinks pointing to your internal pages. As you have your internal pages SEO’d as well, get backlinks to these too, with varied anchors. It has been speculated that the value of these has increased during the BigDaddy update.

• Come from non-directories

A select few directories such as ODP and Yahoo carry trust in Google from the editorial vote. However, many directories do not give that, and I even suspect that having a majority of your backlinks from 200 free general directories can do harm to your link profile. They can still be good but should just be something extra to the links you already have.

• Not be temporary

The age of your backlinks is important. It has even been speculated that this is a factor of the Google Sandbox filter. Strive to get permanent links. When renting links, rent for as long as possible.

• Not have paid linking footprints around them

Words such as sponsors, advertisement and the like are spotted by Google and are devalued. If the majority of your backlinks come from such links it could maybe even do you harm.

• Not come from pages that link to bad neighbourhoods

You should not place your links on pages that are in a bad neighbourhood, but you should also not place them on sites that themselves link to these kinds of sites. Google uses advanced relation and co-citation systems in their algo.

• Come from pages with the same language

This seems obvious but is very important for showing up properly in the Google country-specific searches. From my recent observations, this seems to be the most important factor in determining the language of the site for Google (other factors are IP, domain name TLD and the actual written language of the site).
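Several of the guidelines above (varied anchor text, different C-class IPs, a limited share of PR 5+ pages) can be checked with a few lines of code once you have a list of your backlinks. The Python sketch below is an illustration only. The Backlink record and the function names are hypothetical, IPv4 addresses are assumed, and the 20% limit is just the upper figure from my opinion above, not a known Google threshold.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass
class Backlink:
    anchor: str    # anchor text of the link
    ip: str        # IPv4 address of the linking host
    pagerank: int  # PR value of the linking page


def c_class(ip):
    """Return the first three octets of an IPv4 address."""
    return ".".join(ip.split(".")[:3])


def profile_report(links, high_pr=5, max_high_pr_share=0.20):
    """Summarize anchor variety, C-class spread and the share of high PR links."""
    if not links:
        raise ValueError("no backlinks given")
    total = len(links)
    anchors = Counter(link.anchor.lower() for link in links)
    top_anchor, top_count = anchors.most_common(1)[0]
    high_share = sum(link.pagerank >= high_pr for link in links) / total
    return {
        "top_anchor": top_anchor,
        "top_anchor_share": top_count / total,
        "distinct_c_classes": len({c_class(link.ip) for link in links}),
        "high_pr_share": high_share,
        "too_many_high_pr": high_share > max_high_pr_share,
    }
```

A high top_anchor_share or a low distinct_c_classes count points at the kind of pattern the guidelines warn about.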

Your Link Profile

Study the above Guidelines for a Natural Simulation and write down the points on which your site is in danger of getting flagged as unnatural by search engines. Those are the practices you should not continue with.

Effective ways to get links

• Link Baiting

This is the single most important method. It has mainly to do with good content that gets links the natural way. To keep things simple in this guide I have decided that everything that makes people link to you because of the content on the page/site falls under this category of link baiting. You need to read this post I wrote on link baiting.

Comments: Pure white hat and recommended, but sadly not very effective unless the content is something really major.

• Directory submissions/Link Building using directories

There are specialized SEOs and Link Builders that are doing directory submissions. There are 4 kinds of directories here: pay-for-inclusion, reciprocal required, free and niche directories. And there are of course specialists in each one of these. There are almost one thousand free SEO friendly general directories, a list of these and a submission service can be seen here. As for placement in reciprocal directories there are both brokers and non-brokers.

Comments: Cheap way to get many low quality links. Can take months before effect is seen. Deep links usually not possible. New edit and comment on this: since BigDaddy, Google has been hitting and deindexing internal pages on a lot of directories, but those pages still show a PR value even though they are deindexed. Keep this in mind and run a site: command or type the URL of the internal page in Google to see if the directory/page has been hit.

• Link Exchange, 2 and 3 ways

The good old way. I link to you and you link to me. This is more useful if you have 30 or more sites as you can get and provide relevant links. Rule of thumb: you contact people, not the opposite. There are also Reciprocal Link Programs but I don’t recommend that.

Comments: Don’t overdo it …

• High PR-subscriptions (Grey Hat?)

You can rent PR 5-9 text links at various places for various prices. Cheap places I have had some very good success with: here, here and here.

Comments: Don’t get too many high PR links too fast, get relevant if possible.

• Link-Vault

I am myself getting 200 permanent links a day from this program to 40 URLs, all with varied anchor. The links you get from this program are mostly footer links and other low quality links but the program is extremely powerful. Do not, I repeat, do not display Link-Vault ads on your most valuable web sites - that is a red flag, especially since BigDaddy.

Comments: Only negative is that the links are mostly footer and the relevancy is not that good, other than that it is great.

• DP Coop Ads (Grey Hat)

This program gives you a lot of rotating temporary backlinks from mostly footers of sites. I have been using this program successfully to boost SERP ranks in Google, especially at the V7N SEO contest. This program is extremely powerful as well as dangerous and can have your site banned or harmed if you are not careful. My rule of thumb: maximum 5K weight on a single ad. At one moment I had 120K weight and I was getting many many thousands of links to more than 40 ads that I had.

Comments: Remember that these are all footer, irrelevant, temporary and with fast growth.

• Linking from your own web sites

This is an excellent way if you have many relevant pages in the same niche.

Comments: Don’t interlink all, keep sites on different C-class IPs and avoid patterns.

• Pay bloggers

You can post on the business section of webmaster forums stating that you will pay to get the blogger to blog about and link to your site. Example.

Comment: Good to get the word out. The blog post will over time be buried down in the blog structure.

• Article submission

Write an article with links in the article and/or in the resource section to your site. Or pay someone to write these for you. Use a specialist (example) to have these submitted to article directories such as this one.

Comment: Long term strategy, takes time to see effects. Can be good for traffic.

• Search the SERPS and make offers. (Jim Boykin’s speciality)

Google Base Optimization

Google Base happens to be the only shopping search engine which allows merchants to define their own attributes (optional fields). It was formerly known as Froogle, and if you have any product that can be sold online, you have no excuse not to use this service. It's free, and Google is trying to promote its usage.

If you want better results on the shopping engines, try optimizing your feed - it’s no longer good enough to just post all your products and expect your listings to be found.

There is a new opportunity that can be had for online retailers, and it's known as the "Onebox" result. To learn more about this great way to drive targeted traffic, visit OneBoxer, a fantastic blog that covers all angles better than I can.

Recently, Google added more required attributes at a generic level, and there may be more if you are within a specific product category:
brand
condition
description
expiration_date
id
image_link
link
price
product_type
title
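Before uploading a feed, it can help to check every item against the attribute list above. The Python sketch below does only that. It is not Google's own validation; the attribute names are copied from the list above, while REQUIRED_ATTRIBUTES, missing_attributes and the dictionary layout of an item are assumptions made for this example.

```python
# Generic required attributes, as listed above.
REQUIRED_ATTRIBUTES = [
    "brand",
    "condition",
    "description",
    "expiration_date",
    "id",
    "image_link",
    "link",
    "price",
    "product_type",
    "title",
]


def missing_attributes(item):
    """Return the required attributes that are absent or blank in one item."""
    missing = []
    for name in REQUIRED_ATTRIBUTES:
        value = item.get(name)
        if value is None or not str(value).strip():
            missing.append(name)
    return missing
```

An empty result means the item carries every generic attribute; category-specific attributes would still have to be checked separately.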

Internet Marketing

When you break down the actual components of Internet marketing, it all comes down to variants of coding a website to meet the requirements of the search engines.

While there are many ways to grow your presence online, one of the truest ways to make sure your efforts will continue to work for you years from now is to be sure your efforts are properly optimized.

From press releases to blogs to articles to whitepapers, anything that appears online should be optimized, since you never know how someone will first come to learn of your product or service.

Videos uploaded to YouTube or Google Video should be optimized. Podcasts can be optimized. Your RSS feed can be optimized.

Getting the point?

Since the engines are always looking for the most relevant content, if your content is properly optimized, it has a better chance of staying ranked for long periods of time.

Anything that appears online or exists in digital format on the web can be optimized. It only takes a little bit of extra effort, but the payoff is well worth it.

Search Engine Optimization & Marketing

Because of the frequency of active blogs (daily - or at least 5 times a week) the search engines have put high weight on blogs that are focused and tend to stay on topic.

It's the freshness of content (in a perfect world) that a search engine is craving and rewarding in blogs. The ideal situation is that the blog would provide tiny snippets of information that over time build up to a greater whole (almost like a blook).

But the most overlooked element of a blog is that once you've established a frequent pattern of posting and you have the Googlebot coming to your site every few days, you can then use that to link to other sites, sub-domains or any deep links that you need to get crawled. While most blog postings may not have direct links to any sites in particular, you could always format your blog with some permanent links on the side.

Another mistake made by the amateur blogger is that they don't realize that you can optimize your blog. True, most blogs only have a few areas, namely the "home" page and the "archive" page. But depending on the tool being used to post the blog, there are some places where you can take advantage seo-wise.

The title of your blog should be thought of as a headline - grab the reader's attention - but also be sure that your blog title is also what gets archived - that way your titles can become search queries as well.

While the main objective of a blog should be to get your message out, don't forget to take a little time for SEO and you should see your efforts pay off in the search engine results pages.

The Benefits of Black Hat Techniques

Yes, you read that correctly. There are benefits of "black hat" techniques used to manipulate rankings in the search engines.

If you work within the search engine optimization industry, or you are a webmaster, you have most likely already encountered some black hat techniques in play. As someone who does SEO for a living, it's in my best interests to know what the techniques are in order to combat them.

Problem is, to a black hat, it's all worth it. While the majority of us sweat and labor and obsess on getting top rankings in Google and other search engines, the black hatters sit back and laugh at our efforts.

The elite hatters will "churn-and-burn" websites, but in the three months of that website's existence, the black hatter can easily make over 100k from each site.

That's why they continue to employ their techniques, all the while raking in the cash. This is also the same reason why spammers continue to do what they do - out of the millions of emails sent, there are enough people clicking on the advertisement to make them money.

So, now that we know the motivation of the black hat, where is the benefit?

The black hatters keep the search engines on their toes by constantly pushing the boundaries of manipulating data and information on the web. If the Googles of the world could just sit back and never apply any new thoughts to their algorithms, the "Average Joe" would never have a shot at getting a top spot in the search engines.

All of the top spots would be owned by the players with the most money to spend.

But, since the black hats continually look for loopholes in the algorithms, it forces the search engines to continually re-think their approach on how to rank the content found on the Internet.

While I admit that it can be frustrating to see a site above yours in the search engine results that is employing these techniques, you must keep in mind that you have a chance to achieve top rankings because of their techniques.

While the black hatter relies on automation to obtain their goals, the rest of us have to do it with effort. That is the kryptonite of the black hatters. Eventually, your site should have better, original content and more relevant links pointing to your site, and that will drive the black hatters away.

The goal of the black hat is to make money as quickly as possible. The elite black hatters are smart enough to know when they have been beaten in a specific area, and will simply turn their focus to another topic.

You can beat the black hat. But it will take time, and it will take resolve. Consider it a battle - because they do.
