Skip to main content

Do Search Engines Care About Valid HTML?

Importance of valid HTML

Like most web developers, Ive heard a lot about the importance of valid HTML recently. I've read about how it makes it easier for people with disabilities to access your site, how its more stable for browsers, and how it will make your site easier to be indexed by the search engines. So when I set out to design my most recent site, I made sure that I validated each and every page of the site. But then I got to thinking while it may make my site easier to index, does that mean that it will improve my search engine rankings?

HTML Validator Extension

To get a feel for how much value the search engines place on being HTML validated, I decided to do a little experiment. I started by downloading the handy Firefox HTML Validator Extension (http://users.skynet.be/mgueury/mozilla/) that shows in the corner of the browser whether or not the current page you are on is valid html. It shows a green check when the page is valid, an exclamation point when there are warnings, and a red x when there are serious errors. I decided to use Yahoo! Buzz Index to determine the top 5 most searched terms for the day, which happened to be World Cup 2006, WWE, FIFA, Shakira, and Paris Hilton. I then searched each term in the big three search engines (Google, Yahoo!, and MSN) and checked the top 10 results for each with the validator. That gave me 150 of the most important data points on the web for that day.

Analyzing results of HTML validation

The results were particularly shocking to me only 7 of the 150 resulting pages had valid html (4.7%). 97 of the 150 had warnings (64.7%) while 46 of the 150 received the red x (30.7%). The results were pretty much independent of search engine or term. Google had only 4 out of 50 results validate (8%), MSN had 3 of 50 (6%), and Yahoo! had none. The term with the most valid results was Paris Hilton which turned up 3 of the 7 valid pages. Now I realize that this isn't a completely exhaustive study, but it at least shows that valid html doesn't seem to be much of a factor for the top searches on the top search engines.

What I was surprised about

Even more surprising was that none of the three search engines home pages validated! How important is valid HTML if Google, Yahoo!, and MSN don't even practice it themselves? It should be noted, however, that MSNs results page was valid html. Yahoos homepage had 154 warnings, MSNs had 65, and Googles had 22. Googles search results page not only didn't validate, it had 6 errors!
How many of the top sites have valid HTML?
In purusing the web I also noticed that immensely popular sites like ESPN.com, IMDB, and MySpace don't validate. So what is one to conclude from all of this? Its reasonable to conclude that at this time valid HTML isn't going to help you improve your search position. If it has any impact on results, it is minimal compared to other factors. The other reasons to use valid html are strong and I would still recommend all developers begin validating their sites; just don't expect that doing it will catapult you up the search rankings right now.

Comments

Popular posts from this blog

How to Create Chat Bubbles in CSS

At first, you need to write the appropriate HTML markup. Refer to the examples below: INPUT HTML <div class="talk-bubble"> <div class="talktext"> <p>CSS Talk Bubble configured by classes. Defaults to square shape, no triangle. Height is auto-adjusting to the height of the text.</p> </div> </div> <div class="talk-bubble tri-right left-top"> <div class="talktext"> <p>This one adds a right triangle on the left, flush at the top by using .tri-right and .left-top to specify the location.</p> </div> </div> <div class="talk-bubble tri-right left-in"> <div class="talktext"> <p>This talk-bubble uses .left-in class to show a triangle on the left slightly indented. Still a blocky square.</p> </div> </div> <div class="talk-bubble tri-right round btm-left"> <div class="talktext"> <p>And now

How to Install and Configure Drupal

Drupal is a feature rich Content Management System (CMS) which allows webmasters to run dynamic website without PHP knowledge. Publishers can add website content through admin interface. Thanks many modules it is possible to install a lot of additional features like spell check, displaying banners or AdSense ads, chat, mailing list, e-commerce modules etc. Drupal CMS was written in PHP and can be downloaded free of charge. In this article I will explain how to install this script and describe the configuration basics. Web hosting requirements for Drupal CMS Before you start the installation process, ensure that your web host offers PHP support, cron jobs and MySQL database. Drupal requires PHP version 4.3.3 or higher (Drupal 4.5 and earlier versions will not run on PHP 5), PHP XML extension and MySQL v3.23.17 or higher. If you are not sure which version of PHP and MySQL your web hosting provider currently uses, contact their help desk and ask them first. How to install Drupal CMS

Advantages of Using PHP

History of PHP PHP (it originally was abbreviation of for Personal Home Page) was first written by Rasmus Lerdorf . These were Perl scripts to track users of his Web pages. On communication and suggestions from other people he rewrote it as a scripting language and added support for forms. As its popularity grew, a core group of developers created an API for it and turned it into PHP3. Presently we have version 5. (PHP5) and its growth is likely to continue. Advantages of PHP PHP will run on all platforms, including most UNIXs, Windows (95/98/NT/2000) and Macs. As this uses the same code base, all scripts will run identically on all the platforms. PHP is similar to C. So anyone who has experience with a C-style language will easily learnd PHP. In C-style languages we can also include Javascript and Java. In fact, much of PHP’s functionality is provided by wrappers around the underlying system calls (such as fread() and strlen()) so C programmers will immediately feel at home.