Search Engine Robots - How They Work, What They Do (Part I)


Automated search engine robots, sometimes called "spiders" or "crawlers", are the seekers of web pages. How do they work? What is it they really do? Why are they important?

You'd think, with all the fuss about indexing web pages to add to search engine databases, that robots would be great and powerful beings. Wrong. Search engine robots have only basic functionality like that of early browsers in terms of what they can understand in a web page. Like early browsers, robots just can't do certain things. Robots don't understand frames, Flash movies, images or JavaScript. They can't enter password-protected areas and they can't click all those buttons you have on your website. They can be stopped cold while indexing a dynamically generated URL and slowed to a stop with JavaScript navigation.

How Do Search Engine Robots Work?

Think of search engine robots as automated data retrieval programs, traveling the web to find information and links.

When you submit a web page to a search engine at the "Submit a URL" page, the new URL is added to the robot's queue of websites to visit on its next foray out onto the web. Even if you don't directly submit a page, many robots will find your site because of links from other sites that point back to yours. This is one of the reasons why it is important to build your link popularity and to get links from other topical sites back to yours.

When arriving at your website, the automated robots first check to see if you have a robots.txt file. This file is used to tell robots which areas of your site are off-limits to them. Typically these may be directories containing only binaries or other files the robot doesn't need to concern itself with.
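To make the robots.txt check concrete, here is a minimal sketch using Python's standard urllib.robotparser module. The rules, the "ExampleBot" name and the example.com URLs are made up for illustration; a real robot would download the file from your site rather than read it from a string.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents. A real robot would fetch this file
# from the root of your site before requesting any other page.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /downloads/binaries/
"""


def build_rules(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text into a rule set that can be queried."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


rules = build_rules(ROBOTS_TXT)

# "ExampleBot" is a made-up robot name used only for this sketch.
print(rules.can_fetch("ExampleBot", "http://www.example.com/index.html"))         # True
print(rules.can_fetch("ExampleBot", "http://www.example.com/cgi-bin/search.cgi"))  # False
```

A well-behaved robot would run a check like this before every request and skip any URL that comes back False.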

Robots collect links from each page they visit and later follow those links through to other pages. The entire World Wide Web is made up of links, the original idea being that you could follow links from one place to another. This is how robots get around.
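The collect-then-follow loop can be sketched in a few lines of Python. This is only an illustration, not how any particular search engine does it: the three-page "site" is a hypothetical in-memory dictionary standing in for the live web, and the names LinkCollector and crawl are invented for the example.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag found on a page."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)


# Hypothetical pages keyed by URL; a real robot would download each one.
PAGES = {
    "http://www.example.com/": '<a href="/about.html">About</a> <a href="/products.html">Products</a>',
    "http://www.example.com/about.html": '<a href="/">Home</a>',
    "http://www.example.com/products.html": '<a href="/about.html">About</a>',
}


def crawl(start_url, pages):
    """Visit pages breadth-first, queueing each newly discovered link once."""
    queue = deque([start_url])
    seen = {start_url}
    visited = []
    while queue:
        url = queue.popleft()
        html = pages.get(url)
        if html is None:
            continue  # page not available; nothing to read
        visited.append(url)
        collector = LinkCollector()
        collector.feed(html)
        for href in collector.links:
            absolute = urljoin(url, href)  # resolve relative links
            if absolute not in seen:
                seen.add(absolute)
                queue.append(absolute)
    return visited


print(crawl("http://www.example.com/", PAGES))
```

The queue plays the same role as the robot's list of websites to visit: a submitted URL goes in at the start, and every link found along the way is added behind it.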

The "smarts" about indexing pages online come from the search engine engineers, who devise the methods used to evaluate the information the search engine robots retrieve. Once that information is in the search engine database, it is available to anyone querying the search engine. When a user enters a query, the search engine runs a number of quick calculations so that it can present the set of results most relevant to that query.

You can see which pages on your site the search engine robots have visited by looking at your server logs or the results from your log statistics program. Identifying the robots will show you when they visited your website, which pages they visited and how often they visit. Some robots are readily identifiable by their user agent names, like Google's "Googlebot"; others are a bit more obscure, like Inktomi's "Slurp". Still other robots may be listed in your logs that you cannot readily identify; some of them may even appear to be human-powered browsers.
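If you'd rather dig through the raw logs yourself, a short script can pull out the robot visits. The sketch below assumes your server writes the common "combined" log format, where the user agent is the last quoted field; the sample log lines are invented for illustration, and only the names "Googlebot" and "Slurp" come from this article.

```python
import re
from collections import defaultdict

# Substrings to look for in the user agent field; extend as needed.
ROBOT_NAMES = ["Googlebot", "Slurp"]

# Assumed "combined" log format: ip, two unused fields, [time],
# "request", status, size, "referrer", "user agent".
LOG_PATTERN = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" \d{3} \S+ '
    r'"[^"]*" "(?P<agent>[^"]*)"$'
)

# Invented sample lines; the IP addresses are documentation-only ranges.
SAMPLE_LOG = [
    '192.0.2.10 - - [01/Mar/2005:10:00:00 +0000] "GET /index.html HTTP/1.0" 200 5120 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '192.0.2.10 - - [01/Mar/2005:10:00:05 +0000] "GET /products.html HTTP/1.0" 200 7300 "-" "Googlebot/2.1 (+http://www.google.com/bot.html)"',
    '198.51.100.7 - - [02/Mar/2005:03:15:42 +0000] "GET /index.html HTTP/1.0" 200 5120 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp)"',
    '203.0.113.25 - - [02/Mar/2005:09:30:00 +0000] "GET /index.html HTTP/1.1" 200 5120 "-" "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"',
]


def robot_visits(log_lines, robot_names=ROBOT_NAMES):
    """Return {robot name: [(time, path), ...]} for recognised robots."""
    visits = defaultdict(list)
    for line in log_lines:
        match = LOG_PATTERN.match(line)
        if not match:
            continue  # skip lines that are not in the assumed format
        agent = match.group("agent").lower()
        for name in robot_names:
            if name.lower() in agent:
                visits[name].append((match.group("time"), match.group("path")))
    return dict(visits)


for robot, hits in robot_visits(SAMPLE_LOG).items():
    print(robot, len(hits), "visits:", [path for _, path in hits])
```

Matching on the user agent only catches robots that announce themselves. Robots that pose as ordinary browsers will slip past a check like this.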

Along with identifying individual robots and counting the number of their visits, the statistics can also show you aggressive bandwidth-grabbing robots or robots you may not want visiting your website. In the resources section at the end of this article, you will find sites that list names and IP addresses of search engine robots to help you identify them.

How Do They Read The Pages On Your Website?

When the search engine robot visits your page, it looks at the visible text on the page, the content of the various tags in your page's source code (title tag, meta tags, etc.), and the hyperlinks on your page. From the words and the links that the robot finds, the search engine decides what your page is about. There are many factors used to figure out what "matters" and each search engine has its own algorithm in order to evaluate and process the information. Depending on how the robot is set up through the search engine, the information is indexed and then delivered to the search engine's database.
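Here is a rough idea of what a robot "sees" when it reads a page: the title, the meta tags, the visible text and the links, with script content ignored. This is a simplified sketch built on Python's standard html.parser module; the PageReader class and the sample page are hypothetical, and real search engines process far more than this.

```python
from html.parser import HTMLParser


class PageReader(HTMLParser):
    """Gathers the title, named meta tags, visible text and links of a page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.meta = {}
        self.links = []
        self.text_parts = []
        self._in_title = False
        self._skip_depth = 0  # greater than 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._in_title = True
        elif tag == "meta" and attrs.get("name"):
            self.meta[attrs["name"].lower()] = attrs.get("content") or ""
        elif tag == "a" and attrs.get("href"):
            self.links.append(attrs["href"])
        elif tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._in_title:
            self.title += data
        elif not self._skip_depth and data.strip():
            self.text_parts.append(data.strip())


# Hypothetical page used only for this example.
SAMPLE_PAGE = """
<html><head>
<title>Acme Widgets</title>
<meta name="description" content="Hand-made widgets.">
<meta name="keywords" content="widgets, acme">
<script>document.write("menu");</script>
</head><body>
<h1>Welcome</h1>
<p>We sell <a href="/widgets.html">widgets</a>.</p>
</body></html>
"""

reader = PageReader()
reader.feed(SAMPLE_PAGE)
print("Title:", reader.title)
print("Meta: ", reader.meta)
print("Links:", reader.links)
print("Text: ", reader.text_parts)
```

Notice that the menu written out by JavaScript never shows up in the collected text, which is exactly why JavaScript navigation can leave a robot with nothing to follow.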

The information delivered to the databases then becomes part of the search engine and directory ranking process. When the search engine visitor submits their query, the search engine digs through its database to give the final listing that is displayed on the results page.

The search engine databases update at varying times. Once you are in the search engine databases, the robots keep visiting you periodically, to pick up any changes to your pages, and to make sure they have the latest info. The number of times you are visited depends on how the search engine sets up its visits, which can vary per search engine.

Sometimes visiting robots are unable to access the website they are visiting. If your site is down, or you are experiencing huge amounts of traffic, the robot may not be able to access your site. When this happens, the website may not be re-indexed, depending on the frequency of the robot visits to your website. In most cases, robots that cannot access your pages will try again later, hoping that your site will be accessible then.
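The "try again later" behavior might look something like the sketch below. How long any given robot waits between attempts is not stated in this article, so the 24-hour starting delay, the doubling between attempts and the limit of three attempts are all assumptions made for the example, as are the function names. Nothing actually sleeps; the function only records the waits it would have used.

```python
def fetch_with_retries(fetch, url, max_attempts=3, base_delay_hours=24):
    """Try fetch(url) up to max_attempts times.

    Returns (content or None, list of delays in hours that would be
    waited between attempts). The delay values are assumed, not taken
    from any real search engine.
    """
    delays = []
    for attempt in range(max_attempts):
        try:
            return fetch(url), delays
        except ConnectionError:
            if attempt < max_attempts - 1:
                # Assumed policy: double the wait after each failure.
                delays.append(base_delay_hours * 2 ** attempt)
    return None, delays


def make_flaky_fetch(failures):
    """Build a fake fetch function that fails a set number of times first."""
    state = {"left": failures}

    def fetch(url):
        if state["left"] > 0:
            state["left"] -= 1
            raise ConnectionError("site unavailable")
        return "<html>ok</html>"

    return fetch


# A site that is down for the first two visits, then comes back.
print(fetch_with_retries(make_flaky_fetch(2), "http://www.example.com/"))
# A site that stays down: the robot gives up and the page is not re-indexed.
print(fetch_with_retries(make_flaky_fetch(5), "http://www.example.com/"))
```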

Resources

*SpiderSpotting - Search Engine Watch. http://searchenginewatch.com/webmasters/spiders.html

*Robotstxt.org - List of robots and protocols for setting up a robots.txt file. http://www.robotstxt.org/

*Spider-Food - Tutorials, forums and articles about Search Engine spiders and Search Engine Marketing. http://spider-food.net/

*Spiderhunter.com - Articles and resources about tracking Search Engine spiders. http://www.spiderhunter.com/

*Sim Spider Search Engine Robot Simulator - Search Engine World has a spider that simulates what the Search Engine robots read from your website. http://www.searchengineworld.com/cgi-bin/sim_spider.cgi

Daria Goetsch is the founder and Search Engine Marketing Consultant for Search Innovation Marketing, a Search Engine Optimization company serving small businesses. She has specialized in Search Engine Promotion since 1998, including three years as the Search Engine Specialist for O'Reilly Media, Inc., a technical book publishing company.

Copyright © 2002-2005 Search Innovation Marketing. http://www.searchinnovation.com All Rights Reserved.

Permission to reprint this article is granted if the article is reproduced in its entirety, without editing, including the bio information. Please include a hyperlink to http://www.searchinnovation.com when using this article in newsletters or online.
