Recent Articles

Visitors Value Video Viewing Variety
A survey by AOL and AP found that 54 percent of Internet users have watched or downloaded a video clip.

A/B Testing for Adsense is Often Pointless
Google recently posted a little blurb about The ABCs of A/B Testing. I hate to be a wet blanket, but for most of us lower volume publishers, that's a pointless exercise with little to gain.

YouTube Receives Minor Revisions
YouTube went down again thursday night, but this time, it was a planned "outage." Now the site is up and running again, with "some maintenance and . . . a few new features" under its belt.

Google Domains: Search And Everything Else
Only four of Google's domains receive more than one percent of the traffic coming to a Google.com address from a US-based requestor.

Calacanis Enticing Diggers With Cash
Weblogs Inc and Netscape head Jason Calacanis is offering $1,000 per month to the top users on Digg, Reddit, Newsvine, Delicious, and Flickr...

PodZinger, TED, And Alex Laats
The TEDTalks series featuring talks by Al Gore, David Pogue, Tony Robbins, and others at the Technology, Entertainment, Design conference can be searched...



Click to Play

Graphics and Design Tip
In today’s visibility tip we’re going to talk a little bit about your site’s graphics and design. Graphics can be the gateway to a great site experience, or they can...

Low Rate eCommerce & Retail Plans
11.06.06


Yahoo Search Subtly Nudges Webmasters

By Chris Richardson

One of biggest thorns in the side of all search engines is non-relevant search results. Whether the cause of these types of results are spammers or webmasters who aren’t conscientious enough to check their work, poor SERPs can damage the reputation of search engine almost irrevocably (who, exactly, would want to use an engine that gives irrelevant results?).
Recent WebProBlog Post
Microsoft adCenter Under Fire
The data collection methods used by Microsoft’s adCenter service have been called into question. A complaint filed with the FTC equated their contextual advertising practices to adware.

Visit WebProBlog and contribute your opinion today!


The big four have taken various steps to address this, which usually consists of various webmaster guidelines and FAQs. However, Yahoo Search is trying a different approach. On the Yahoo Search blog, it was revealed they now parse wildcard characters (*, $) included in robot.txt files. The request for Yahoo to do this was made at an SES conference (who says search engines don’t listen to their audience?).

The blog entry reveals the details of the wildcard allowances:

‘*’ - matches a sequence of characters

You can now use ‘*’ in robots directives for Yahoo! Slurp to wildcard match a sequence of characters in your URL. You can use this symbol in any part of the URL string you provide in the robots directive. For example,

User-Agent: Yahoo! Slurp
Allow: /public*/

Disallow: /*_print*.html
Disallow: /*?sessionid

Low Rate eCommerce & Retail Plans

While the $ informs Slurp to:

anchor the match to the end of the URL string. Without this symbol, Yahoo! Slurp would match all URLs against the directives, treating the directives as a prefix. For example:

User-Agent: Yahoo! Slurp
Disallow: /*.gif$
Allow: /*?$

The first command tells Slurp to disallow all files that end with the .gif extension. The second command informs Slurp to allow files ending with “?” to be included in the crawl.

If you noticed, in the * wildcard explanation, Priyank Garg used Session IDs as an example and this is what led to my opening statement about irrelevant search results. Obviously, Yahoo wants to rid their index of as many erroneous URLs as possible and accepting the * wildcard in robot.txt files helps in this process, provided webmasters are willing to implement these allowances.

In fact, I asked Yahoo senior PR manager Shelia Tran if this was their intention when they introduced this change. She replied:

"We try our best to crawl and find the best unique content on webmaster’s sites. However, any help that webmasters can give in terms of hints and directives to prevent the crawler from crawling duplicate pages or crawler traps is welcome."

In other words, Yahoo wants webmasters to take an active role in preventing Slurp traps. Putting the appropriate commands in your site’s robot.txt file will also help them keep their SERPs relatively clean, which is something that benefits all search engine users.

Want to add your thoughts?
As always, visitors of WebProBlog are invited to share their comments, suggestions, ideas, and contributions. Visit WebProBlog and contribute your opinion today!


About the Author:
Chris Richardson is a search engine writer and editor for WebProNews. Visit WebProNews for the latest search news.

About InternetProNews
News and updates for the internet professional

InternetProNews is brought to you by:

WebProNews.com Jayde.com
MarketingNewz.com SalesNewz.com
CareerNewz.com InvestNewz.com
eCommNewz.com WebsiteNotes.com
AdvertisingDay.com ManagerNewz.com
SoHoDay.com CRMNewz.com


-- InternetProNews is an iEntry, Inc. publication --
iEntry, Inc. 2549 Richmond Rd. Lexington KY, 40509
2006 iEntry, Inc. All Rights Reserved Privacy Policy Legal

archives | advertising info | news headlines | free newsletters | comments/feedback | submit article


InternetProNews News Archives About Us Feedback InternetProNews Home Page About Article Archive News Downloads WebProWorld Forums Jayde iEntry Advertise Contact