Click here to get back home

Not the browsers, dummy, the search engines

 HomeNewsGroups | Search | About
 comp.infosystems.www.authoring.html    Post an article   get this group's latest topics as an RSS feed add this group's latest topics to your My MSN content add this group's latest topics to your My Yahoo content
Subject Author Date
Not the browsers, dummy, the search engines Kent Feiler 02-10-2007
Posted by Kent Feiler on February 10, 2007, 10:11 am
Please log in for more thread options



If I understand the general direction of recent posts, the idea is to
improve the quality of html/css by soliciting help from the various
browsers. Browsers can certainly detect problems but they have no
sensible place to report them and no way to prevent the same problem
from happening over-and-over in multiple sites around the world. That
idea simply doesn't work.

But how about this one. Suppose we have all of those search engine
spiders do a cursory html/css edit check while they're creeping around
on the internet, and not post items with errors into their search
files. Or perhaps flag them on their lists as having errors.

This has some things going for it. It would prevent bad html from
being disseminated all over the world; it would inform the authors of
the bad html that they have a problem, and it would encourage them to
fix their problem since that's the only way anyone will be able to
find the kernels of wisdom they wish to share with the world.

The negative from the search engine point of view would be that the
spiders would take substantially longer to analyze a given file. There
may be a positive for them as well (Other than that warm glow inside
when they just know they're doing the right thing!), they would be
delivering a better product to their customers. If someone selected a
item from a Google list they could be fairly sure it wouldn't end up
being a pile of pointy brackets and wall-to-wall text.



Regards,


Kent Feiler
www.KentFeiler.com

Posted by Andy Dingley on February 10, 2007, 10:27 am
Please log in for more thread options


> But how about this one. Suppose we have all of those search engine
> spiders do a cursory html/css edit check while they're creeping around
> on the internet,

Interesting idea...

This would be a solution to a problem of authors not knowing if their
sites were invalid. However the real problem is that authors don't
_care_ (or even understand) if their sites are invalid. If someone
cares to check, it's not hard to tell. This "spider validation" idea
just doesn't solve the real issue.


Posted by Adrienne Boswell on February 10, 2007, 1:29 pm
Please log in for more thread options


Gazing into my crystal ball I observed "Andy Dingley"

>> But how about this one. Suppose we have all of those search engine
>> spiders do a cursory html/css edit check while they're creeping
>> around on the internet,
>
> Interesting idea...
>
> This would be a solution to a problem of authors not knowing if their
> sites were invalid. However the real problem is that authors don't
> _care_ (or even understand) if their sites are invalid. If someone
> cares to check, it's not hard to tell. This "spider validation" idea
> just doesn't solve the real issue.
>
>

Actually, I am one of the few that does care, and I have seen good
results from it. For example, I live in Glendale, California, and I am
Catholic. What would be important to me? To find out when mass was in
Glendale. If I Google for mass glendale, I am greeted by first two
results that are not applicable, and the very next, Holy Family Catholic
Community happens to be a site I developed. Pretty good for 1,080,000
results - 3rd, and 1st relavent result.

I haven't used any "black hat" methods, or Merlin wizardry, just clean,
valid markup.

--
Adrienne Boswell at Home
Arbpen Web Site Design Services
http://www.cavalcade-of-coding.info
Please respond to the group so others can share


Posted by Henri Sivonen on February 10, 2007, 12:16 pm
Please log in for more thread options



> But how about this one. Suppose we have all of those search engine
> spiders do a cursory html/css edit check while they're creeping around
> on the internet, and not post items with errors into their search
> files.

That wont work, because most Web content is erroneous but still useful
to users. Search engines compete on the usefulness of their results, so
excluding useful results that have errors that browsers are able to
silently recover from would be a very bad business move.

> Or perhaps flag them on their lists as having errors.

Not a new idea. This has been discussed relatively recently on the
WHATWG list, for example (even though the discussion was off-topic
there).

This wont work, because flagging erroneous pages would mean that the
vast majority of search results would have an error flag next to them
adding clutter to the search UI. A person performing a search isn't
primarily interested about the spec conformance of the pages.

> This has some things going for it. It would prevent bad html from
> being disseminated all over the world; it would inform the authors of
> the bad html that they have a problem, and it would encourage them to
> fix their problem since that's the only way anyone will be able to
> find the kernels of wisdom they wish to share with the world.

Search engines aren't in the business of putting perpetrators of bad
HTML on the stocks.

> There
> may be a positive for them as well (Other than that warm glow inside
> when they just know they're doing the right thing!), they would be
> delivering a better product to their customers.

Why would having an error flag next to just about every search result
item constitute delivering a better product to their customers?

--
Henri Sivonen
hsivonen@iki.fi
http://hsivonen.iki.fi/
Mozilla Web Author FAQ: http://mozilla.org/docs/web-developer/faq.html

Posted by Kent Feiler on February 11, 2007, 9:27 am
Please log in for more thread options


wrote:


> But how about this one. Suppose we have all of those search engine
> spiders do a cursory html/css edit check while they're creeping
> around on the internet, and not post items with errors into their
> search files.

That wont work, because most Web content is erroneous but still useful
to users. Search engines compete on the usefulness of their results,
so excluding useful results that have errors that browsers are able to
silently recover from would be a very bad business move.

> Or perhaps flag them on their lists as having errors.

Not a new idea. This has been discussed relatively recently on the
WHATWG list, for example (even though the discussion was off-topic
there).

This wont work, because flagging erroneous pages would mean that the
vast majority of search results would have an error flag next to them
adding clutter to the search UI. A person performing a search isn't
primarily interested about the spec conformance of the pages.

> This has some things going for it. It would prevent bad html from
> being disseminated all over the world; it would inform the authors
> of the bad html that they have a problem, and it would encourage
> them to fix their problem since that's the only way anyone will be
> able to find the kernels of wisdom they wish to share with the
world.

Search engines aren't in the business of putting perpetrators of bad
HTML on the stocks.

> There may be a positive for them as well (Other than that warm glow
> inside when they just know they're doing the right thing!), they
> would be delivering a better product to their customers.

Why would having an error flag next to just about every search result
item constitute delivering a better product to their customers?
----------------------------------------------------------------------

Yeah, eliminating results for bad markup is a bad idea, but there are
likely a lot of variations on the thought of flagging bad markup. How
about something like a 1-10 rating where 1 = total crap and, of
course, 10 = the markup version of Bo Derek. As a user, if you had a
choice of two relevent items on a Google list, you might make your
choice on the basis of the markup score.

Would that motivate authors to improve their markup? Anyone in this
newsgroup who got less than a 10 would be horribly embarrassed, but
for others, it's hard to say.


Regards,


Kent Feiler
www.KentFeiler.com

Similar ThreadsPosted
What is going on with the Search Engines? March 15, 2005, 1:54 am
How I won big with search engines December 6, 2007, 3:58 am
How I won big with search engines December 6, 2007, 6:49 am
How to keep a site out of the search engines? May 24, 2006, 9:52 am
newbie: how to make a url appear in the search engines December 29, 2005, 9:04 am
do search engines index blogs? March 29, 2007, 6:08 pm
Multiviews and multilanguage content (and search engines!) October 23, 2004, 4:36 pm
Effectiveness of META tags in search engines? March 28, 2005, 8:58 pm
How do search engines index multilingual content? January 29, 2006, 2:46 pm
Do splash pages make you invisible to search engines November 10, 2005, 9:37 am

Our other projects:

Art Dolls, Fairies and Mermaids - Sunnyfaces.net

Roy's Linux, Programming and Search Engines messages

1-Script XML SitemapXML Sitemap