Re: Average non-meaningful word ratio
Hi Colin,
Yes, I'd be interested in keeping up to date with any news you find out. If you do start discussions at other forums, it would be nice if you posted the URLs here so we could all follow any developments as they happen.
Regardless, I look forward to further discussion on this and other points.
PS - TDOL, I've opened up a new forum for statistics & analysis discussions. ;-)
Re: Average non-meaningful word ratio
Hi
Good and bad news.
I've worked out another way to do the same thing, but it's not exactly the same method as I was originally going to use...
I'll keep you posted, but atm I'm still trying to figure it out for myself.
Basically, the search engine analyses the whole website (rather than just a single file), and words that are used more frequently are generally considered less likely to be relevant. *However*, if the site focuses on one word (e.g. this site focuses on the word "english"), then this method would be broken. Therefore, it also looks at the HTML files and considers text in bold tags to be sacred, and 'thou shall not give bad ratings to sacred words'. Then words such as "the" get smuggled into bold text and make themselves appear sacred... Still working on a cure for this one, but it will be solved :-) Maybe have some sort of holy water to dip words into. If it dies, it was not sacred...
Err... Yep :-)
Cheers
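A minimal sketch of the idea in the post above, not the poster's actual code. It assumes the site is available as a list of HTML strings, that "bold" means `<b>` or `<strong>`, and that a word's rating is simply one minus its count divided by the count of the most frequent word. The names `PageWords` and `score_site` are made up for illustration.

```python
import re
from collections import Counter
from html.parser import HTMLParser

WORD_RE = re.compile(r"[a-z']+")


class PageWords(HTMLParser):
    """Collect every word on a page, plus the words seen inside bold tags."""

    def __init__(self):
        super().__init__()
        self.bold_depth = 0      # how many bold tags are currently open
        self.words = []          # every word on the page, in order
        self.bold_words = set()  # words that appeared inside bold text

    def handle_starttag(self, tag, attrs):
        if tag in ("b", "strong"):
            self.bold_depth += 1

    def handle_endtag(self, tag):
        if tag in ("b", "strong") and self.bold_depth > 0:
            self.bold_depth -= 1

    def handle_data(self, data):
        found = WORD_RE.findall(data.lower())
        self.words.extend(found)
        if self.bold_depth:
            self.bold_words.update(found)


def score_site(pages):
    """Return {word: rating between 0 and 1} for a list of HTML strings."""
    counts = Counter()
    sacred = set()
    for html in pages:
        parser = PageWords()
        parser.feed(html)
        parser.close()
        counts.update(parser.words)
        sacred |= parser.bold_words
    if not counts:
        return {}
    top = max(counts.values())
    # Frequent words get low ratings; "sacred" (bold) words always keep 1.0.
    return {w: 1.0 if w in sacred else 1.0 - n / top for w, n in counts.items()}


pages = [
    "<p>the <b>english</b> language and the grammar</p>",
    "<p>the english verbs and the english tenses</p>",
]
scores = score_site(pages)
print(scores["the"], scores["english"], scores["grammar"])
```

With these two pages, "the" is the most frequent word and gets the lowest rating, while "english" is frequent but stays at the top rating because it was bold once. The loophole from the post is visible here too: a single `<b>the</b>` anywhere on the site would make "the" sacred.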
Re: Average non-meaningful word ratio
If bold tags are considered sacred, then it would be easy to overload the system by cramming text with tags, wouldn't it? ;-)
Re: Average non-meaningful word ratio
Indeed :P But then, the webmaster would be a fool. This isn't a search engine for the whole www, just one site.
It's okay though, I'm still in the testing phase, but so far it's looking good.
Cheers
Re: Average non-meaningful word ratio
Sorry, I missed that point. ;-)
Re: Average non-meaningful word ratio
:-P
Having said that, most webmasters are dozy nowadays, so I wouldn't put it past them to leave the odd bold tag open and forget to close it for several paragraphs. For that reason, it filters out long bold strings and suchlike (maybe only focusing on the first few words, up until the full stop? Not decided yet).
Cheers (better close that bold tag now :P)
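One way the "first few words, up until the full stop" option could look, as a rough sketch only, since the post says this is not decided yet. The function name `trusted_bold_words` and the default cap of five words are assumptions made up for illustration.

```python
import re


def trusted_bold_words(bold_text, max_words=5):
    """Keep only the start of a bold run: stop at the first full stop,
    and never trust more than max_words words."""
    first_sentence = bold_text.split(".", 1)[0]
    words = re.findall(r"[a-z']+", first_sentence.lower())
    return words[:max_words]


# A bold tag that was never closed swallows the following sentence too.
runaway = "English grammar. The rest of the page was left bold by mistake"
print(trusted_bold_words(runaway))
```

Only the words before the first full stop survive, so a forgotten closing tag can promote a handful of words at most rather than several paragraphs.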
Re: Average non-meaningful word ratio
And what happens to those who highlight important words with a different tag? :lol:
Re: Average non-meaningful word ratio
I'll make a big list of tags. The main problem is if developers start using span tags, since they could be emphasising words, but they could also be used for making text smaller, etc.
The admin of the site will have to set the search engine up correctly at the start, so that it knows span tags with class="enphasis" are emphasis tags.
It's fairly dependent on the admin using it properly, but remember that there are whole jobs in SEO, so this is much better than dictating how you write the site itself.
Cheers
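The admin-supplied list of tags could be sketched roughly like this. The rule format (a tag name plus an optional required class) and the names `EMPHASIS_RULES` and `is_emphasis` are assumptions for illustration; the class name is the one used in the post.

```python
# Hypothetical admin configuration: (tag name, required class or None).
EMPHASIS_RULES = [
    ("b", None),
    ("strong", None),
    ("em", None),
    ("span", "enphasis"),  # class name as written in the post
]


def is_emphasis(tag, attrs):
    """Return True if this tag counts as emphasis under the admin's rules.

    attrs is a list of (name, value) pairs, the shape that Python's
    html.parser passes to handle_starttag.
    """
    classes = (dict(attrs).get("class") or "").split()
    for rule_tag, rule_class in EMPHASIS_RULES:
        if tag == rule_tag and (rule_class is None or rule_class in classes):
            return True
    return False


print(is_emphasis("span", [("class", "enphasis")]))  # span marked as emphasis
print(is_emphasis("span", [("class", "small")]))     # span used for small text
```

A span that only shrinks text is ignored, while one carrying the configured class is treated like a bold tag. Since closing tags carry no attributes, a real parser would also need a stack to know which `</span>` ends an emphasised run.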
Re: Average non-meaningful word ratio
Make a special site 'keyword' tag? ;-)
Re: Average non-meaningful word ratio