  #11 (permalink)  
Old 26-Nov-2004, 14:06
Red5
Webmaster, UsingEnglish.com
 
Join Date: Nov 2002
Country: England
Posts: 2,781
Current Location: London
First Language: British English
Member Type: Other
Thanks: 2
Thanked 62 Times in 34 Posts
Red5 has disabled reputation
Default Re: Average non-meaningful word ratio

Hi Colin,

Yes, I'd be interested in keeping up to date with any news you find out. If you do start discussions at other forums, it would be nice if you posted the URLs here so we could all follow any developments as they happen.

Regardless, I look forward to further discussion on this and other points.

PS - TDOL, I've opened up a new forum for statistics & analysis discussions.
  #12 (permalink)  
Old 26-Nov-2004, 14:29
colinhorne
Guest
 
Posts: n/a
Default Re: Average non-meaningful word ratio

Hi

Good and bad news.

I've worked out another way to do the same thing, but it's not exactly the same method as I was originally going to use...

I'll keep you posted, but atm I'm still trying to figure it out for myself.

Basically, the search engine analyses the whole website (rather than just a single file), and words used more frequently across it are generally considered less likely to be relevant. *However*, if the site focuses on one word (e.g. this site focuses on the word "english"), then this method would be broken. Therefore, it also looks at the HTML files and considers text in bold tags to be sacred, and 'thou shalt not give bad ratings to sacred words'. Then words such as "the" get smuggled into bold text and make themselves appear sacred... Still working on a cure for this one, but it will be solved. Maybe have some sort of holy water to dip words into. If it dies, it was not sacred...
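
To make the idea above concrete, here is a minimal Python sketch of it, not the actual implementation (which the post does not show). It assumes that "bold tags" means <b> and <strong>, and that the rating is simply one minus a word's share of all words on the site; the names BoldWordCollector and relevance_scores are made up for illustration.

```python
import re
from collections import Counter
from html.parser import HTMLParser


class BoldWordCollector(HTMLParser):
    """Collects every word on a page and, separately, words inside bold tags."""

    def __init__(self):
        super().__init__()
        self.bold_depth = 0      # how many bold tags are currently open
        self.all_words = []
        self.bold_words = set()

    def handle_starttag(self, tag, attrs):
        if tag in ("b", "strong"):   # assumption: these count as "bold tags"
            self.bold_depth += 1

    def handle_endtag(self, tag):
        if tag in ("b", "strong") and self.bold_depth > 0:
            self.bold_depth -= 1

    def handle_data(self, data):
        words = re.findall(r"[a-z']+", data.lower())
        self.all_words.extend(words)
        if self.bold_depth > 0:
            self.bold_words.update(words)


def relevance_scores(pages):
    """Rate words over the whole site: frequent words rate low, bold words are exempt."""
    counts = Counter()
    sacred = set()
    for html in pages:
        collector = BoldWordCollector()
        collector.feed(html)
        collector.close()
        counts.update(collector.all_words)
        sacred |= collector.bold_words
    total = sum(counts.values())
    scores = {}
    for word, n in counts.items():
        # Hypothetical scoring rule: 1.0 for sacred words, otherwise penalise by frequency.
        scores[word] = 1.0 if word in sacred else 1.0 - n / total
    return scores


site = [
    "<p>The <b>english</b> site is the english one</p>",
    "<p>the grammar of english</p>",
]
for word, score in sorted(relevance_scores(site).items()):
    print(word, round(score, 3))
```

The sketch also reproduces the weakness described above: as soon as "the" turns up inside a bold tag anywhere on the site, it is treated as sacred and escapes the frequency penalty.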

Err... Yep

Cheers
  #13 (permalink)  
Old 30-Nov-2004, 03:40
Editor, UsingEnglish.com
 
Join Date: Nov 2002
Country: UK
Posts: 25,671
Current Location: Phnom Penh
First Language: English
Member Type: English Teacher
Thanks: 6
Thanked 543 Times in 478 Posts
Tdol has disabled reputation
Default Re: Average non-meaningful word ratio

If bold tags are considered sacred, then it would be easy to overload the system by cramming text with tags, wouldn't it?
  #14 (permalink)  
Old 30-Nov-2004, 05:51
colinhorne
Guest
 
Posts: n/a
Default Re: Average non-meaningful word ratio

Indeed :P But then, the webmaster would be a fool. This isn't a search engine for the whole www, just one site.

It's okay though. I'm still in the testing phase, but so far it's looking good.

Cheers
  #15 (permalink)  
Old 03-Dec-2004, 09:04
Editor, UsingEnglish.com
 
Join Date: Nov 2002
Country: UK
Posts: 25,671
Current Location: Phnom Penh
First Language: English
Member Type: English Teacher
Thanks: 6
Thanked 543 Times in 478 Posts
Tdol has disabled reputation
Default Re: Average non-meaningful word ratio

Sorry, I missed that point.
  #16 (permalink)  
Old 04-Dec-2004, 18:31
colinhorne
Guest
 
Posts: n/a
Default Re: Average non-meaningful word ratio



Having said that, most webmasters are dozy nowadays, so I wouldn't put it past them to leave the odd bold tag open and forget to close it for several paragraphs. For that reason, it filters out long bold strings and suchlike (maybe only focusing on the first few words, up to the full stop? Not decided yet).
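
One way the filter described above could look, as a rough Python sketch. The post leaves the exact rule undecided, so both choices here are assumptions: the bold run is cut at the first full stop, and at most max_words words are kept. The function name trusted_bold_words and the default of 5 are hypothetical, and extracting the text of the bold run from the page is assumed to have happened already.

```python
import re


def trusted_bold_words(bold_text, max_words=5):
    """Return the words of a bold run that are still treated as emphasised.

    Assumed rule (not settled in the thread): stop at the first full stop,
    then keep no more than max_words words.
    """
    first_sentence = bold_text.split(".", 1)[0]
    words = re.findall(r"[a-z']+", first_sentence.lower())
    return words[:max_words]


# A short, deliberate bold phrase is kept whole.
print(trusted_bold_words("English grammar"))

# A tag left open for paragraphs only contributes its first few words.
print(trusted_bold_words("Cheers. And then several paragraphs follow by accident"))
```

Truncating rather than discarding the whole run means an unclosed tag still gives its opening words the benefit of the doubt, which may or may not be what is wanted.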

Cheers (better close that bold tag now :P)
  #17 (permalink)  
Old 11-Dec-2004, 23:44
Editor, UsingEnglish.com
 
Join Date: Nov 2002
Country: UK
Posts: 25,671
Current Location: Phnom Penh
First Language: English
Member Type: English Teacher
Thanks: 6
Thanked 543 Times in 478 Posts
Tdol has disabled reputation
Default Re: Average non-meaningful word ratio

And what happens to those who highlight important words with a different tag?
  #18 (permalink)  
Old 12-Dec-2004, 05:19
colinhorne
Guest
 
Posts: n/a
Default Re: Average non-meaningful word ratio

I'll make a big list of tags. The main problem is if developers start using <span> tags, since they could be emphasising words, but they could also be used for making text smaller, etc.

The admin of the site will have to set the search engine up correctly at the start, so that it knows span tags with class="emphasis" are emphasis tags.

It's fairly dependent on the admin using it properly, but remember that there are whole jobs in SEO, so this is much better than dictating how you write the site itself.
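
A sketch of how such an admin-supplied setup might be applied, in Python. Everything named here is hypothetical: the EMPHASIS_TAGS and EMPHASIS_CLASSES settings stand in for whatever list the admin would actually provide, and the class name used for spans is only an example.

```python
import re
from html.parser import HTMLParser

# Hypothetical admin configuration: tags that always mark emphasis, and
# (tag, class) pairs that mark emphasis only when that class is present.
EMPHASIS_TAGS = {"b", "strong", "em"}
EMPHASIS_CLASSES = {("span", "emphasis")}


class EmphasisCollector(HTMLParser):
    """Collects words that appear inside any configured emphasis element."""

    def __init__(self):
        super().__init__()
        self.open_elements = []   # stack of (tag, is_emphasis)
        self.emphasised = []

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        is_emphasis = tag in EMPHASIS_TAGS or any(
            (tag, cls) in EMPHASIS_CLASSES for cls in classes
        )
        self.open_elements.append((tag, is_emphasis))

    def handle_endtag(self, tag):
        # Close the most recent matching open tag, dropping anything left open inside it.
        for i in range(len(self.open_elements) - 1, -1, -1):
            if self.open_elements[i][0] == tag:
                del self.open_elements[i:]
                break

    def handle_data(self, data):
        if any(is_emphasis for _, is_emphasis in self.open_elements):
            self.emphasised.extend(re.findall(r"[a-z']+", data.lower()))


def emphasised_words(html):
    collector = EmphasisCollector()
    collector.feed(html)
    collector.close()
    return collector.emphasised


page = (
    '<p>Learn <span class="emphasis">phrasal verbs</span> and '
    '<span class="small">fine print</span> in <b>English</b></p>'
)
print(emphasised_words(page))
```

Keeping the tag list in configuration rather than in code matches the point made above: a span used for small print is ignored unless the admin has said its class means emphasis.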

Cheers
  #19 (permalink)  
Old 12-Dec-2004, 08:39
Editor, UsingEnglish.com
 
Join Date: Nov 2002
Country: UK
Posts: 25,671
Current Location: Phnom Penh
First Language: English
Member Type: English Teacher
Thanks: 6
Thanked 543 Times in 478 Posts
Tdol has disabled reputation
Default Re: Average non-meaningful word ratio

Make a special site 'keyword' tag?
  #20 (permalink)  
Old 12-Dec-2004, 08:45
colinhorne
Guest
 
Posts: n/a
Default Re: Average non-meaningful word ratio

Hmm? Don't get you :P
All times are GMT.

Copyright © 2002 - 2008 UsingEnglish.com