jocs
Jul 28th 2004, 10:16 am
I wan to expose to this comunity my idea and when we all think its the best way then make a new tool:
When counting keyword density and prominence we have many ways to evaluate the keywords but I think we can have a tool that joins all ways and gives a concrete total view of each keyword.
note: when i say word I mean words that are not stopwords.
We'll view it by this Example:
<html><body> tal tal tal tal tal tal tal tal tal tal <b> yam tam pam </b></body></html>
keyword "yam": (values are not real)
it is 0 non stop words far from <b> tag - 100% prominence (it is the first word)
it is 10 non stop words far from <body> - tag 2% prominence
it is 10 non stop words far from <html> - tag 2%p rominence
Ok now we have a tag value table: (values are not real)
<HTML> 100
<BODY> 80
<B> 30
note: if there are 5 <b> tags in the html page, <b> tag value is divided by 5, the meaning of this is that if there is only one <b> tag in the html it will be more important than if there are 5.
then we'll calculate the total Rank of keyword yam doing:
VALUE += (% of prominence)*(tag value) for each tag that affects the keyword.
Value of "yam"=100%*30 (<B> tag value) + 2%*80 (<body> tag value) + 2%*100(<html> tag value)
Value of "yam"= 30+1.6+2=33.6
...
...
...
when we have all words values calculated we apply this rules:
if a keyword is just before another it will gain 1% of other value
if a keyword is just After another it will gain 0.8% of other value
if a keyword is the second word before another it will gain 0.5% of other value
if a keyword is the second word after another it will gain 0.4% of other value
meaning if a word is next to another, if one is important the other also will be a little important.
here is my idea. What do you think of?
Hope you understan dit :confused: , at least the idea, I would appreciate all comments, thanx.
When counting keyword density and prominence we have many ways to evaluate the keywords but I think we can have a tool that joins all ways and gives a concrete total view of each keyword.
note: when i say word I mean words that are not stopwords.
We'll view it by this Example:
<html><body> tal tal tal tal tal tal tal tal tal tal <b> yam tam pam </b></body></html>
keyword "yam": (values are not real)
it is 0 non stop words far from <b> tag - 100% prominence (it is the first word)
it is 10 non stop words far from <body> - tag 2% prominence
it is 10 non stop words far from <html> - tag 2%p rominence
Ok now we have a tag value table: (values are not real)
<HTML> 100
<BODY> 80
<B> 30
note: if there are 5 <b> tags in the html page, <b> tag value is divided by 5, the meaning of this is that if there is only one <b> tag in the html it will be more important than if there are 5.
then we'll calculate the total Rank of keyword yam doing:
VALUE += (% of prominence)*(tag value) for each tag that affects the keyword.
Value of "yam"=100%*30 (<B> tag value) + 2%*80 (<body> tag value) + 2%*100(<html> tag value)
Value of "yam"= 30+1.6+2=33.6
...
...
...
when we have all words values calculated we apply this rules:
if a keyword is just before another it will gain 1% of other value
if a keyword is just After another it will gain 0.8% of other value
if a keyword is the second word before another it will gain 0.5% of other value
if a keyword is the second word after another it will gain 0.4% of other value
meaning if a word is next to another, if one is important the other also will be a little important.
here is my idea. What do you think of?
Hope you understan dit :confused: , at least the idea, I would appreciate all comments, thanx.