Tool for text and keyword analysis: Topicalizer

Discussion in 'Products & Tools' started by BjoernW, Feb 26, 2006.

  1. #1
    Hello everbody,

    I would like to introduce a new tool for text analysis, which was (and is being) developed by me:
    http://www.topicalizer.com/

    This software analyses a given text input regarding aspects like type / token frequency, lexical density, sentence and paragraph structure, readability, word frequencies, collocations, possible keywords and many more.

    Apart from that it automatically creates an abstract for a given text.

    --
    Best regards
    Bjoern
     
    BjoernW, Feb 26, 2006 IP
  2. iconv

    iconv Well-Known Member

    Messages:
    189
    Likes Received:
    3
    Best Answers:
    0
    Trophy Points:
    108
    #2
    SOunds like an interesting idea, but the site is not working for me, it keeps on showing me the index page even after entering a url and hitting the 'Topicalize' button.
     
    iconv, Feb 26, 2006 IP
  3. BjoernW

    BjoernW Peon

    Messages:
    8
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #3
    That's strange. Which browser do you use and with which URL did you try it out?
     
    BjoernW, Feb 26, 2006 IP
  4. alienated

    alienated Active Member

    Messages:
    144
    Likes Received:
    6
    Best Answers:
    0
    Trophy Points:
    73
    #4
    Doesn't work for IE. Firefox is fine.
     
    alienated, Feb 26, 2006 IP
  5. BjoernW

    BjoernW Peon

    Messages:
    8
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #5
    Thanks for this helpful feedback.
    This is a really strange problem,because the site works equally well in IE for MacOS X and I cannot see for the moment what makes it choke in IE for Windows.
     
    BjoernW, Feb 26, 2006 IP
  6. eKstreme

    eKstreme Guest

    Messages:
    131
    Likes Received:
    14
    Best Answers:
    0
    Trophy Points:
    0
    #6
    I've had this problem before. I'm guessing in /process/ you're checking for a variable called "submit". IE does not submit it that (always? sometimes?). Anyway, the solution is to create a hidden field and then check that. I hope I'm making sense.

    Of course, it could be something completely different :rolleyes: Please post back if you need more help.
     
    eKstreme, Feb 27, 2006 IP
  7. BjoernW

    BjoernW Peon

    Messages:
    8
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #7
    Thanks for your help. I have just added another hidden field called 'submit' and, could you maybe check again if it works now?

    @eKstreme: The keyword extraction tool from your signature looks quite nice, too:)
     
    BjoernW, Feb 27, 2006 IP
  8. eKstreme

    eKstreme Guest

    Messages:
    131
    Likes Received:
    14
    Best Answers:
    0
    Trophy Points:
    0
    #8
    I can confirm it's working in IE6 on WinXP SP2 :) One suggestion: at the moment, the Topicalizer is identifying itself as "Mozilla/5.0". May I suggest you send a more informative user agent string? Web log junkies (me included) would like that, and it can add to your publicity.

    Glad you like the KEA. It works quite well to identify most keywords a page ranks for. If you have any suggestions for improvements, please PM me :)
     
    eKstreme, Feb 27, 2006 IP
  9. BjoernW

    BjoernW Peon

    Messages:
    8
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #9
    Thanks a lot for your help and feedback, I will think about what identity Topicalizer will use in the future.
     
    BjoernW, Feb 27, 2006 IP
  10. FlyinBlind

    FlyinBlind Peon

    Messages:
    25
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #10
    Hi!

    Tried to check my sites but kept getting:

    "URL could not be retrieved:
    mysite.com"

    Tried it with and without "http" and "www" in Firefox 1.5 and IE 6.0.
     
    FlyinBlind, Feb 28, 2006 IP
  11. iconv

    iconv Well-Known Member

    Messages:
    189
    Likes Received:
    3
    Best Answers:
    0
    Trophy Points:
    108
    #11
    All is working fine now, must have been the IE issue, thanks for looking into it. Handy little tool, thanks for making it available.
     
    iconv, Mar 2, 2006 IP
  12. BjoernW

    BjoernW Peon

    Messages:
    8
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #12
    This was a firewall-related issue, which has been solved by now.
     
    BjoernW, Mar 2, 2006 IP
  13. BjoernW

    BjoernW Peon

    Messages:
    8
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #13
    Topicalizer now identifies itself as 'Mozilla/5.0 (compatible; Topicalizer/www.topicalizer.com)'
     
    BjoernW, Mar 2, 2006 IP
  14. eKstreme

    eKstreme Guest

    Messages:
    131
    Likes Received:
    14
    Best Answers:
    0
    Trophy Points:
    0
    #14
    Indeed it does. Thank you :)
     
    eKstreme, Mar 2, 2006 IP
  15. BjoernW

    BjoernW Peon

    Messages:
    8
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    0
    #15
    There are several new tools available under:

    http://topicalizer.com/similarDocuments/
    http://topicalizer.com/augmentKeywords/
    http://topicalizer.com/coOccurrences/

    With the first one you can find web pages that are similar to a URL or text content-wise.
    The second one finds synonyms and related terms (hypernyms, hyponyms, generic terms) for a text.
    The last tool finally makes use of large corpora of different text categories in order to find words that co-occur frequently with the words which have been entered.
     
    BjoernW, May 4, 2006 IP