Automated content grabber

Discussion in 'PHP' started by ziya, Sep 15, 2007.

  1. #1
    Hello,

    If you can help me, please give me some clues...

    There are 2 sites, which differ as follows:

    1- The first has no RSS or XML files at all
    2- The second has an RSS file, but there is no full-text tag inside the RSS, only a short description


    I need a tool that can grab content from both sites (description, text, media, date, etc.).
    I want to set up a cron job that updates my tables from those sites every day without my help.

    Please help me...
     
    ziya, Sep 15, 2007 IP
  2. warzone

    warzone Peon

    #2
    I know what you mean. You want to steal their content and then post it onto a Blogger/WordPress account with your AdSense on it? A ready-made tool for that doesn't exist. But if you get a coder to write it for you, there will be one then. I think a custom-coded 'crawler' script should cost you around 200-500 USD.
     
    warzone, Sep 15, 2007 IP
  3. DopeDomains

    DopeDomains Guest

    #3
    There is no one-size-fits-all solution for site scraping. I've coded a dozen (give or take), and every site requires different finesse to get it right. And if they change layouts, you have to recode it, sometimes from scratch (worst case). Luckily, with CSS it's getting easier (love those DIV tags).

    -Jason
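
    To make the DIV point concrete, here is a rough sketch of pulling the text out of one named DIV. It is written in Python with only the standard library rather than PHP, and the class name "article" plus the helper names `DivTextExtractor` and `extract_div_text` are made up for illustration. A real site will need its own selector, and the selector will break if the layout changes.

```python
from html.parser import HTMLParser


class DivTextExtractor(HTMLParser):
    """Collects the text inside the first <div> whose class list contains target_class."""

    def __init__(self, target_class):
        super().__init__()
        self.target_class = target_class  # hypothetical class name, differs per site
        self.depth = 0      # nesting depth of <div> tags inside the target; 0 = outside
        self.done = False   # set once the target <div> has been closed
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag != "div" or self.done:
            return
        if self.depth > 0:
            # A nested <div> inside the target: track it so we know when the target ends.
            self.depth += 1
        else:
            classes = (dict(attrs).get("class") or "").split()
            if self.target_class in classes:
                self.depth = 1

    def handle_endtag(self, tag):
        if tag == "div" and self.depth > 0:
            self.depth -= 1
            if self.depth == 0:
                self.done = True

    def handle_data(self, data):
        if self.depth > 0:
            self.chunks.append(data)


def extract_div_text(html, target_class):
    """Return the whitespace-normalized text of the first matching <div>."""
    parser = DivTextExtractor(target_class)
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.chunks).split())
```

    Calling `extract_div_text(page_html, "article")` on a downloaded page would give the article text with the navigation and footer DIVs left out, assuming the site really wraps its content in such a DIV.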
     
    DopeDomains, Sep 15, 2007 IP
  4. HypertextFever

    HypertextFever Peon

    #4
    If you can get away with just meta tags, it would be 100 times easier to program.

    So many people have malformed HTML holding their site up, and it makes regex substring routines grab tags you don't want (yay!).
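
    As a rough illustration of the meta-tag route, here is a sketch in Python (standard library only, not PHP) that reads `<meta>` tags with a tolerant parser instead of regex, so unclosed tags elsewhere on the page don't matter. The names `MetaTagParser` and `extract_meta` are made up for this example, and it assumes the site actually fills in its meta tags.

```python
from html.parser import HTMLParser


class MetaTagParser(HTMLParser):
    """Collects <meta name=...> / <meta property=...> content values."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        # Use "name" (description, keywords) or "property" (e.g. og:title) as the key.
        key = attr.get("name") or attr.get("property")
        content = attr.get("content")
        if key and content is not None:
            # Keep the first value seen for each key; keys are lowercased.
            self.meta.setdefault(key.lower(), content)


def extract_meta(html):
    """Return a dict of meta tag values found in the given HTML string."""
    parser = MetaTagParser()
    parser.feed(html)
    parser.close()
    return parser.meta
```

    With this, `extract_meta(page_html).get("description")` returns the page's description if one is present, or `None` otherwise.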
     
    HypertextFever, Sep 16, 2007 IP
  5. ziya

    ziya Well-Known Member

    #5
    warzone - Yes, something like that. But I am going to publish with an active link to those sites :)

    But what about automated blog websites? How do they build automated blogs?

    DopeDomains - Can you help me with the code? There are 2 websites for now. I am tired of copy-pasting content from those sites. I want to automate that process. Can I PM you?
     
    ziya, Sep 16, 2007 IP
  6. DopeDomains

    DopeDomains Guest

    #6
    Feel free to contact me via PM, or my Messenger icons.

    -Jason
     
    DopeDomains, Sep 16, 2007 IP