Scraping other Websites as a Content Fetcher

Discussion in 'PHP' started by Philip C. Ngo, Jul 18, 2014.

  1. #1
    I want to create a content grabber with php,What is the best one that You heard ,
    My mean is send a request to external url and get data as html format then parse it and display the title of content to the Main page.

    all grabbed content be index in my database ,after click on link you go to My web page
    whit this url:

    example.com/title of grabbed content/grabbedcontent.php
     
    Last edited: Jul 18, 2014
    Philip C. Ngo, Jul 18, 2014 IP
  2. Hamid Hassan

    Hamid Hassan Banned

    Messages:
    17
    Likes Received:
    0
    Best Answers:
    0
    Trophy Points:
    46
    #2
    I have more then 5 years experience in web development. I can do this for you.
     
    Hamid Hassan, Jul 18, 2014 IP
  3. AsHinE

    AsHinE Well-Known Member

    Messages:
    240
    Likes Received:
    8
    Best Answers:
    1
    Trophy Points:
    138
    #3
    So, what is your question?
    How to fetch remote url? How to get its title or what do you want to know?
     
    AsHinE, Jul 21, 2014 IP
  4. Philip C. Ngo

    Philip C. Ngo Member

    Messages:
    85
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    43
    #4
    Actually I want to know How the parse content will goes to database(mysql) and Automatically will make url with title of grabbed content(with Array)?
     
    Last edited: Jul 21, 2014
    Philip C. Ngo, Jul 21, 2014 IP
  5. kutchbhi

    kutchbhi Active Member

    Messages:
    130
    Likes Received:
    4
    Best Answers:
    2
    Trophy Points:
    70
    #5
    Capture page using curl .
    Parse page using Querypath.
     
    kutchbhi, Jul 22, 2014 IP
  6. Philip C. Ngo

    Philip C. Ngo Member

    Messages:
    85
    Likes Received:
    2
    Best Answers:
    0
    Trophy Points:
    43
    #6
    said:http://querypath.org/
    there isn't any alternative PHP library for manipulating HTML and XML
    also for curl and file_get_content for huge project
     
    Philip C. Ngo, Jul 22, 2014 IP