Looking for help creating a couple helpful tools
schlottke
Apr 5th 2004, 9:39 am
Shawn, GuyFromChicago, and whoever else may be reading...
I do not have extensive knowledge of PHP yet ;-) I do need to get a couple of tools working, and I really don't have the darndest idea how to start.
My goals:
I am creating a website that will become a portal for wrestling related information...
Articles, News, Shopping (obviously plugging JRWrestling.com as the Froogle, if you will), and a directory of HS, College, and other related links.
My problem is this: I want to draw news from 2-3 different sources (all of which will give me permission) automatically, without having to touch code.
There is a Google News script (it doesn't seem to work anymore since they sued the maker); if you want to see it for the core idea, please let me know.
digitalpoint
Apr 5th 2004, 10:09 am
Well, the easiest way to do it would be if they gave you some sort of standard feed (like RSS). There are lots of RSS feed scripts around (check hotscripts.com, for example).
- Shawn
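For anyone reading along, here is a rough idea of what consuming a standard RSS 2.0 feed looks like. This is only a minimal sketch in Python using the standard library; the sample feed, its URLs, and the function name parse_rss_items are made up for illustration and do not come from any of the sites in this thread.

```python
import xml.etree.ElementTree as ET

# Hypothetical RSS 2.0 feed used only for illustration.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0">
  <channel>
    <title>Example Wrestling News</title>
    <item>
      <title>State finals recap</title>
      <link>http://example.com/news/1</link>
    </item>
    <item>
      <title>College rankings released</title>
      <link>http://example.com/news/2</link>
    </item>
  </channel>
</rss>"""


def parse_rss_items(xml_text, limit=10):
    """Return up to `limit` items as dicts with 'title' and 'link' keys."""
    root = ET.fromstring(xml_text)
    items = []
    # In RSS 2.0, each story is an <item> element inside <channel>.
    for item in root.iter("item"):
        title = (item.findtext("title") or "").strip()
        link = (item.findtext("link") or "").strip()
        if title and link:
            items.append({"title": title, "link": link})
        if len(items) >= limit:
            break
    return items


if __name__ == "__main__":
    for entry in parse_rss_items(SAMPLE_FEED):
        print(entry["title"], "->", entry["link"])
```

Because the feed format is the same everywhere, the same function should work for any site that publishes RSS 2.0, which is why a feed is so much less work than scraping.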
schlottke
Apr 5th 2004, 10:14 am
Well, the problem there is... these guys run big sites, but don't have technical knowledge. I will need to pull it directly off of their sites. The extension is .asp... I wonder if editing a link harvester would work.
Google doesn't offer RSS, I don't believe, but the code still seems to pull the news from the pages somehow.
digitalpoint
Apr 5th 2004, 10:18 am
They are probably screen-scraping, which means you would need to write a custom script for each site.
- Shawn
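To show why a scraper ends up being specific to one site, here is a minimal sketch in Python (standard library only). It assumes a made-up page layout where every story link contains "story.asp"; the sample HTML, the base URL, and the names HeadlineLinkParser and scrape_headlines are all hypothetical. A real site would need its own rule, and that rule breaks whenever the site changes its markup.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Hypothetical news listing; real sites will use different markup.
SAMPLE_PAGE = """
<table>
  <tr><td><a href="story.asp?id=101">Regional results  posted</a></td></tr>
  <tr><td><a href="/about.asp">About us</a></td></tr>
  <tr><td><a href="story.asp?id=102">Coach of the <b>year</b> named</a></td></tr>
</table>
"""


class HeadlineLinkParser(HTMLParser):
    """Collect (title, absolute URL) pairs for links matching a site rule."""

    def __init__(self, base_url, href_marker):
        super().__init__()
        self.base_url = base_url
        self.href_marker = href_marker  # the site-specific part
        self.headlines = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href") or ""
            if self.href_marker in href:
                self._href = urljoin(self.base_url, href)
                self._text = []

    def handle_data(self, data):
        # Only keep text while inside a matching link.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            title = " ".join("".join(self._text).split())
            if title:
                self.headlines.append((title, self._href))
            self._href = None


def scrape_headlines(html_text, base_url, href_marker):
    parser = HeadlineLinkParser(base_url, href_marker)
    parser.feed(html_text)
    parser.close()
    return parser.headlines


if __name__ == "__main__":
    base = "http://example.com/news/list.asp"
    for title, url in scrape_headlines(SAMPLE_PAGE, base, "story.asp"):
        print(title, "->", url)
```

The only site-specific input here is the href_marker string, but in practice each site tends to need different filtering, so expect one rule (or one small script) per source.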
schlottke
Apr 5th 2004, 10:19 am
OK, OK... I'd be willing to pay for the creation of this, as well as a script that can create an Outlook file from osCommerce's customer list...
Or you could create it and offer the code to the public free?
digitalpoint
Apr 5th 2004, 10:22 am
Sorry... not something I want to do. It's useless to the public because it would only work for the specific site.
If you can't do it yourself, your best option would be to hire a programmer to do it for you.
- Shawn
schlottke
Apr 5th 2004, 10:37 am
Understandable. I guess I'll have to get on the grindstone and work it out. ;)
GuyFromChicago
Apr 5th 2004, 6:52 pm
As much as I would like to help, I’m not your guy. My strengths are more in the hardware side of things. I know enough programming to be dangerous, but that’s about it. I just work on small apps to make my life and job easier – my partner does the “hard stuff” :D
schlottke
Apr 5th 2004, 8:12 pm
Totally understandable, I didn't want to try to figure it out either... lol
TampaDave
Apr 7th 2004, 7:23 am
What are you trying to scrape?
I could write something in VBScript for .asp that grabs stuff.
For a sample of such a script, see
http://www.sfreader.com/scraper.asp
Enter a URL into the field; be sure to include http://
Example: http://www.yahoo.com
The result will show you the HTML source for the page and the page itself.
Basics:
It goes out, grabs the client-side source, loads it into a variable, and serves it up. The thing is that the content is now being served from YOUR server. Any spider hitting this page would see it as YOUR page, not the page you are scraping from.
TampaDave
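The fetch-and-serve idea described above can be sketched roughly as follows. This is not the VBScript behind the sample page; it is a minimal Python illustration using only the standard library, and the function names fetch_source and render_source_view are made up for the example.

```python
import html
import urllib.request


def fetch_source(url, timeout=10):
    """Fetch a page and return its source as text (loads it into a variable)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        # Fall back to UTF-8 when the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def render_source_view(source):
    """Build a page showing the escaped source first, then the markup itself."""
    return (
        "<h2>Source</h2>\n<pre>" + html.escape(source) + "</pre>\n"
        "<h2>Rendered</h2>\n" + source
    )


if __name__ == "__main__":
    # Any URL works here; remember to include the http:// prefix.
    page = fetch_source("http://www.yahoo.com")
    print(render_source_view(page)[:500])
```

Whatever render_source_view returns is what your own server would send to the browser, which is why a spider sees the content as coming from your page. Relative links and images in the fetched markup will still point at paths that only exist on the original site, so they would need rewriting in a real setup.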
schlottke
Apr 7th 2004, 11:59 am
I'm trying to scrape the following:
Most notably:
http://thewrestlingmall.com/htmls/news.asp?Cat=4
http://themat.com/frontNews/dynamic/top10.asp
http://themat.com/pressbox/presslist.asp?catid=4
But wouldn't mind:
http://thewrestlingmall.com/htmls/news.asp?Cat=2
http://thewrestlingmall.com/htmls/news.asp?Cat=3
http://thewrestlingmall.com/htmls/news.asp?Cat=5
http://themat.com/pressbox/presslist.asp?catid=49