Is there a PHP script that will import a Wikipedia DB dump into WordPress?

Discussion in 'PHP' started by webcabbie, Dec 20, 2013.

  1. #1

    So forgive me here, guys, I am a total dumbass, but this is what I want to do.

    I want to take a Wikipedia database dump and import it into a blank WordPress install, so that all the articles from Wikipedia are viewable as posts on the WordPress site.

    I realize this is crazy, and I'd rather not discuss why I want to do it, but I need to accomplish this.

    There has to be a way.
     
    webcabbie, Dec 20, 2013
  2. Philbeng

    Philbeng Well-Known Member

    #2
    I know WordPress pretty well, but as far as I know there is no out-of-the-box solution for this.
    There is always a way, but for this it looks like custom scripting/coding.
    You also need to think about where you are going to host it, as the database is huge and the resources needed to run and serve it aren't going to make your average host happy!
    Are you talking about running the whole of the Wikipedia dump, multilingual etc., or just the "en" version? Also, are you talking about all the categories and pages, or just one category?
    If you are only after a category, it might be possible to import the XML dump through the WordPress admin panel as an RSS feed, probably in stages depending on the category size. Another option, if you only want a subset of the pages, is a plugin that scrapes pages through Wikipedia's API, but there is a limit on how many API calls you can make.
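    To give an idea of what "in stages" could look like, here is a rough sketch in Python (not PHP, purely for illustration; the same approach should port to PHP's XMLReader). It assumes the standard XML export layout with <page>, <title>, <ns> and <revision><text> elements. The names iter_pages, in_batches and SAMPLE are made up for this example, and the tiny SAMPLE document stands in for a real dump file.

```python
import io
import xml.etree.ElementTree as ET
from itertools import islice

# Tiny stand-in for a real dump (assumption: same element layout).
SAMPLE = b"""<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.10/">
  <page><title>PHP</title><ns>0</ns><id>1</id>
    <revision><id>10</id><text>PHP is a scripting language.</text></revision></page>
  <page><title>Talk:PHP</title><ns>1</ns><id>2</id>
    <revision><id>11</id><text>Discussion.</text></revision></page>
  <page><title>WordPress</title><ns>0</ns><id>3</id>
    <revision><id>12</id><text>WordPress is a CMS.</text></revision></page>
</mediawiki>"""


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(source):
    """Yield (title, wikitext) for each main-namespace <page> in a dump.

    iterparse reads the file incrementally and elem.clear() drops each
    page's contents after use, which keeps memory use low on big files.
    """
    for _event, elem in ET.iterparse(source, events=("end",)):
        if local_name(elem.tag) != "page":
            continue
        fields = {"title": "", "ns": "0", "text": ""}
        for node in elem.iter():
            name = local_name(node.tag)
            if name in fields:
                fields[name] = node.text or ""
        if fields["ns"] == "0":  # 0 = articles; skips Talk:, User:, etc.
            yield fields["title"], fields["text"]
        elem.clear()


def in_batches(items, size):
    """Group an iterator into lists of at most `size` items."""
    iterator = iter(items)
    while True:
        batch = list(islice(iterator, size))
        if not batch:
            return
        yield batch


if __name__ == "__main__":
    for number, batch in enumerate(in_batches(iter_pages(io.BytesIO(SAMPLE)), 1), 1):
        print("stage", number, [title for title, _text in batch])
```

    With a real dump you would pass an open file instead of io.BytesIO(SAMPLE) and use a much larger batch size; each batch would then be one import run.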
    It might be easier to use the MediaWiki software and have that modded by someone to suit what you are trying to achieve.
    To me it looks like it is going to be a lot of work and expense.
     
    Philbeng, Dec 21, 2013
  3. webcabbie

    webcabbie Active Member

    #3
    Hosting is not an issue. I only want the English version, and the entire English version, all of it. There has to be a simple way to turn the database dumps into something that can be imported into WordPress.
     
    webcabbie, Dec 23, 2013
  4. Philbeng

    Philbeng Well-Known Member

    #4
    Unfortunately it isn't as simple as you seem to think, as the database structures are very different.
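    To make the mismatch concrete: MediaWiki spreads an article across its page, revision and text tables, while WordPress keeps each post as one row in wp_posts. A conversion script has to flatten one into the other. Below is a minimal Python sketch of that mapping (again just for illustration, it would look much the same in PHP). The function names page_to_post and slugify are hypothetical, the column names are the usual wp_posts ones, and the choice of "publish" and "post" is an assumption about what you want.

```python
import re
from datetime import datetime, timezone


def slugify(title):
    """Lowercase the title and collapse non-alphanumeric runs into hyphens."""
    return re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")


def page_to_post(title, wikitext, timestamp):
    """Map one MediaWiki page revision onto wp_posts-style column names.

    `timestamp` is the dump's ISO 8601 form, e.g. '2013-12-20T10:00:00Z'.
    """
    when = datetime.strptime(timestamp, "%Y-%m-%dT%H:%M:%SZ")
    when = when.replace(tzinfo=timezone.utc)
    return {
        "post_title": title,
        "post_name": slugify(title),   # URL slug
        "post_content": wikitext,      # still raw wikitext, not HTML
        "post_status": "publish",      # assumption: publish everything
        "post_type": "post",           # assumption: posts, not pages
        "post_date_gmt": when.strftime("%Y-%m-%d %H:%M:%S"),
    }


if __name__ == "__main__":
    print(page_to_post("PHP (language)", "'''PHP''' is ...", "2013-12-20T10:00:00Z"))
```

    Note that this only moves fields around. The article body is still wikitext, so templates, infoboxes and internal links would need a separate conversion step before WordPress could display them properly.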
    Basically, your options, without spending a lot of time searching, are:
    • Have a custom script written.
    • Have someone mod the MediaWiki software.
    • Have someone modify the wiki dump to suit the WordPress software.
    • Use the XML dump from Wikimedia and put that into WordPress through the admin panel as an RSS feed (this looks interesting: http://wordpress.org/plugins/wp-all-import/). This would likely take a lot of attempts and time.
    • Or do any of the above yourself.
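    As a rough idea of the "modify the dump to suit WordPress" route: WordPress's own export/import format (WXR) is RSS with some extra wp: and content: elements, so a script could emit that from the wiki pages. The Python sketch below is only an illustration under assumptions: build_wxr is a made-up name, the file it produces is stripped to a bare minimum, and I have not checked that the WordPress importer accepts a file this sparse (real exports carry many more fields, such as authors and post IDs).

```python
import xml.etree.ElementTree as ET

CONTENT_NS = "http://purl.org/rss/1.0/modules/content/"
WP_NS = "http://wordpress.org/export/1.2/"
# Make the serializer use the conventional prefixes instead of ns0, ns1.
ET.register_namespace("content", CONTENT_NS)
ET.register_namespace("wp", WP_NS)


def build_wxr(pages):
    """Return a minimal WXR-style RSS document for (title, body) pairs."""
    rss = ET.Element("rss", {"version": "2.0"})
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, f"{{{WP_NS}}}wxr_version").text = "1.2"
    for title, body in pages:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = title
        # ElementTree escapes <, > and & in the body automatically.
        ET.SubElement(item, f"{{{CONTENT_NS}}}encoded").text = body
        ET.SubElement(item, f"{{{WP_NS}}}post_type").text = "post"
        ET.SubElement(item, f"{{{WP_NS}}}status").text = "publish"
    return ET.tostring(rss, encoding="unicode")


if __name__ == "__main__":
    print(build_wxr([("PHP", "PHP is a scripting language.")]))
```

    For the full English dump you would write one such file per batch of pages rather than a single giant file, and import them one at a time.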
    I've never worked on, or known of, a WordPress site with a DB of that size, so I wouldn't know how well WordPress would handle it without a lot of optimization. There are a couple of plugins in the WordPress repository, but they only seem to work with the Wikipedia API.
     
    Last edited: Dec 24, 2013
    Philbeng, Dec 24, 2013