08-10-2009, 11:44 AM   #1
ahi
Generating from Wikipedia

Are there any good tools or programming libraries (Python or PHP) for working with Wikipedia, or the Wikipedia database, in a reasonably high-level way, for the purpose of extracting articles?

The sort of thing I'd want is to be able to grab the full HTML of an article and its associated images (at full resolution) with a single command or command-line call. I'd also like to get a list of the article's linked article keywords in a similarly straightforward way.

Ideally it ought to be possible to program a sort of limited but "intelligent" spidering of articles.
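
To make "limited" a bit more concrete, here is a rough Python sketch of the kind of spidering I have in mind: a breadth-first walk that stops at a fixed depth and page count and lets a filter decide which links are worth following. Everything in it is hypothetical: `crawl`, `get_links`, `keep`, and the `FAKE_LINKS` table are names I made up for illustration, and `get_links` stands in for whatever tool or library would actually return an article's links.

```python
from collections import deque


def crawl(start, get_links, max_depth=1, max_pages=50, keep=lambda title: True):
    """Visit articles breadth-first from `start` and return titles in visit order.

    get_links(title) is a placeholder for the real link-listing call.
    keep(title) is the "intelligent" part: it decides which links to follow.
    """
    seen = {start}
    order = []
    queue = deque([(start, 0)])
    while queue and len(order) < max_pages:
        title, depth = queue.popleft()
        order.append(title)
        if depth >= max_depth:
            continue  # do not expand links beyond the depth limit
        for linked in get_links(title):
            if linked not in seen and keep(linked):
                seen.add(linked)
                queue.append((linked, depth + 1))
    return order


# Made-up link table so the sketch runs without touching the network.
FAKE_LINKS = {
    "Python": ["Guido van Rossum", "Category:Languages", "PHP"],
    "PHP": ["Rasmus Lerdorf", "Python"],
    "Guido van Rossum": ["Netherlands"],
}


def fake_get_links(title):
    return FAKE_LINKS.get(title, [])


def skip_categories(title):
    return not title.startswith("Category:")


print(crawl("Python", fake_get_links, max_depth=2, keep=skip_categories))
```

With a real link source plugged in for `fake_get_links`, the depth limit, page limit, and filter would be the knobs I'd want to control.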

Thanks for any tips in advance.

- Ahi