data-grabbing & mining - need script-help

bernard_hinault
Forum Newbie
Posts: 1
Joined: Sat Jul 22, 2006 5:08 pm

Post by bernard_hinault »

Hello all,

This is probably one of the best places to ask such questions, so here goes.

First off, I have to explain something: I have to grab some data out of a phpBB board in order to do some field research. I need the data from a forum that is run by a user community, so that I can analyze the discussions.


To give an example, let us take this forum here. How can I grab all the data out of this forum, get it onto my local machine, and afterwards put it into the local database of a phpBB forum? Is this possible?
http://www.nukeforums.com/forums/vi...c03f1cb45d39493

Nothing harmful, nothing bad, nothing serious or dangerous. But the issue is:
I have to get the data, so what now?

I need the data in an almost full and complete format. So I need all the data, like:

username
forum
thread
topic
text of the posting, and so on.

How do I do that?

I need some kind of grabbing tool. Can I do it with that kind of tool? And how do I solve the issue of storing the data in the local MySQL database?

Well, you see, this is tricky work, and I am pretty sure that I will get help here. So I am very, very thankful for any and all help.

Many, many thanks in advance

metabo
- an ethno-researcher
Chris Corbyn
Breakbeat Nuttzer
Posts: 13098
Joined: Wed Mar 24, 2004 7:57 am
Location: Melbourne, Australia

Post by Chris Corbyn »

file_get_contents() to fetch each page, with some complex preg_match() patterns to pull out the fields you want.

What you're doing is probably illegal unless you've got permission from the website owners though. I know we'd not be too happy if you tried to "download" our database and set up another phpBB with all our content.
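
For what it's worth, here is a rough sketch of the file_get_contents() plus preg_match approach, assuming you do have the owners' permission. Everything specific in it is an assumption for illustration: the markup the pattern looks for (a span with class "name" holding the username, followed by a span with class "postbody" holding the text), the function names, and the scraped_posts table with its columns. Real phpBB templates differ from board to board, so the pattern will need adapting, and it will cut a post short if the post body itself contains nested span tags.

```php
<?php
// Fetch one page as a string. file_get_contents() returns false on failure.
function fetch_page(string $url): string
{
    $html = file_get_contents($url);
    if ($html === false) {
        throw new RuntimeException("Could not fetch $url");
    }
    return $html;
}

// Pull username/text pairs out of a topic page.
// ASSUMED (hypothetical) markup per post:
//   <span class="name"><b>USERNAME</b></span> ... <span class="postbody">TEXT</span>
function extract_posts(string $html): array
{
    $pattern = '~<span class="name">.*?<b>(.*?)</b>.*?'
             . '<span class="postbody">(.*?)</span>~s';
    preg_match_all($pattern, $html, $matches, PREG_SET_ORDER);

    $posts = [];
    foreach ($matches as $m) {
        $posts[] = [
            // Drop any inner tags and decode entities such as &amp;
            'username' => html_entity_decode(strip_tags($m[1]), ENT_QUOTES, 'UTF-8'),
            'text'     => trim(html_entity_decode(strip_tags($m[2]), ENT_QUOTES, 'UTF-8')),
        ];
    }
    return $posts;
}

// Store the posts with a prepared statement.
// ASSUMED (hypothetical) table: scraped_posts(topic, username, body).
function store_posts(PDO $pdo, string $topic, array $posts): void
{
    $stmt = $pdo->prepare(
        'INSERT INTO scraped_posts (topic, username, body) VALUES (?, ?, ?)'
    );
    foreach ($posts as $post) {
        $stmt->execute([$topic, $post['username'], $post['text']]);
    }
}
```

You would call fetch_page() once per topic page, pass the result to extract_posts(), and hand the array to store_posts() together with a PDO connection to your local MySQL database. Putting a sleep() between requests keeps the load on the other server low.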