
A proxy-like website fetching and displaying

Posted: Fri Jul 31, 2009 5:03 pm
by ubax
Is there an easy way to fetch a website, parse it, and then display it from your own server as if it were being viewed on the original site?

This is needed to stay in control of what is displayed - the content is filtered before it is shown.

One could fetch the website and then absolutize all relative URLs, but that still leaves several problems - relative links generated by JS and CSS, and so forth.
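
For the static HTML part, here is a rough sketch of the absolutizing step. Python is an assumption (no language is named in this thread), and so are the function name `absolutize_html` and the example.com URLs. It only rewrites quoted href/src/action attributes with a regex, so it does nothing about URLs that JS builds at runtime:

```python
import re
from urllib.parse import urljoin

# Quoted attributes that commonly carry URLs. Not exhaustive: srcset,
# unquoted values, and inline styles are ignored, and attributes such as
# data-src also match. A real HTML parser would be more robust.
_URL_ATTR = re.compile(
    r'''(?P<prefix>\b(?:href|src|action)\s*=\s*)(?P<quote>["'])(?P<url>.*?)(?P=quote)''',
    re.IGNORECASE | re.DOTALL,
)

# Schemes that must not be resolved against the page URL.
_SKIP = re.compile(r"(?i)^(?:javascript|mailto|data):")


def absolutize_html(html: str, base_url: str) -> str:
    """Rewrite relative href/src/action values as absolute URLs.

    base_url is the URL the page was fetched from (hypothetical name).
    """
    def fix(match):
        url = match.group("url").strip()
        # Leave in-page fragments and non-hierarchical schemes untouched.
        if url.startswith("#") or _SKIP.match(url):
            return match.group(0)
        quote = match.group("quote")
        return f'{match.group("prefix")}{quote}{urljoin(base_url, url)}{quote}'

    return _URL_ATTR.sub(fix, html)


page = '<a href="../b.html">next</a> <img src=\'img/a.png\'>'
print(absolutize_html(page, "http://example.com/dir/page.html"))
```

URLs that are already absolute pass through `urljoin` unchanged, so the function can be run over a whole page without checking each link first.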

Any ideas, directions?

Re: A proxy-like website fetching and displaying

Posted: Fri Jul 31, 2009 5:49 pm
by alex.barylski
You pretty much nailed the requirements, and yes it's mostly possible, minus the relative JS paths.

I believe there is an Apache module that does this, but I don't know how it deals with relative paths in CSS/JS.
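
The CSS side at least is tractable if the stylesheets are fetched and rewritten too. A minimal sketch in Python, where the language, the function name `absolutize_css`, and the example.com URLs are all assumptions. The one thing to keep in mind is that a relative `url(...)` in a stylesheet resolves against the stylesheet's own URL, not the page's. `@import "file.css"` without `url()` is not handled here:

```python
import re
from urllib.parse import urljoin

# Matches url(...) with an optional single or double quote around the value.
_CSS_URL = re.compile(
    r'''url\(\s*(?P<quote>["']?)(?P<url>[^"')]+)(?P=quote)\s*\)''',
    re.IGNORECASE,
)


def absolutize_css(css: str, stylesheet_url: str) -> str:
    """Rewrite relative url(...) references as absolute URLs.

    stylesheet_url is where the CSS file itself was fetched from
    (hypothetical name), since that is the base for its relative URLs.
    """
    def fix(match):
        url = match.group("url").strip()
        # Embedded data: URIs are not paths and must be left alone.
        if url.lower().startswith("data:"):
            return match.group(0)
        quote = match.group("quote")
        return f"url({quote}{urljoin(stylesheet_url, url)}{quote})"

    return _CSS_URL.sub(fix, css)


css = 'body { background: url("../img/bg.png"); }'
print(absolutize_css(css, "http://example.com/css/site.css"))
```

Paths that JS assembles at runtime cannot be caught by this kind of static rewriting, which is the part that stays unsolved.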