I want to parse the page produced by a web server when directory browsing is allowed.
An example:
Code:
<html><head><META http-equiv="Content-Type" content="text/html; charset=UTF-8"><title>mediatracker.selfip.com - /mtcanal2/</title></head><body><H1>test - /mtcanal2/</H1><hr>
<pre><A HREF="/">[To Parent Directory]</A><br><br> Tuesday, August 26, 2008 5:59 PM 147345966 <A HREF="/mtcanal2/canal_2_08_26_08_5pm.wmv">canal_2_08_26_08_5pm.wmv</A><br> Thursday, August 28, 2008 2:39 PM 2569751096 <A HREF="/mtcanal2/canal_2_08_27_08_11pm.wmv">canal_2_08_27_08_11pm.wmv</A><br> Wednesday, August 27, 2008 9:47 PM 260430522 <A HREF="/mtcanal2/canal_2_08_27_08_5pm.wmv">canal_2_08_27_08_5pm.wmv</A><br></pre><hr></body></html>
I would like to get the file links as an array, like:
[0]=/mtcanal2/canal_2_08_26_08_5pm.wmv
[1]=/mtcanal2/canal_2_08_27_08_11pm.wmv
[2]=/mtcanal2/canal_2_08_27_08_5pm.wmv
I created this script:
Code:
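// Fetch the directory listing page as a single string
$str = file_get_contents('http://localhost/dir');
$returnArray = array();
// Capture the HREF value and the link text of each anchor
$regex_pattern = "/<A HREF=\"(.*)\">(.*)<\/A>/";
preg_match_all($regex_pattern, $str, $returnArray);
print_r($returnArray);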
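One likely reason the pattern above does not produce the wanted list: `(.*)` is greedy, and in the sample listing all the links sit on a single line, so the first group can run from the first HREF to the last closing tag. Below is a minimal sketch of one way around that, assuming the listing always looks like the sample. It uses a negated character class so each anchor is matched separately, and it skips the parent directory link. The function name `extract_listing_links` is hypothetical and only here for illustration.

```php
<?php
// Hypothetical helper: returns the HREF values of the file links in a directory listing.
function extract_listing_links(string $html): array
{
    // [^"]* stops at the closing quote, so each anchor is matched on its own;
    // the i flag accepts both <A HREF=...> and <a href=...>.
    $pattern = '/<a\s+href="([^"]*)"[^>]*>(.*?)<\/a>/is';
    if (!preg_match_all($pattern, $html, $matches, PREG_SET_ORDER)) {
        return array();
    }
    $links = array();
    foreach ($matches as $match) {
        // Skip the "[To Parent Directory]" entry seen in the sample listing.
        if (stripos($match[2], 'To Parent Directory') !== false) {
            continue;
        }
        $links[] = $match[1];
    }
    return $links;
}

// Sample input copied from the listing above (shortened to two entries).
$html = '<pre><A HREF="/">[To Parent Directory]</A><br><br> Tuesday, August 26, 2008 5:59 PM 147345966 <A HREF="/mtcanal2/canal_2_08_26_08_5pm.wmv">canal_2_08_26_08_5pm.wmv</A><br> Thursday, August 28, 2008 2:39 PM 2569751096 <A HREF="/mtcanal2/canal_2_08_27_08_11pm.wmv">canal_2_08_27_08_11pm.wmv</A><br></pre>';

print_r(extract_listing_links($html));
```

With the shortened sample this should print only the two .wmv paths. For a real page, `$html` could come from `file_get_contents()` as in the script above. For anything beyond this fixed listing format, an HTML parser such as PHP's `DOMDocument` is probably more robust than a regular expression.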