I consider myself a decent programmer but I have yet to master regexes. I’m taking over a journalist’s website and the guys who designed the original site actually used FLAT FILES to store his columns. Not only that, but the flatfiles contain multiple columns! UGH! This guy has 100+ columns so I figure the best way is to write a parser to simply take the title and content from the files. Any help with the regex code is MORE than welcomed! I realize if I can at least get the content, I can strip the image and anchor tags out. Right now I just need to be able to grab the title and content. Below is the format:
Code: Select all
<B><A NAME=“999999999”><FONT COLOR=”#00000” SIZE=“2” FACE=“arial”>Column Title</FONT></B>
<FONT COLOR=”#000000” SIZE=“2” FACE=“arial”> <A HREF=“mailto:example@example.com”></A> <br>
<FONT COLOR=”#000000” SIZE=“2” FACE=“arial”><br>http://example.com/image.jpg>Column Content<br><br>###