Consider the following links:
Code: Select all
<td><a href="http://www.geoworld.org/sp" title="">Spain</a></td>
<td><a href="http://www.geoworld.org/zi" title="">Zimbabwe</a></td>
I'd like to change them so that the link text fills in both the end of the URL and the title attribute, like this:
Code: Select all
<td><a href="http://www.geoworld.org/Spain" title="Spain">Spain</a></td>
<td><a href="http://www.geoworld.org/Zimbabwe" title="Zimbabwe">Zimbabwe</a></td>
One more detail: can you adjust it so that spaces in multi-word place names are replaced by underscores in the link, as follows? (If not, don't worry about it. I can probably fix that with a second regex that simply replaces spaces with underscores in links.)
Code: Select all
<td><a href="http://www.geoworld.org/United_Kingdom" title="United Kingdom">United Kingdom</a></td>
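For illustration, here is a minimal Python sketch of the underscore variant. It is not tied to RegExhibit. It assumes every link has the exact shape shown above (a geoworld.org URL followed by an empty title attribute), and the names LINK, fill_link and rewrite, as well as the "uk" code in the sample row, are made up for this example. A replacement function is used instead of a plain replacement string because the URL needs underscores while the title keeps its spaces.

```python
import re

# Assumed link shape: a geoworld.org URL, an empty title, then the place name.
# Group 1 captures the visible link text (the place name).
LINK = re.compile(
    r'<a href="http://www\.geoworld\.org/[^"]*" title="">([^<]+)</a>'
)

def fill_link(match):
    """Build the new link from the captured place name (hypothetical helper)."""
    name = match.group(1)
    slug = name.replace(" ", "_")  # underscores only in the URL part
    return f'<a href="http://www.geoworld.org/{slug}" title="{name}">{name}</a>'

def rewrite(html):
    """Rewrite every matching link in the given HTML text."""
    return LINK.sub(fill_link, html)

# "uk" is a made-up short code used only for this demonstration.
row = '<td><a href="http://www.geoworld.org/uk" title="">United Kingdom</a></td>'
print(rewrite(row))
```

Single-word names such as Spain pass through the same function unchanged apart from the filled-in URL and title, so a second pass for underscores should not be needed under these assumptions.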
Code: Select all
<td><a href="http://www.geoworld.org/sp" title="">Spain</a></td>
Code: Select all
<td><a href="http://www.geoworld.org/sp" title="">Spain</a>Spain2Spain3</td>
I'm experimenting with a program called RegExhibit and have learned that I can match everything between the tags with this regular expression: title="".*>
However, I don't have a clue about manipulating the data I've matched.
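The usual way to manipulate matched text is to wrap the part you want to reuse in parentheses (a capture group) and refer to it in the replacement as \1. The sketch below shows the idea in Python rather than RegExhibit, whose replacement syntax may differ. The pattern is an assumption based on the sample links, and it uses [^"]* and [^<]+ rather than .* because .* is greedy and would run to the last > on the line.

```python
import re

before = '<td><a href="http://www.geoworld.org/sp" title="">Spain</a></td>'

# ([^<]+) captures the link text; everything else is matched literally,
# except the short code after the last slash, which [^"]* skips over.
pattern = r'href="http://www\.geoworld\.org/[^"]*" title="">([^<]+)<'

# \1 inserts the captured link text three times: URL, title, visible text.
replacement = r'href="http://www.geoworld.org/\1" title="\1">\1<'

after = re.sub(pattern, replacement, before)
print(after)
```

A plain \1 replacement like this copies the name verbatim, so it handles Spain and Zimbabwe but leaves spaces in the URL for multi-word names.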
Thanks!