Need to pull certain data from a file and insert into mysql
Posted: Sun Mar 25, 2007 6:55 pm
feyd | Please use
feyd | Please use[/syntax]
Code: Select all
,Code: Select all
and [syntax="..."] tags where appropriate when posting code. Your post has been edited to reflect how we'd like it posted. Please read: [url=http://forums.devnetwork.net/viewtopic.php?t=21171]Posting Code in the Forums[/url] to learn how to do it too.[/color]
Hey essentially my boss gave me 50,000 or so files that he needs to be scraped in a sense.
In each html file there is a form that has information whihc needs to be scraped. I've attached a copy of the information from the form as a Sample. Basically all the inputs need to be copied and put into a MySQL database as a single record. IE FName, LName, Company, City, State, Country, PostalCode etc.
I figure i'd obvously need to open the file and read it but how do I isolate the inputs etc?
Thanks in advance.
[syntax="html"]<table width="100%" border="0" align="center">
<tr>
<td><span class="style49">First Name:</span></td>
<td><span class="style17"></span></td>
<td><input name="FName" type="text" class="style8" value="JOHN" size="30"></td>
</tr>
<tr>
<td><span class="style49">Last Name:</span></td>
<td><span class="style17"></span></td>
<td><input name="LName" type="text" class="style8" value="DOE" size="30"></td>
</tr>
<tr>
<td><span class="style49">Company:</span></td>
<td><span class="style17"></span></td>
<td><input name="Company" type="text" class="style8" value="" size="30"></td>
</tr>
<tr>
<td><span class="style49">Address:</span></td>
<td><span class="style17"></span></td>
<td><span class="style49">
<textarea name="Address" cols="30" rows="3" >1010 UNIVERSITY</textarea>
</span></td>
</tr>
<tr>
<td><span class="style49">City:</span></td>
<td><span class="style17"></span></td>
<td><input name="City" type="text" class="style8" value="SAN DIEGO" size="30"></td>
</tr>
<tr>
<td><span class="style49">State/ Region:</span></td>
<td><span class="style17"></span></td>
<td><span class="style49">
<select name="State">
<option value="" ></option>
<option value="AK" >Alaska</option>
<option value="AL" >Alabama</option>
<option value="AR" >Arkansas</option>
<option value="AZ" >Arizona</option>
<option value="CA"selected>California</option>
<option value="CO" >Colorado</option>
<option value="CT" >Connecticutt</option>
<option value="DC" >Washington DC</option>
<option value="DE" >Delaware</option>
<option value="FL" >Florida</option>
<option value="GA" >Georgia</option>
<option value="HI" >Hawaii</option>
<option value="IA" >Iowa</option>
<option value="ID" >Idaho</option>
<option value="IL" >Illinois</option>
<option value="IN" >Indiana</option>
<option value="KS" >Kansas</option>
<option value="KY" >Kentucky</option>
<option value="LA" >Louisiana</option>
<option value="MA" >Massachusetts</option>
<option value="MD" >Maryland</option>
<option value="ME" >Maine</option>
<option value="MI" >Michigan</option>
<option value="MN" >Minnesota</option>
<option value="MO" >Missouri</option>
<option value="MS" >Mississippi</option>
<option value="MT" >Montana</option>
<option value="NC" >North Carolina</option>
<option value="ND" >North Dakota</option>
<option value="NE" >Nebraska</option>
<option value="NH" >New Hampshire</option>
<option value="NJ" >New Jersey</option>
<option value="NM" >New Mexico</option>
<option value="NV" >Nevada</option>
<option value="NY" >New York</option>
<option value="OH" >Ohio</option>
<option value="OK" >Oklahoma</option>
<option value="OR" >Oregon</option>
<option value="PA" >Pennsylvania</option>
<option value="PR" >Puerto Rico</option>
<option value="RI" >Rhode Island</option>
<option value="SC" >South Carolina</option>
<option value="SD" >South Dakota</option>
<option value="TN" >Tennessee</option>
<option value="TX" >Texas</option>
<option value="UT" >Utah</option>
<option value="VA" >Virginia</option>
<option value="VT" >Vermont</option>
<option value="WA" >Washington</option>
<option value="WI" >Wisconsin</option>
<option value="WV" >West Virginia</option>
<option value="WY" >Wyoming</option>
</select>
or<br>
<input name="Region" type="text" class="style8" id="Region" size="30">
</span> </td>
</tr>
<tr>
<td><span class="style49">Zip/ Postal Code:</span></td>
<td><span class="style17"></span></td>
<td><input name="PostalCode" type="text" class="style8" value="92103" size="30"></td>
</tr>
<tr>
<td><span class="style49">Country:</span></td>
<td><span class="style17"></span></td>
<td><span class="style49">
<select name="CountryRegion">
<option value="<To be selected>" selected><To be selected></option>
<option value="AFGHANISTAN" >AFGHANISTAN</option>
<option value="ALBANIA" >ALBANIA</option>
<option value="ALGERIA" >ALGERIA</option>
<option value="AMERICAN SAMOA" >AMERICAN SAMOA</option>
<option value="ANDORRA" >ANDORRA</option>
<option value="ANGOLA" >ANGOLA</option>
<option value="ANGUILLA" >ANGUILLA</option>
<option value="ANTARCTICA" >ANTARCTICA</option>
<option value="ANTIGUA AND BARBUDA" >ANTIGUA AND BARBUDA</option>
<option value="ARGENTINA" >ARGENTINA</option>
<option value="ARMENIA" >ARMENIA</option>
<option value="ARUBA" >ARUBA</option>
<option value="AUSTRALIA" >AUSTRALIA</option>
<option value="AUSTRIA" >AUSTRIA</option>
<option value="AZERBAIJAN" >AZERBAIJAN</option>
<option value="BAHAMAS" >BAHAMAS</option>
<option value="BAHRAIN" >BAHRAIN</option>
<option value="BANGLADESH" >BANGLADESH</option>
<option value="BARBADOS" >BARBADOS</option>
<option value="BELARUS" >BELARUS</option>
<option value="BELGIUM" >BELGIUM</option>
<option value="BELIZE" >BELIZE</option>
<option value="BENIN" >BENIN</option>
<option value="BERMUDA" >BERMUDA</option>
<option value="BHUTAN" >BHUTAN</option>
<option value="BOLIVIA" >BOLIVIA</option>
<option value="BOSNIA AND HERZEGOVINA" >BOSNIA AND HERZEGOVINA</option>
<option value="BOTSWANA" >BOTSWANA</option>
<option value="BOUVET ISLAND" >BOUVET ISLAND</option>
<option value="BRAZIL" >BRAZIL</option>
<option value="BRUNEI DARUSSALAM" >BRUNEI DARUSSALAM</option>
<option value="BULGARIA" >BULGARIA</option>
<option value="BURKINA FASO" >BURKINA FASO</option>
<option value="BURMA" >BURMA</option>
<option value="BURUNDI" >BURUNDI</option>
<option value="BYELORUSSION SSR" >BYELORUSSION SSR</option>
<option value="CAMBODIA" >CAMBODIA</option>
<option value="CAMEROON" >CAMEROON</option>
<option value="CANADA" >CANADA</option>
<option value="CANTON AND ENDERBURY ISLANDS" >CANTON AND ENDERBURY ISLANDS</option>
<option value="CAPE VERDE" >CAPE VERDE</option>
<option value="CAYMAN ISLANDS" >CAYMAN ISLANDS</option>
<option value="CENTRAL AFRICAN REPUBLIC" >CENTRAL AFRICAN REPUBLIC</option>
<option value="CHAD" >CHAD</option>
<option value="CHILE" >CHILE</option>
<option value="CHINA" >CHINA</option>
<option value="CHRISTMAS ISLAND" >CHRISTMAS ISLAND</option>
<option value="COCOS (KEELING) ISLANDS" >COCOS (KEELING) ISLANDS</option>
<option value="COLOMBIA" >COLOMBIA</option>
<option value="COMOROS" >COMOROS</option>
<option value="CONGO" >CONGO</option>
<option value="COOK ISLANDS" >COOK ISLANDS</option>
<option value="COSTA RICA" >COSTA RICA</option>
<option value="CROATIA" >CROATIA</option>
<option value="CUBA" >CUBA</option>
<option value="CYPRUS" >CYPRUS</option>
<option value="CZECHOSLOVAKIA" >CZECHOSLOVAKIA</option>
<option value="DENMARK" >DENMARK</option>
<option value="DJIBOUTI" >DJIBOUTI</option>
<option value="DOMINICA" >DOMINICA</option>
<option value="DOMINICAN REPUBLIC" >DOMINICAN REPUBLIC</option>
<option value="DRONNING MAUD LAND" >DRONNING MAUD LAND</option>
<option value="EAST TIMOR" >EAST TIMOR</option>
<option value="ECUADOR" >ECUADOR</option>
<option value="EGYPT" >EGYPT</option>
<option value="EL SALVADOR" >EL SALVADOR</option>
<option value="EQUATORIAL GUINEA" >EQUATORIAL GUINEA</option>
<option value="ERITREA" >ERITREA</option>
<option value="ESTONIA" >ESTONIA</option>
<option value="ETHIOPIA" >ETHIOPIA</option>
<option value="EUROPEAN UNION" >EUROPEAN UNION</option>
<option value="FAEROE ISLANDS" >FAEROE ISLANDS</option>
<option value="FALKLAND ISLANDS (MALVINAS)" >FALKLAND ISLANDS (MALVINAS)</option>
<option value="FIJI" >FIJI</option>
<option value="FINLAND" >FINLAND</option>
<option value="FRANCE" >FRANCE</option>
<option value="FRENCH GUIANA" >FRENCH GUIANA</option>
<option value="FRENCH POLYNESIA" >FRENCH POLYNESIA</option>
<option value="GABON" >GABON</option>
<option value="GAMBIA" >GAMBIA</option>
<option value="GEORGIA" >GEORGIA</option>
<option value="GERMANY" >GERMANY</option>
<option value="GHANA" >GHANA</option>
<option value="GIBRALTAR" >GIBRALTAR</option>
<option value="GREECE" >GREECE</option>
<option value="GREENLAND" >GREENLAND</option>
<option value="GRENADA" >GRENADA</option>
<option value="GUADELOUPE" >GUADELOUPE</option>
<option value="GUAM" >GUAM</option>
<option value="GUATEMALA" >GUATEMALA</option>
<option value="GUINEA" >GUINEA</option>
<option value="GUINEA-BISSAU" >GUINEA-BISSAU</option>
<option value="GUYANA" >GUYANA</option>
<option value="HAITI" >HAITI</option>
<option value="HEARD AND MCDONALD ISLANDS" >HEARD AND MCDONALD ISLANDS</option>
<option value="HONDURAS" >HONDURAS</option>
<option value="HONG KONG" >HONG KONG</option>
<option value="HUNGARY" >HUNGARY</option>
<option value="ICELAND" >ICELAND</option>
<option value="INDIA" >INDIA</option>
<option value="INDONESIA" >INDONESIA</option>
<option value="IRAN" >IRAN</option>
<option value="IRAQ" >IRAQ</option>
<option value="IRELAND" >IRELAND</option>
<option value="ISRAEL" >ISRAEL</option>
<option value="ITALIA" >ITALIA</option>
<option value="IVORY COAST" >IVORY COAST</option>
<option value="JAMAICA" >JAMAICA</option>
<option value="JAPAN" >JAPAN</option>
<option value="JOHNSTON ISLAND" >JOHNSTON ISLAND</option>
<option value="JORDAN" >JORDAN</option>
<option value="KAMPUCHEA" >KAMPUCHEA</option>
<option value="KAZAKSTAN" >KAZAKSTAN</option>
<option value="KENYA" >KENYA</option>
<option value="KIRIBATI" >KIRIBATI</option>
<option value="KOREA" >KOREA</option>
<option value="KUWAIT" >KUWAIT</option>
<option value="KYRGYZSTAN" >KYRGYZSTAN</option>
<option value="LAO PEOPLES DEMOCRATIC REPUBLIC" >LAO PEOPLES DEMOCRATIC REPUBLIC</option>
<option value="LATVIA" >LATVIA</option>
<option value="LEBANON" >LEBANON</option>
<option value="LESOTHO" >LESOTHO</option>
<option value="LIBERIA" >LIBERIA</option>
<option value="LIBYAN ARAB JAMAHIRIYA" >LIBYAN ARAB JAMAHIRIYA</option>
<option value="LIECHTENSTEIN" >LIECHTENSTEIN</option>
<option value="LITHUANIA" >LITHUANIA</option>
<option value="LUXEMBOURG" >LUXEMBOURG</option>
<option value="MACAU" >MACAU</option>
<option value="MACEDONIA" >MACEDONIA</option>
<option value="MADAGASCAR" >MADAGASCAR</option>
<option value="MALAWI" >MALAWI</option>
<option value="MALAYSIA" >MALAYSIA</option>
<option value="MALDIVES" >MALDIVES</option>
<option value="MALI" >MALI</option>
<option value="MALTA" >MALTA</option>
<option value="MARSHALL ISLANDS" >MARSHALL ISLANDS</option>
<option value="MARTINIQUE" >MARTINIQUE</option>
<option value="MAURITANIA" >MAURITANIA</option>
<option value="MAURITIUS" >MAURITIUS</option>
<option value="MAYOTTE" >MAYOTTE</option>
<option value="MEXICO" >MEXICO</option>
<option value="MICRONESIA, FEDERATED STATES OF" >MICRONESIA, FEDERATED STATES OF</option>
<option value="MIDWAY ISLANDS" >MIDWAY ISLANDS</option>
<option value="MOLDOVA, REPUBLIC OF" >MOLDOVA, REPUBLIC OF</option>
<option value="MONACO" >MONACO</option>
<option value="MONGOLIA" >MONGOLIA</option>
<option value="MONTSERRAT" >MONTSERRAT</option>
<option value="MOROCCO" >MOROCCO</option>
<option value="MOZAMBIQUE" >MOZAMBIQUE</option>
<option value="MYANMAR" >MYANMAR</option>
<option value="NAMIBIA" >NAMIBIA</option>
<option value="NAURU" >NAURU</option>
<option value="NEPAL" >NEPAL</option>
<option value="NETHERLANDS" >NETHERLANDS</option>
<option value="NETHERLANDS ANTILLES" >NETHERLANDS ANTILLES</option>
<option value="NEW CALEDONIA" >NEW CALEDONIA</option>
<option value="NEW ZEALAND" >NEW ZEALAND</option>
<option value="NICARAGUA" >NICARAGUA</option>
<option value="NIGER" >NIGER</option>
<option value="NIGERIA" >NIGERIA</option>
<option value="NIUE" >NIUE</option>
<option value="NORFOLK ISLAND" >NORFOLK ISLAND</option>
<option value="NORTH KOREA" >NORTH KOREA</option>
<option value="NORWAY" >NORWAY</option>
<option value="OMAN" >OMAN</option>
<option value="PACIFIC ISLANDS (TRUSH TERRITORY)" >PACIFIC ISLANDS (TRUSH TERRITORY)</option>
<option value="PAKISTAN" >PAKISTAN</option>
<option value="PALAU" >PALAU</option>
<option value="PALESTINIAN TERRITORY" >PALESTINIAN TERRITORY</option>
<option value="PANAMA" >PANAMA</option>
<option value="PAPUA NEW GUINEA" >PAPUA NEW GUINEA</option>
<option value="PARAGUAY" >PARAGUAY</option>
<option value="PERU" >PERU</option>
<option value="PHILIPPINES" >PHILIPPINES</option>
<option value="PITCAIRN ISLAND" >PITCAIRN ISLAND</option>
<option value="POLAND" >POLAND</option>
<option value="PORTUGAL" >PORTUGAL</option>
<option value="PUERTO RICO" >PUERTO RICO</option>
<option value="QATAR" >QATAR</option>
<option value="REUNION" >REUNION</option>
<option value="ROMANIA" >ROMANIA</option>
<option value="RUSSIAN FEDERATION" >RUSSIAN FEDERATION</option>
<option value="RWANDA" >RWANDA</option>
<option value="SAMOA" >SAMOA</option>
<option value="SAN MARINO" >SAN MARINO</option>
<option value="SAO TOME AND PRINCIPE" >SAO TOME AND PRINCIPE</option>
<option value="SAUDI ARABIA" >SAUDI ARABIA</option>
<option value="SENEGAL" >SENEGAL</option>
<option value="SEYCHELLES" >SEYCHELLES</option>
<option value="SIERRA LEONE" >SIERRA LEONE</option>
<option value="SINGAPORE" >SINGAPORE</option>
<option value="SLOVAKIA" >SLOVAKIA</option>
<option value="SLOVENIA" >SLOVENIA</option>
<option value="SOLOMON ISLANDS" >SOLOMON ISLANDS</option>
<option value="SOMALIA" >SOMALIA</option>
<option value="SOUTH AFRICA" >SOUTH AFRICA</option>
<option value="SPAIN" >SPAIN</option>
<option value="SRI LANKA" >SRI LANKA</option>
<option value="ST. HELENA" >ST. HELENA</option>
<option value="ST. KITTS-NEVIS-ANGUILLA" >ST. KITTS-NEVIS-ANGUILLA</option>
<option value="ST. LUCIA" >ST. LUCIA</option>
<option value="ST. PIERRE AND MIQUELON" >ST. PIERRE AND MIQUELON</option>
<option value="ST. VINCENT AND THE GRENADINES" >ST. VINCENT AND THE GRENADINES</option>
<option value="SUDAN" >SUDAN</option>
<option value="SURINAME" >SURINAME</option>
<option value="SWAZILAND" >SWAZILAND</option>
<option value="SWEDEN" >SWEDEN</option>
<option value="SWITZERLAND" >SWITZERLAND</option>
<option value="SYRIAN ARAB REPUBLIC" >SYRIAN ARAB REPUBLIC</option>
<option value="TAIWAN" >TAIWAN</option>
<option value="TAJIKISTAN" >TAJIKISTAN</option>
<option value="TANZANIA, UNITED REPUBLIC OF" >TANZANIA, UNITED REPUBLIC OF</option>
<option value="THAILAND" >THAILAND</option>
<option value="TOGO" >TOGO</option>
<option value="TOKELAU" >TOKELAU</option>
<option value="TONGA" >TONGA</option>
<option value="TRINIDAD AND TOBAGO" >TRINIDAD AND TOBAGO</option>
<option value="TUNISIA" >TUNISIA</option>
<option value="TURKEY" >TURKEY</option>
<option value="TURKMENISTAN" >TURKMENISTAN</option>
<option value="TURKS AND CAICOS ISLANDS" >TURKS AND CAICOS ISLANDS</option>
<option value="TUVALU" >TUVALU</option>
<option value="U.S. VIRGIN ISLANDS" >U.S. VIRGIN ISLANDS</option>
<option value="UGANDA" >UGANDA</option>
<option value="UKRAINE" >UKRAINE</option>
<option value="UNITED ARAB EMIRATES" >UNITED ARAB EMIRATES</option>
<option value="UNITED KINGDOM" >UNITED KINGDOM</option>
<option value="UNITED STATES" >UNITED STATES</option>
<option value="UPPER VOLTA" >UPPER VOLTA</option>
<option value="URUGUAY" >URUGUAY</option>
<option value="UZBEKISTAN" >UZBEKISTAN</option>
<option value="VANUATU" >VANUATU</option>
<option value="VATICAN CITY STATE" >VATICAN CITY STATE</option>
<option value="VENEZUELA" >VENEZUELA</option>
<option value="VIETNAM" >VIETNAM</option>
<option value="VIRGIN ISLANDS, BRITISH" >VIRGIN ISLANDS, BRITISH</option>
<option value="WAKE ISLAND" >WAKE ISLAND</option>
<option value="WALLIS AND FUTUNA ISLANDS" >WALLIS AND FUTUNA ISLANDS</option>
<option value="WESTERN SAHARA" >WESTERN SAHARA</option>
<option value="YEMEN" >YEMEN</option>
<option value="YEMEN, DEMOCRATIC" >YEMEN, DEMOCRATIC</option>
<option value="YUGOSLAVIA" >YUGOSLAVIA</option>
<option value="ZAIRE" >ZAIRE</option>
<option value="ZAMBIA" >ZAMBIA</option>
<option value="ZIMBABWE" >ZIMBABWE</option>
</select>
</span></td>
</tr>
<tr>
<td><span class="style49">Phone:</span></td>
<td><span class="style17"></span></td>
<td><input name="Phone" type="text" class="style8" value="(303) 809-3929" size="30"></td>
</tr>
<tr>
<td><span class="style49">Email:</span></td>
<td><span class="style17"></span></td>
<td><input name="Email" type="text" class="style8" value="cdown@cox.net" size="30"></td>
</tr>
<tr>
<td><span class="style49">Birthdate:</span></td>
<td><span class="style17"></span></td>
<td><input name="Bdate" type="text" class="style8" value="" size="30"></td>
</tr>
<tr>
<td><span class="style17"></span></td>
<td><span class="style17"></span></td>
<td><input name="ID" type="hidden" class="style8" value="1"></td>
</tr>
</table>
feyd | Please use[/syntax]
Code: Select all
,Code: Select all
and [syntax="..."] tags where appropriate when posting code. Your post has been edited to reflect how we'd like it posted. Please read: [url=http://forums.devnetwork.net/viewtopic.php?t=21171]Posting Code in the Forums[/url] to learn how to do it too.[/color]