Parsing html page with russian content using regex
Posted: Tue Feb 07, 2006 7:37 am
Hey guys,
I am trying to parse a russian html file from a russian webpage. I am using curl. I am supposed to get some values from some set fields:
Like
Product Id: 400212
Product Type: engine
Here both the labels product Id and product price are in russian. I need to extract their values.
If the content was in english I could have done it without a problem but I am just not sure how to handle foreign languages? Any one has done this before?
Thanks!
I am trying to parse a russian html file from a russian webpage. I am using curl. I am supposed to get some values from some set fields:
Like
Product Id: 400212
Product Type: engine
Here both the labels product Id and product price are in russian. I need to extract their values.
If the content was in english I could have done it without a problem but I am just not sure how to handle foreign languages? Any one has done this before?
Thanks!