tzixiang
August 27th, 2007, 07:58 AM
Hi All! I'm currently stuck with doing my school project and need some help.
I need to split the html code into 5 parts,
Authors, Year, Title, Journal, Page
example of the HTML code is as follows:
<br><br>Chua Eng Huang, Cecil, Sandeep Purao, Veda C. Storey, 2006, Developing Maintainable Software: The READABLE Approach, <i>Decision Support Systems (Netherlands)</i>, Vol. 42, No. 1, pp 469 - 491.<br><br>Chua Eng Huang, Cecil, Huoy Min Khoo, Detmar W. Straub, Savitha Kadiyala, David Kuechler, 2005, The Evolution of e-Commerce Research: A Stakeholder Perspective, <i>Journal of Electronic Commerce Research (United States)</i>, Vol. 6, No. 4, pp 262 - 280.
I manage to do a Split with "<br><br> " to separate the journal papers but are having problems now spliting them further into the 5 parts started above. Truncate commands can't be used because there is no standard length for the different parts.
The criteria i was looking at is to to extract 4 numbers XXXX to identify the year, the big chunk to the left of the year will be the authors, the last part will be split into 3 by taking out the part in italics as the journal, the part to the left of the journal is the title and the part on the right is the page.
Is there a better way to do it? Because i'm struggling with coding the logic i mentioned in the above paragraph and if thats the only way. Can some helpful soul provide some advice or aid to break the html code into 5 parts?
Thanks...
I need to split the html code into 5 parts,
Authors, Year, Title, Journal, Page
example of the HTML code is as follows:
<br><br>Chua Eng Huang, Cecil, Sandeep Purao, Veda C. Storey, 2006, Developing Maintainable Software: The READABLE Approach, <i>Decision Support Systems (Netherlands)</i>, Vol. 42, No. 1, pp 469 - 491.<br><br>Chua Eng Huang, Cecil, Huoy Min Khoo, Detmar W. Straub, Savitha Kadiyala, David Kuechler, 2005, The Evolution of e-Commerce Research: A Stakeholder Perspective, <i>Journal of Electronic Commerce Research (United States)</i>, Vol. 6, No. 4, pp 262 - 280.
I manage to do a Split with "<br><br> " to separate the journal papers but are having problems now spliting them further into the 5 parts started above. Truncate commands can't be used because there is no standard length for the different parts.
The criteria i was looking at is to to extract 4 numbers XXXX to identify the year, the big chunk to the left of the year will be the authors, the last part will be split into 3 by taking out the part in italics as the journal, the part to the left of the journal is the title and the part on the right is the page.
Is there a better way to do it? Because i'm struggling with coding the logic i mentioned in the above paragraph and if thats the only way. Can some helpful soul provide some advice or aid to break the html code into 5 parts?
Thanks...