Hello pone,
thanks for your replies!
And yes, you're right, there's no "until", it's a do/while loop...
Nice idea to do it with regexes, but it's not as easy as in my example, which I tried to simplify (too much, I guess...)
The actual output looks like this:
.
.
<tr class="significant">
<td class="relevance text-center">
<div class="bar" style="width:100%" title="100%"></div>
</td>
<td class="text-center">
<a href="[some link]"><img src="[some pic]" width="9px" height="10px"></a>
</td>
<td><a href="http://[some album url]">Post Card</a></td>
<td>Mary Hopkin</td>
<td>Toshiba EMI</td>
<td>1969</td>
<td>Pop/Rock</td>
</tr>
.
.
From the first
only "100%" is needed,
the second | must be discarded,
from the 3rd I need the URL and the album title "Post Card"
and the remaining 4 I need completely, so my result string is:
"100|[some album url]|Post Card|Mary Hopkin|Toshiba EMI|1969|Pop/Rock"
(I can't specify the name of the web site this script is intended for (but it's not difficult to guess...), since it's somehow "uncorrect" to run scripts against their web site. But lately they changed their code and so there's work to do...)
By now, I'm able to collect all the information needed for the indexpage, but my do/while loop won't work. Maybe I have to reconstruct it to find a proper "while" condition.
.sundance.
|