LMPX.COM |
Home | Linux | Mysql | PHP | XML | ||
|
|
|||
From: Wiggins d'Anconia Date: Tue Apr 8 07:44:38 2008 Subject: Re: Extracting TD's from a Text File (Regex Help).
sara.samsara@gmail.com wrote: > ################ TEXT FILE ################## > <td class="PhorumTableRowAlt thread" style="padding-left: 0px"> > > <a href="http://mysite.com/link/here_goes?id=239">LINK</a> > > <span class="PhorumNewFlag"></span></td> > > <td class="PhorumTableRowAlt" nowrap="nowrap" width="150"> > <a href="http://mysite.com/link/here_goes?id=239">LINK</a> </td> > <td class="PhorumTableRowAlt PhorumSmallFont" nowrap="nowrap" width="150">06/11/2007 12:29AM > </td> > </tr> > ############################################ > > The text file contains hundreds of tds structure like above. All I need is to extract the td with class "PhorumTableRowAlt thread". I have tried every possible option, but finally I am coming to you for any Regex for it? TIA. > > HERE IS WHAT I AM DOING: > > pen(TXT, "links.txt") or die "Unable to open file"; > my @links = <TXT>; > close (TXT); > foreach my $link(@links) { > if ($link =~ m|<td class="PhorumTableRow thread" style="padding-left: 0px">(.*?)</td>|gsi) { > print "$1";} > } > > > > But NOTHING coming up. No results. > > Thanks for any help. > > Sara. > Parsing HTML with regexes is just a bad idea. Try a module from CPAN, I've had good luck with HTML::TokeParser::Simple, http://search.cpan.org/perldoc?HTML::TokeParser::Simple http://danconia.org
| Navigate in group perl.beginners.cgi at sever nntp.perl.org | |
| Previous | Next |
| © No Copyright You are free to use Anything |
Site Maintained by PHP Developer
Powered By PHP Consultants |