regex - What regular expression can I use to find in an HTML document tables that only contain one row? -
i'm searching through html documents , trying find tables contain single row. regex can use this? i've tried negative lookahead , can isolate single row, don't see how ensure there's single <tr></tr> between <table></table> tags.
here's regex i'm working now:
<table[\w].*?<tr[\w].*?<\/tr>.*(?!.*<tr[\w])<\/table> this should not match regex:
<html> <body> <table> <tr> <td>a</td> </tr> <tr> <td>b</td> </tr> <tr> <td>c</td> </tr> <tr> <td>d</td> </tr> </table> </body> </html> this should match regex:
<html> <body> <table> <tr> <td>a</td> </tr> </table> </body> </html>
this should work: <table>(?>[^<]++|<(?!\/tr>))*<\/tr>(?>[^<]++|<(?!\/tr>))<\/table>
it looking 1 instance of </tr> between <table> , </table>.
details can found here: negative lookaround regex - 1 occurrence - java
Comments
Post a Comment