c# - Getting rid of unnecessary text Regex -
alright having trouble this: have string looks this
<a class="l _hid" href="http://www.cnbc.com/2016/07/28/royal-dutch-shell-second-quarter-net-profit-comes-in-at-118-billion.html" onmousedown="return rwt(this,'','','','1','afqjcnhzdjmd9kznazjkrec-famndp8ujw','mb_qkv1zfbnfljbx-jncea','0ahukewiwmbpoljboahvj6xqkht1qdfkqqqiigigamaa','','',event)">shell sees quarterly profits plummet 70% low <em>oil price</em> bites</a>
which, simplified, looks essentially:
<a class="l _hid" href="link" onmousedown="some gibberish">the string need <em>i guessing searched</em> bites</a>
it have been helpful if new how achieved. form of appreciated.
thanks in advance.
indeed link post answered questions:
regex match open tags except xhtml self-contained tags
it looks parsing html regex isn't best idea.
Comments
Post a Comment