回答此问题可获得 20 贡献值,回答如果被采纳可获得 50 分。
<p>我试图匹配转换成字符串的html文本。但是我的<em>正则表达式</em>都不起作用</p>
<p>我正在尝试匹配的Html文本:</p>
<pre><code>"[<span class="instancename">CLASS 8B PHY | TUE | 9AM to 9:40AM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8AB 2ND LG (HINDI) | TUE | 10AM to 10:40AM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8AB 2ND LG (BENGALI) | TUE | 10AM to 10:40AM <span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8AB 2ND LG (NEPALI) | TUE | 10AM to 10:40AM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8B GEOG | TUE | 11AM to 11:40AM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8B BIO | TUE | 12NOON to 12:40PM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8AB CP APP | TUE | 5PM to 5:40PM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8AB CM APP | TUE | 5PM to 5:40PM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>]"
</code></pre>
<p>我想匹配的句子是:</p>
<ol>
<li><p><code>CLASS 8B PHY | TUE | 9AM to 9:40AM</code></p>
</li>
<li><p><code>CLASS 8AB 2ND LG (HINDI) | TUE | 10AM to 10:40AM</code></p>
</li>
<li><p><code>CLASS 8B GEOG | TUE | 11AM to 11:40AM</code></p>
</li>
</ol>
<p>在上面提供的html文本中还有更多内容</p>
<p>我用来匹配这些的代码似乎不起作用:</p>
<pre><code>import re
html_text = [<span class="instancename">CLASS 8B PHY | TUE | 9AM to 9:40AM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8AB 2ND LG (HINDI) | TUE | 10AM to 10:40AM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8AB 2ND LG (BENGALI) | TUE | 10AM to 10:40AM <span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8AB 2ND LG (NEPALI) | TUE | 10AM to 10:40AM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8B GEOG | TUE | 11AM to 11:40AM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8B BIO | TUE | 12NOON to 12:40PM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8AB CP APP | TUE | 5PM to 5:40PM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>, <span class="instancename">CLASS 8AB CM APP | TUE | 5PM to 5:40PM<span class="accesshide"> BigBlueButtonBN</span></span>, <span class="accesshide"> BigBlueButtonBN</span>]
regex = re.compile(r'^[CLASS]*[M]')
match = regex.findall(str(html_text))
print(match)
</code></pre>
<p>我认为我没有提供合适的<em>正则表达式</em></p>