去除 html 的區塊

html code

<span class="line"><span style="color: #81A1C1"><</span><span style="color: #D8DEE9FF">div class</span><span style="color: #81A1C1">=</span><span style="color: #ECEFF4">"</span><span style="color: #A3BE8C">nav</span><span style="color: #ECEFF4">"</span><span style="color: #81A1C1">></span></span>
<span class="line"><span style="color: #D8DEE9FF">  </span><span style="color: #81A1C1"><</span><span style="color: #D8DEE9FF">p</span><span style="color: #81A1C1">></span><span style="color: #D8DEE9FF"> navbar </span><span style="color: #81A1C1"></</span><span style="color: #D8DEE9FF">p</span><span style="color: #81A1C1">></span></span>
<span class="line"><span style="color: #81A1C1"></</span><span style="color: #D8DEE9FF">div</span><span style="color: #81A1C1">></span></span>
<span class="line"><span style="color: #81A1C1"><</span><span style="color: #D8DEE9FF">div class</span><span style="color: #81A1C1">=</span><span style="color: #ECEFF4">"</span><span style="color: #A3BE8C">content</span><span style="color: #ECEFF4">"</span><span style="color: #81A1C1">></span></span>
<span class="line"><span style="color: #D8DEE9FF">  </span><span style="color: #81A1C1"><</span><span style="color: #D8DEE9FF">p</span><span style="color: #81A1C1">></span><span style="color: #D8DEE9FF"> content </span><span style="color: #81A1C1"></</span><span style="color: #D8DEE9FF">p</span><span style="color: #81A1C1">></span></span>
<span class="line"><span style="color: #81A1C1"></</span><span style="color: #D8DEE9FF">div</span><span style="color: #81A1C1">></span></span>
<span class="line"><span style="color: #D8DEE9FF">    </span></span>

如果要去除 <div class="nav">…</div> 這個區塊

<span class="line"><span style="color: #D8DEE9FF">import re</span></span>
<span class="line"></span>
<span class="line"><span style="color: #D8DEE9FF">html </span><span style="color: #81A1C1">=</span><span style="color: #D8DEE9FF"> </span><span style="color: #ECEFF4">"""</span></span>
<span class="line"><span style="color: #A3BE8C">    <div class=</span><span style="color: #ECEFF4">"</span><span style="color: #D8DEE9FF">nav</span><span style="color: #ECEFF4">"</span><span style="color: #A3BE8C">></span></span>
<span class="line"><span style="color: #A3BE8C">        <p> navbar </p></span></span>
<span class="line"><span style="color: #A3BE8C">    </div></span></span>
<span class="line"><span style="color: #A3BE8C">    <div class=</span><span style="color: #ECEFF4">"</span><span style="color: #D8DEE9FF">content</span><span style="color: #ECEFF4">"</span><span style="color: #A3BE8C">></span></span>
<span class="line"><span style="color: #A3BE8C">        <p> content </p></span></span>
<span class="line"><span style="color: #A3BE8C">    </div></span></span>
<span class="line"><span style="color: #ECEFF4">"""</span></span>
<span class="line"></span>
<span class="line"><span style="color: #D8DEE9FF">result </span><span style="color: #81A1C1">=</span><span style="color: #D8DEE9FF"> re.sub(</span><span style="color: #ECEFF4">"</span><span style="color: #A3BE8C"><div class=\</span><span style="color: #ECEFF4">"</span><span style="color: #D8DEE9FF">nav\</span><span style="color: #ECEFF4">"</span><span style="color: #A3BE8C">>[\s\S]*?<\/div></span><span style="color: #ECEFF4">"</span><span style="color: #D8DEE9FF">, </span><span style="color: #ECEFF4">""</span><span style="color: #D8DEE9FF">, html)</span></span>
<span class="line"></span>
<span class="line"><span style="color: #81A1C1">print</span><span style="color: #D8DEE9FF">(result)</span></span>
<span class="line"></span>
<span class="line"></span>

參考資料

給自己的Python小筆記 — 強大的數據處理工具 — 正則表達式 — Regular Expression — regex詳細教學

Last modified: 2025-02-25

Author

Comments

Write a Reply or Comment

Your email address will not be published.