擅长:python、mysql、java
<p>如果您不想使用urllib,而urllib可以为您这样做,那么可以使用split</p>
<pre><code>def canonical_url(u):
u = url_normalize(u)
u = url_query_cleaner(u,parameterlist = ['utm_source','utm_medium','utm_campaign','utm_term','utm_content'],remove=True)
u = u.lstrip("http://")
u = u.lstrip("https://")
u = u.lstrip("www.")
u = u.split('/')[0] # get before first slash
return u
</code></pre>