在Python中，如何将url字符串拆分成不同的部分？

网友

1楼 · 编辑于 2024-05-28 19:27:51

如果这是URL解析的范围，那么Python的内置rpartition将完成以下工作：

>>> URL = "http://example.com/random/folder/path.html"
>>> Segments = URL.rpartition('/')
>>> Segments[0]
'http://example.com/random/folder'
>>> Segments[2]
'path.html'

来自Pydoc，str.r部分：

Splits the string at the last occurrence of sep, and returns a 3-tuple containing the part before the separator, the separator itself, and the part after the separator. If the separator is not found, return a 3-tuple containing two empty strings, followed by the string itself

这意味着rpartition会搜索您，并在指定字符的最后一个（最右）出现处拆分字符串（在本例中为/）。它返回一个元组，其中包含：

(everything to the left of char , the character itself , everything to the right of char)

网友

2楼 · 编辑于 2024-05-28 19:27:51

python 2.x中的urlparse模块（或者python 3.x中的urllib.parse）将是实现这一点的方法。

>>> from urllib.parse import urlparse
>>> url = 'http://example.com/random/folder/path.html'
>>> parse_object = urlparse(url)
>>> parse_object.netloc
'example.com'
>>> parse_object.path
'/random/folder/path.html'
>>> parse_object.scheme
'http'
>>>

如果要对url下的文件路径执行更多操作，可以使用posixpath模块：

>>> from posixpath import basename, dirname
>>> basename(parse_object.path)
'path.html'
>>> dirname(parse_object.path)
'/random/folder'

之后，可以使用posixpath.join将这些部分粘合在一起。

编辑：我完全忘了windows用户会被os.path中的路径分隔符阻塞。我阅读了posixpath模块的文档，它对URL操作有一个特殊的引用，所以一切都很好。

网友

3楼 · 编辑于 2024-05-28 19:27:51

我没有使用Python的经验，但是我找到了urlparse module，它应该可以完成这项工作。

相关问题更多 >

编程相关推荐

热门问题

热门文章