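Python regular expression for SIP URI variables?
I am using this regular expression to parse SIP (Session Initiation Protocol) URIs and extract their individual components.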
_syntax = re.compile('^(?P<scheme>[a-zA-Z][a-zA-Z0-9\+\-\.]*):' # scheme
+ '(?:(?:(?P<user>[a-zA-Z0-9\-\_\.\!\~\*\'\(\)&=\+\$,;\?\/\%]+)' # user
+ '(?::(?P<password>[^:@;\?]+))?)@)?' # password
+ '(?:(?:(?P<host>[^;\?:]*)(?::(?P<port>[\d]+))?))' # host, port
+ '(?:;(?P<params>[^\?]*))?' # parameters
+ '(?:\?(?P<headers>.*))?$') # headers
m = URI._syntax.match(value)
if m:
    self.scheme, self.user, self.password, self.host, self.port, params, headers = m.groups()
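I need to modify this expression so that it supports IPv6 and matches all the different kinds of SIP URI. The basic idea: an IPv4 address looks like 192.168.0.1, while an IPv6 address looks like 2620:0:2ef0:7070:250:60ff:fe03:32b7. Because the port number follows a ":", an IPv6 address in a SIP URI is enclosed in square brackets.
The general format is:
sip:user:password@host:port;uri-parameters?headers
Here are some examples: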
uriList = [
'sip:192.1.2.3',
'sip:123@192.1.2.3',
'sip:192.1.2.3:5060',
'sips:123@[2620:0:2ef0:7070:250:60ff:fe03:32b7]',
'sip:[2620:0:2ef0:7070:250:60ff:fe03:32b7]',
'sip:[2620:0:2ef0:7070:250:60ff:fe03:32b7]:5060',
'sips:support@voip.example.com',
'sip:22444032@voip.example.com:6000',
'sip:thks.ashwin:pass@212.123.1.213',
]
Expected output:
Scheme: sip, User: , Host: 192.1.2.3, Port:
Scheme: sip, User: 123, Host: 192.1.2.3, Port:
Scheme: sip, User: , Host: 192.1.2.3, Port: 5060
Scheme: sips, User: 123, Host: 2620:0:2ef0:7070:250:60ff:fe03:32b7, Port:
Scheme: sip, User: , Host: 2620:0:2ef0:7070:250:60ff:fe03:32b7, Port:
Scheme: sip, User: , Host: 2620:0:2ef0:7070:250:60ff:fe03:32b7, Port: 5060
Scheme: sips, User: support, Host: voip.example.com
Scheme: sip, User: 22444032, Host: voip.example.com, Port: 6000
Scheme: sip, User: thks.ashwin, Password: pass, Host: 212.123.1.213
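I tried changing the host part of the expression so that it matches both the IPv6 and the IPv4 format, but without success =´(
I have been using https://pythex.org/ to test the results.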
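The failure can be reproduced in isolation. The following is a minimal sketch that keeps only the host and port part of the pattern above and anchors it; the name `host_port` is made up for illustration. The host class `[^;\?:]*` stops at the first ":", so a bracketed IPv6 address can never be consumed in full:

```python
import re

# Only the host/port fragment of the original pattern, anchored at both ends.
# `host_port` is a hypothetical name used for this illustration.
host_port = re.compile(r'^(?P<host>[^;\?:]*)(?::(?P<port>[\d]+))?$')

for value in ('192.1.2.3:5060', '[2620:0:2ef0:7070:250:60ff:fe03:32b7]:5060'):
    m = host_port.match(value)
    # The IPv4 value matches; the bracketed IPv6 value yields None.
    print(value, '->', m.groupdict() if m else None)
```

Any fix therefore has to treat "[" ... "]" as a unit before looking for the port separator.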
1 Answer
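Your examples do not include any headers or parameters, so I don't know what they would look like. Still, you can match your example strings with the code below:
[Edit 1 - added a regex for hostname strings and support for user:password, based on the asker's new example URIs]
[Edit 2 - added regexes for parameters and headers, and more comments on the 'or' parts of the regex]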
import re
uriList = [
'sip:192.1.2.3',
'sip:123@192.1.2.3',
'sip:192.1.2.3:5060',
'sip:123@[2620:0:2ef0:7070:250:60ff:fe03:32b7]',
'sip:[2620:0:2ef0:7070:250:60ff:fe03:32b7]',
'sip:[2620:0:2ef0:7070:250:60ff:fe03:32b7]:5060',
'sips:support@voip.example.com',
'sip:22444032@voip.example.com:6000',
'sip:support:pass@212.123.1.213',
'sip:support:pass@212.123.1.213;urlparams=test',
'sip:support:pass@212.123.1.213?auth=basic',
'sip:support:pass@212.123.1.213;urlparams=test?auth=basic',
]
mPattern = re.compile(
    r'(?P<scheme>\w+):'  # Scheme
    r'(?:(?P<user>[\w\.]+):?(?P<password>[\w\.]+)?@)?'  # User:Password
    r'\[?(?P<host>'  # Begin group host
    r'(?:\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3})|'  # IPv4 address host, or
    r'(?:(?:[0-9a-fA-F]{1,4}):){7}[0-9a-fA-F]{1,4}|'  # Full (uncompressed) IPv6 address host, or
    r'(?:(?:[0-9A-Za-z]+\.)+[0-9A-Za-z]+)'  # Hostname string
    r')\]?:?'  # End group host
    r'(?P<port>\d{1,6})?'  # Port
    r'(?:\;(?P<params>[^\?]*))?'  # Parameters
    r'(?:\?(?P<headers>.*))?'  # Headers
)
groupNamesList = ['scheme', 'user', 'password', 'host', 'port', 'params', 'headers']  # Group names, in output order
for uri in uriList:  # Iterate through the list of URIs
    mObject = mPattern.search(uri)  # Pattern search
    if mObject:  # If a match is found
        # Extract the group strings, replacing unmatched (None) groups with ''
        groupStrings = [mObject.group(groupName) or '' for groupName in groupNamesList]
        print('Scheme: {0}, User: {1}, Password: {2}, Host: {3}, Port: {4}, Params: {5}, Headers: {6}'.format(*groupStrings))
The output I get is:
Scheme: sip, User: , Password: , Host: 192.1.2.3, Port: , Params: , Headers:
Scheme: sip, User: 123, Password: , Host: 192.1.2.3, Port: , Params: , Headers:
Scheme: sip, User: , Password: , Host: 192.1.2.3, Port: 5060, Params: , Headers:
Scheme: sip, User: 123, Password: , Host: 2620:0:2ef0:7070:250:60ff:fe03:32b7, Port: , Params: , Headers:
Scheme: sip, User: , Password: , Host: 2620:0:2ef0:7070:250:60ff:fe03:32b7, Port: , Params: , Headers:
Scheme: sip, User: , Password: , Host: 2620:0:2ef0:7070:250:60ff:fe03:32b7, Port: 5060, Params: , Headers:
Scheme: sips, User: support, Password: , Host: voip.example.com, Port: , Params: , Headers:
Scheme: sip, User: 22444032, Password: , Host: voip.example.com, Port: 6000, Params: , Headers:
Scheme: sip, User: support, Password: pass, Host: 212.123.1.213, Port: , Params: , Headers:
Scheme: sip, User: support, Password: pass, Host: 212.123.1.213, Port: , Params: urlparams=test, Headers:
Scheme: sip, User: support, Password: pass, Host: 212.123.1.213, Port: , Params: , Headers: auth=basic
Scheme: sip, User: support, Password: pass, Host: 212.123.1.213, Port: , Params: urlparams=test, Headers: auth=basic
Give this a try and see whether it works for you.
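One limitation worth noting: the IPv6 branch above only accepts the full eight-group form, so a compressed address such as [::1] is not recognised, and because `search` is unanchored, trailing junk is silently ignored. If that matters, a stricter variant could look like the sketch below. It is only a sketch under some assumptions: the names `SIP_URI` and `parse_sip_uri` are made up here, the character classes are deliberately loose rather than a faithful copy of the SIP grammar, and the bracketed part is handed to the standard-library `ipaddress` module for validation instead of being checked by the regex itself.

```python
import ipaddress
import re

# Hypothetical pattern name. The regex only decides where the pieces are;
# IPv6 validity is checked afterwards with the ipaddress module.
SIP_URI = re.compile(
    r'(?P<scheme>[A-Za-z][A-Za-z0-9+.-]*):'                      # scheme
    r'(?:(?P<user>[^:@;?\[\]]+)(?::(?P<password>[^:@;?]+))?@)?'  # optional user[:password]@
    r'(?:\[(?P<ipv6>[0-9A-Fa-f:.]+)\]'                           # [IPv6 literal], or
    r'|(?P<host>[^;?:@\[\]]+))'                                  # IPv4 address / hostname
    r'(?::(?P<port>\d{1,5}))?'                                   # optional :port
    r'(?:;(?P<params>[^?]*))?'                                   # optional ;parameters
    r'(?:\?(?P<headers>.*))?'                                    # optional ?headers
)


def parse_sip_uri(uri):
    """Return a dict of URI components, or None if the URI is rejected."""
    m = SIP_URI.fullmatch(uri)  # fullmatch: the whole string must fit
    if m is None:
        return None
    parts = m.groupdict()
    ipv6 = parts.pop('ipv6')
    if ipv6 is not None:
        try:
            ipaddress.IPv6Address(ipv6)  # accepts compressed forms such as ::1
        except ValueError:
            return None
        parts['host'] = ipv6  # report the address without the brackets
    if parts['port'] is not None and int(parts['port']) > 65535:
        return None
    return parts


for uri in ('sip:[::1]:5060', 'sips:support@voip.example.com'):
    print(parse_sip_uri(uri))
```

Unmatched groups come back as None rather than '', so callers can tell "no port given" apart from an empty value. Splitting the work this way keeps the regex readable, at the cost of a second validation step for IPv6.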