我正在使用nltk regex提取日期和时间:
text = 'LEts have quick meeting on Wednesday at 9am'
week_day = "(monday|tuesday|wednesday|thursday|friday|saturday|sunday)"
month = "(january|february|march|april|may|june|july|august|september| \
october|november|december)"
dmy = "(year|day|week|month)"
exp2 = "(this|next|last)"
regxp2 = "(" + exp2 + " (" + dmy + "|" + week_day + "|" + month + "))"
reg2 = re.compile(regxp2, re.IGNORECASE)
found = reg2.findall(text)
found = [a[0] for a in found if len(a) > 1]
for timex in found:
timex_found.append(timex)
print timex_found
在我看来一切都是对的,但它没有标记Wednesday
任何线索?我应该做些什么改变来考虑“星期三”和“这个星期三”
威尔
^{2}$考虑我的案子?在
正则表达式正在查找
((this|next|last) (dmy|weekday|month))
。在您的输入不匹配。在
一些可行的替代方案:
相关问题 更多 >
编程相关推荐