tim和REgex摘录

2024-05-14 09:49:21 发布

您现在位置:Python中文网/ 问答频道 /正文

我正在使用nltk regex提取日期和时间:

text = 'LEts have quick meeting on Wednesday at 9am'
week_day = "(monday|tuesday|wednesday|thursday|friday|saturday|sunday)"
month = "(january|february|march|april|may|june|july|august|september| \
          october|november|december)"
dmy = "(year|day|week|month)"
exp2 = "(this|next|last)"
regxp2 = "(" + exp2 + " (" + dmy + "|" + week_day + "|" + month + "))"
reg2 = re.compile(regxp2, re.IGNORECASE)
found = reg2.findall(text)
found = [a[0] for a in found if len(a) > 1]
for timex in found:
    timex_found.append(timex)

print timex_found

在我看来一切都是对的,但它没有标记Wednesday任何线索?我应该做些什么改变来考虑“星期三”和“这个星期三”

威尔

^{2}$

考虑我的案子?在


Tags: textinreforweektimexnltkday

热门问题