Extracting the passing year from a resume

def get_passingyear(self, text, education):
    text_lines = text.splitlines()
    passing_year = []
    for line in text_lines:
        for degree in education:
            if degree in line:
                year = re.findall('\b(19|20)\d{2}\b', text)
                p_year = {}
                if len(year) > 1:
                    year = '-'.join(year)
                    p_year[degree]= year
                    break
                else:
                    p_year[degree]= year
                    break

1 Answer

User

#1 · Posted on 2024-04-26 05:47:31

You can follow the EAFP principle (easier to ask forgiveness than permission) and try the datetime module:

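import datetime
import re

....

        if degree in line:
            # Raw string so \b is a word boundary rather than a backspace;
            # non-capturing group so findall returns the full four-digit year.
            years = []
            for candidate in re.findall(r'\b(?:19|20)\d{2}\b', text):
                try:
                    # Try to make a date out of it
                    datetime.date(int(candidate), 1, 1)
                except ValueError:
                    # if it is not a valid year, you can treat it here
                    continue
                years.append(candidate)

            ....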

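This way you do not risk ending up with something that is not a year. If all the dates in these files follow a fixed pattern, you can use strptime from the datetime module to parse the date from that pattern.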
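
As a rough illustration of both ideas, here is a minimal sketch, not the asker's actual code. The names YEAR_RE, get_passing_years and parse_month_year, the sample resume text, and the "%m/%Y" pattern are all made up for this example. It also assumes the year appears on the same line as the degree keyword, which may not hold for every resume.

```python
import re
from datetime import datetime

# Raw string plus a non-capturing group: findall returns the full
# four-digit year instead of only the "19" or "20" prefix.
YEAR_RE = re.compile(r"\b(?:19|20)\d{2}\b")


def get_passing_years(text, education):
    """Map each degree keyword to the years found on the line mentioning it.

    Assumption: the year sits on the same line as the degree keyword.
    """
    passing_years = {}
    for line in text.splitlines():
        for degree in education:
            if degree in line and degree not in passing_years:
                years = YEAR_RE.findall(line)
                if years:
                    # Several years on one line are joined, e.g. "2012-2016".
                    passing_years[degree] = "-".join(years)
    return passing_years


def parse_month_year(value, pattern="%m/%Y"):
    """EAFP: return the year if value matches pattern, otherwise None."""
    try:
        return datetime.strptime(value, pattern).year
    except ValueError:
        # The string does not follow the expected pattern.
        return None


# Hypothetical sample input, for illustration only.
resume = "B.Tech in Computer Science, 2012-2016\nHSC, State Board, 2012"
print(get_passing_years(resume, ["B.Tech", "HSC"]))
print(parse_month_year("05/2016"))
```

The regex already restricts matches to 1900 through 2099, so the strptime helper is mainly useful when the resumes write dates in one known format and you want to reject anything that does not fit it.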
