基于统计数据的姓名性别预测
namegender的Python项目详细描述
根据统计数据预测名字的性别。
安装
pip install namegender
用法
>>>importnamegender>>>namegender.predict('Otto'){'name':'Otto','gender':'male','probability':99.68185288877577,'samples':31432}>>>namegender.predict_list(['Otto','Jane'])[{'gender':'male','samples':31432,'name':'Otto','probability':99.68185288877577},{'gender':'female','samples':370379,'name':'Jane','probability':99.69382713382778}]
来源
- https://github.com/ropensci/genderdata
- https://catalog.data.gov/dataset/baby-names-from-social-security-card-applications-data-by-state-and-district-of-
- https://catalog.data.gov/dataset/baby-names-from-social-security-card-applications-national-level-data
- http://www.cs.cmu.edu/afs/cs/project/ai-repository/ai/areas/nlp/corpora/names/0.html
- https://usa.ipums.org/usa/
- https://www.nappdata.org/napp/