我有以下格式的数据:
Bxxxx, Mxxxx F Birmingham AL (123) 555-2281 NCC Clinical Mental Health, Counselor Education, Sexual Abuse Recovery, Depression/Grief/Chronically or Terminally Ill, Mental Health/Agency Counseling English 99.52029 -99.8115
Axxxx, Axxxx Brown Birmingham AL (123) 555-2281 NCC Clinical Mental Health, Depression/Grief/Chronically or Terminally Ill, Mental Health/Agency Counseling English 99.52029 -99.8115
Axxxx, Bxxxx Mobile AL (123) 555-8011 NCC Childhood & Adolescence, Clinical Mental Health, Sexual Abuse Recovery, Disaster Counseling English 99.68639 -99.053238
Axxxx, Rxxxx Lunsford Athens AL (123) 555-8119 NCC, NCCC, NCSC Career Development, Childhood & Adolescence, School, Disaster Counseling, Supervision English 99.804501 -99.971283
Axxxx, Mxxxx Mobile AL (123) 555-5963 NCC Clinical Mental Health, Counselor Education, Depression/Grief/Chronically or Terminally Ill, Mental Health/Agency Counseling, Supervision English 99.68639 -99.053238
Axxxx, Txxxx Mountain Brook AL (123) 555-3099 NCC Addictions and Dependency, Career Development, Childhood & Adolescence, Corrections/Offenders, Sexual Abuse Recovery English 99.50214 -99.75557
Axxxx, Lxxxx Birmingham AL (123) 555-4550 NCC Addictions and Dependency, Eating Disorders English 99.52029 -99.8115
Axxxx, Wxxxx Birmingham AL (123) 555-2328 NCC English 99.52029 -99.8115
Axxxx, Rxxxx Mobile AL (123) 555-9411 NCC Addictions and Dependency, Childhood & Adolescence, Couples & Family, Sexual Abuse Recovery, Depression/Grief/Chronically or Terminally Ill English 99.68639 -99.053238
只需要提取人名。理想情况下,我可以使用humanName获得一组名为对象的字段name.first
,name.middle
,name.last
,name.title
。。。在
我试着迭代,直到我碰到前两个连续的大写字母表示状态,然后将前面的内容存储到一个列表中,然后调用humanName,但那是一个灾难。我不想继续尝试这种方法。在
有没有一种方法可以感知单词的开头和结尾?那可能会有帮助。。。在
建议?在
不是代码答案,但看起来您可以从http://www.abec.alabama.gov/rostersearch2.asp?search=%25&submit1=Search上的许可委员会获得您想要的大部分/所有数据。名字很容易找到。在
最好的办法是找到不同的数据源。说真的。这个是假的。在
如果你做不到,我会做一些类似的工作:
只是为了好玩,我实现了这个。享受吧。在
相关问题 更多 >
编程相关推荐