import nltk
words = set(nltk.corpus.words.words())
#https://stackoverflow.com/a/41290205
f = lambda x: " ".join(w for w in nltk.wordpunct_tokenize(x) if w.lower() in words)
#apply only for object columns (obviously strings)
cols = df.select_dtypes(object).columns
df[cols] = df[cols].applymap(f)
print (df)
ID A1 B1 C1
0 1 apple banana
1 2 I love you
2 3 dad
您可以将值与某些命令进行比较,这里是从
ntlk
开始的,如果不匹配,则删除值,但如果字典ntlk中不存在,则仍可能删除某些值,如mom
:相关问题 更多 >
编程相关推荐