字符串特征提取
string-demon的Python项目详细描述
这是后续机器学习算法的字符串特征提取项目。
>示例:
`````
>导入字符串恶魔为sd
str1
str1
垃圾邮件检查(str1)
````
>;(0.9047619047619048,2.6246719160104988, 4.833333333333333, 0.7241379310344828)
return refer to: (中文重复率,中文停顿长度,英文停顿长度,中英文长度比)
```
import string_demon as sd
str2 = "我住在南方,我住在南方。"
print sd.lcs_check(str2)
```
> (2, '\xe6\x88\x91\xe4\xbd\x8f\xe5\x9c\xa8\xe5\x8d\x97\xe6\x96\xb9',5)
>;返回参考:(重次次数,lcs,lcs.length)
>示例:
`````
>导入字符串恶魔为sd
str1
str1
垃圾邮件检查(str1)
````
>;(0.9047619047619048,2.6246719160104988, 4.833333333333333, 0.7241379310344828)
return refer to: (中文重复率,中文停顿长度,英文停顿长度,中英文长度比)
```
import string_demon as sd
str2 = "我住在南方,我住在南方。"
print sd.lcs_check(str2)
```
> (2, '\xe6\x88\x91\xe4\xbd\x8f\xe5\x9c\xa8\xe5\x8d\x97\xe6\x96\xb9',5)
>;返回参考:(重次次数,lcs,lcs.length)