Computing two values (total words and unique words) for multiple text files and writing them to a CSV in Python

#!/usr/bin/env python
# Get from each text file a total word count and a unique word count.
# Output a CSV with three columns: filename, total, unique.
import glob

list_of_files = glob.glob('./*.txt')
with open('countfile.csv', 'w') as out:
    for file_name in list_of_files:
        with open(file_name) as f:
            ???
        out.write('{f},{t},{u}\n'.format(f=file_name, t=word_total, u=uniques))

1 answer

User

#1 · Posted 2024-04-19 05:40:17

import glob
import re

files = {}
for fpath in glob.glob("*.txt"):
    with open(fpath) as f:
        # Replace everything except letters, apostrophes and hyphens with spaces.
        fixed_text = re.sub(r"[^a-zA-Z'-]", " ", f.read())
    words = fixed_text.split()
    total_words = len(words)
    total_unique = len(set(words))  # case-sensitive: "The" and "the" count separately
    files[fpath] = (total_words, total_unique)
    print("Total words:", total_words)
    print("Total unique:", total_unique)

# Write one row per file: filename, total, unique.
with open("some_csv.csv", "w") as f:
    for fname in files:
        print("%s,%s,%s" % (fname, files[fname][0], files[fname][1]), file=f)

I think this should work...
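If a header row and proper CSV quoting are wanted, the same idea could be sketched with the standard csv module. This is only one possible variant, not the answer's code: the function names count_words and write_counts, the ignore_case option, and the header row are assumptions made for illustration. The word pattern matches the same characters the answer keeps (letters, apostrophes, hyphens).

```python
import csv
import glob
import re
from collections import Counter

# Same character set the answer keeps: letters, apostrophes, hyphens.
WORD_RE = re.compile(r"[a-zA-Z'-]+")


def count_words(text, ignore_case=True):
    """Return (total, unique) word counts for a string (hypothetical helper)."""
    if ignore_case:
        # Assumption: "The" and "the" should count as one unique word.
        text = text.lower()
    counts = Counter(WORD_RE.findall(text))
    return sum(counts.values()), len(counts)


def write_counts(pattern="*.txt", out_path="countfile.csv"):
    """Write one filename,total,unique row per file matching pattern."""
    with open(out_path, "w", newline="") as out:
        writer = csv.writer(out)  # handles quoting if a filename contains a comma
        writer.writerow(["filename", "total", "unique"])
        for fpath in sorted(glob.glob(pattern)):
            with open(fpath) as f:
                total, unique = count_words(f.read())
            writer.writerow([fpath, total, unique])
```

Calling write_counts() from the directory that holds the text files would produce countfile.csv there. Passing ignore_case=False to count_words gives the case-sensitive counts of the answer above.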
