Python 3: Converting Latin-1 to UTF-8

import codecs
import glob
import os

for file in glob.iglob(os.path.join(dir, '*.txt')):
    print(file)
    with codecs.open(file, encoding='latin-1') as f:
        infile = f.read()
    with codecs.open('test.txt', mode='w', encoding='utf-8') as f:
        f.write(infile)

<Trans audio_filename="VALE_M11_070.MP3" xml:lang="español">
<Datos clave_texto=" VALE_M11_070" tipo_texto="entrevista_semidirigida">
<Corpus corpus="PRESEEA" subcorpus="ESESUMA" ciudad="Valencia" pais="España"/>

2 answers

User

#1 · Edited 2024-06-06 06:49:20

I got about halfway with this. It is not quite what you want or need, but it may point others in the right direction...

# First read the file, decoding it as Latin-1
with open("file_name", "r", encoding="latin-1") as txt:  # r = read, w = write, a = append
    items = txt.readlines()

# Then write the changes back to the same file as UTF-8
with open("file_name", "w", encoding="utf-8") as output:
    for string_fin in items:
        # str.replace() leaves the line unchanged when the substring is absent,
        # so no separate "in" check is needed
        string_fin = string_fin.replace("Ã©", "é")
        string_fin = string_fin.replace("Ã«", "ë")

        # this works if not too much needs changing...

        output.write(string_fin)
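
The per-character replacements above only cover the two sequences listed. A more general variant, sketched here under the assumption that the garbled text really is UTF-8 that was decoded as Latin-1 (the case where "é" shows up as "Ã©"), reverses that step for the whole string at once. The helper name `repair_mojibake` is made up for this illustration and is not part of the original answer.

```python
def repair_mojibake(text: str) -> str:
    """Undo UTF-8 bytes that were mistakenly decoded as Latin-1.

    Hypothetical helper for illustration only.
    """
    try:
        # Recover the original bytes, then decode them as UTF-8.
        return text.encode("latin-1").decode("utf-8")
    except (UnicodeEncodeError, UnicodeDecodeError):
        # The text does not round-trip, so it is probably not this kind
        # of mojibake (or only partly); return it unchanged.
        return text


print(repair_mojibake("cafÃ©"))
```

Text that is already correct, such as "café", fails the UTF-8 decode and is returned untouched. If the wrong decoding was Windows-1252 rather than Latin-1, some characters will not round-trip this way and the string is likewise left as it was.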


User

#2 · Edited 2024-06-06 06:49:20

For Python 3.6:

your_str = your_str.encode('utf-8').decode('latin-1')
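
The direction of the encode/decode pair matters, so it may be worth checking on a sample before applying it to real data. The small sketch below uses the word "España" from the question's XML. The variable names are illustrative only.

```python
clean = "España"

# The one-liner as written: UTF-8 bytes reinterpreted as Latin-1.
garbled = clean.encode("utf-8").decode("latin-1")

# The reverse order: Latin-1 bytes reinterpreted as UTF-8.
restored = garbled.encode("latin-1").decode("utf-8")

print(repr(garbled))   # 'EspaÃ±a'
print(repr(restored))  # 'España'
```

On a correctly decoded string, `.encode('utf-8').decode('latin-1')` therefore introduces the "Ã±"-style sequences, while `.encode('latin-1').decode('utf-8')` removes them. Which one is needed depends on how the string was decoded in the first place.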
