How to build a list of strings, using special characters to decide where to split

#The Piper At The Gates Of Dawn::1967
*Lucifer Sam::Syd Barrett::03:07::Lucifer Sam, Siam cat
Always sitting by your side
Always by your side
... ( The lyrics of the song )
*Matilda mother::Syd Barrett::03:07::There was a king who ruled the land
His majesty was in command
With silver eyes the scarlet eagle
... ( The lyrics of the song )
#Another album
*another song
song's lyrics

2 answers

User

#1 · edited 2024-05-21 08:52:35

Not especially efficient, but it works:

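path = "filepath"  # path to the input file

# Append "#" to each header line so that splitting on "#" separates the header from its songs.
with open(path) as fh:
    txt = "".join(line + "#" if line.startswith("#") else line for line in fh)
data = txt.split("#")[1:]  # drop the empty string before the first "#"
data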

['The Piper At The Gates Of Dawn::1967\n',
 '*Lucifer Sam::Syd Barrett::03:07::Lucifer Sam, Siam cat\nAlways sitting by your side\nAlways by your side\n... ( The lyrics of the song )\n*Matilda mother::Syd Barrett::03:07::There was a king who ruled the land\nHis majesty was in command\nWith silver eyes the scarlet eagle\n... ( The lyrics of the song )\n',
 'Another album\n',
 "*another song\nsong's lyrics\n"]
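If the goal is to go one step further and separate songs and fields as well, the same idea can be applied at each level. The following is only a sketch under assumptions taken from the sample in the question: album headers start a line with "#", songs start a line with "*", and fields are separated by "::". The function name parse_albums and the dictionary keys are made up for illustration.

```python
import re

def parse_albums(text):
    """Split text into albums (lines starting with '#') and songs (lines starting with '*')."""
    albums = []
    # Split only where '#' begins a line; the first chunk is whatever precedes the first header.
    for chunk in re.split(r"^#", text, flags=re.MULTILINE)[1:]:
        header, _, body = chunk.partition("\n")
        songs = []
        for song in re.split(r"^\*", body, flags=re.MULTILINE)[1:]:
            # Limit the split to 3 so any '::' inside the lyrics stays intact.
            songs.append(song.rstrip("\n").split("::", 3))
        albums.append({"header": header.split("::"), "songs": songs})
    return albums
```

Each song becomes a list such as [title, author, duration, lyrics]; a song with no "::" separators (like "another song" in the sample) stays as a single string holding its title line and lyrics.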

User

#2 · edited 2024-05-21 08:52:35

You can do this with regular expressions (the re module). Consider the following example, assuming you have a file songs.txt like this:

#Song 1
First line
Second line
#Song 2
First line of second
Last line

You can do:

import re

with open('songs.txt', 'r') as f:
    data = f.read()
# Each match is a 2-tuple: (header line, song body).
songs = re.findall(r'(#.+?\n)([^#]+)', data)
# Flatten the list of 2-tuples into a single flat list.
songs = list(sum(songs, ()))
print(songs)  # ['#Song 1\n', 'First line\nSecond line\n', '#Song 2\n', 'First line of second\nLast line\n']

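The pattern (the first argument to re.findall) contains two groups, each delimited by parentheses: the first captures the title and the second captures the lyrics. The first group must be a #, followed by one or more characters other than a newline (\n), and end with a newline (\n). The second group matches one or more characters that are not #.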
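One consequence of the second group being "anything except #" is that a # appearing inside the lyrics cuts the body short. A possible variant, sketched here rather than taken from the answer, is to split only on lines that begin with #, using re.split with a capturing group so the header lines are kept. The name split_sections is hypothetical.

```python
import re

def split_sections(text):
    """Return a flat list [header, body, header, body, ...] for '#'-headed sections."""
    # The capturing group makes re.split keep each header line in the result.
    # re.MULTILINE lets ^ match at the start of every line, so a '#' in the
    # middle of a line is not treated as a header.
    parts = re.split(r"^(#.*\n)", text, flags=re.MULTILINE)
    return parts[1:]  # parts[0] is the text before the first header

sample = "#Song 1\nCall #1 now\nSecond line\n#Song 2\nLast line\n"
print(split_sections(sample))
# ['#Song 1\n', 'Call #1 now\nSecond line\n', '#Song 2\n', 'Last line\n']
```

The result is already flat, so no extra flattening step is needed. Note that a header on the very last line without a trailing newline would not match this pattern.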
