创建配置文件并使用计数

'mississippi worth reading about', ' commonplace river contrary ways remarkable', ' considering missouri main branch longest river world--four miles', ' seems safe crookedest river world part journey uses cover ground crow fly six seventy-five', ' discharges water st', ' lawrence twenty-five rhine thirty-eight thames', ' river vast drainage-basin draws water supply twenty-eight states territories delaware atlantic seaboard country idaho pacific slope spread forty-five degrees longitude', ' mississippi receives carries gulf water fifty-four subordinate rivers navigable steamboats hundreds navigable flats keels', ' area drainage-basin combined areas england wales scotland ireland france spain portugal germany austria italy turkey almost wide region fertile mississippi valley proper exceptionally so']

river 4 (profile) atlantic: 1 branch: 1 commonplace: 1 considering: 1 contrary: 1 country: 1 cover: 1 crookedest: 1 crow: 1 degrees: 1 delaware: 1 drainage-basin: 1 draws: 1 fly: 1 forty-five: 1 ground: 1 idaho: 1 journey: 1 longest: 1 longitude: 1 main: 1 missouri: 1 pacific: 1 part: 1 remarkable: 1 safe: 1 seaboard: 1 seems: 1 seventy-five: 1 six: 1 slope: 1 spread: 1 states: 1 supply: 1 territories: 1 twenty-eight: 1 uses: 1 vast: 1 water: 1 ways: 1

{'austria', 'fortyfive', 'fiftyfour', 'longest', 'vast', 'almost', 'states', 'region', 'commonplace', 'wide', 'flats', 'main', 'longitude', 'part', 'gulf', 'st', 'contrary', 'missouri', 'pacific', 'hundreds', 'area', 'areas', 'turkey', 'discharges', 'twentyeight', 'fly', 'worth', 'thirtyeight', 'valley', 'seaboard', 'wales', 'ireland', 'ways', 'uses', 'scotland', 'ground', 'river', 'steamboats', 'seventyfive', 'territories', 'safe', 'degrees', 'twentyfive', 'england', 'thames', 'subordinate', 'drainagebasin', 'water', 'considering', 'fertile', 'rivers', 'spread', 'reading', 'combined', 'seems', 'france', 'crookedest', 'drainagebasin:', 'supply', 'rhine', 'portugal', 'six', 'slopea', 'draws', 'exceptionally', 'mississippi', 'idaho', 'worldfour', 'atlantic', 'italy', 'spain', 'receives', 'cover', 'remarkable', 'germany', 'crow', 'delaware', 'country', 'branch', 'carries', 'proper', 'lawrence', 'journey', 'keels', 'navigable'}

{'remarkable', 'six', 'part', 'navigable', 'england', 'areas', 'worth', 'ways', 'longest', 'lawrence', 'journey', 'longitude', 'austria', 'rivers', 'st', 'crow', 'pacific', 'thirty-eight', 'gulf', 'ireland', 'drainage-basin', 'delaware', 'spread', 'proper', 'subordinate', 'territories', 'germany', 'cover', 'fifty-four', 'slope--a', 'fertile', 'degrees', 'wales', 'seems', 'exceptionally', 'water', 'italy', 'fly', 'missouri', 'turkey', 'atlantic', 'flats', 'hundreds', 'world--four', 'branch', 'twenty-eight', 'main', 'spain', 'receives', 'keels', 'states', 'portugal', 'draws', 'almost', 'contrary', 'seaboard', 'safe', 'mississippi', 'idaho', 'scotland', 'steamboats', 'france', 'valley', 'twenty-five', 'carries', 'wide', 'crookedest', 'area', 'reading', 'rhine', 'discharges', 'uses', 'commonplace', 'combined', 'considering', 'seventy-five', 'river', 'region', 'forty-five', 'ground', 'country', 'vast', 'thames', 'supply'}

for i in unique: kw = i count_word = [i for i in temp for j in i.split() if j == kw] count_dict = {j: i.count(j) for i in count_word for j in i.split() if j != kw} print(kw) for a, c in sorted(count_dict.items(), key=lambda x: x[0]): print('{}: {}'.format(a, c)) print()

1条回答

网友

1楼 · 发布于 2024-05-23 16:50:55

为此，我们可以将kw(keyword)指定为river，然后我们可以使用列表理解来获取包含该kw的所有项，注意有些句子包含rivers，因此kw in将不起作用。从这里开始，我们可以使用字典理解来构造一个字典，我们将使用j来表示i.split()中的每个单词，i.count(j)来表示每个项目中每个单词的计数，我们还将加入if j != kw，因此我们的列表中不包括river。最后，我们可以使用for k, v in dicta.items()打印，如果需要，我们可以添加排序方法，以按字母顺序获得结果。你知道吗

kw = 'river'
lista = [i for i in temp for j in i.split() if j == kw]
dicta = {j: i.count(j) for i in lista for j in i.split() if j != kw}

for k, v in sorted(dicta.items(), key=lambda x: x[0]):
    print('{}: {}'.format(k, v))

atlantic: 1
branch: 1
commonplace: 1
considering: 1
contrary: 1
country: 1
...
twenty-eight: 1
uses: 1
vast: 1
water: 1
ways: 1
world: 1
world four: 1

扩展循环：

lista = []
for i in temp:
    for j in i.split():
        if j == kw:
            lista.append(i)

dicta = {}
for i in lista:
    for j in i.split():
        dicta[j] = i.count(j)

附加请求：

Read all entire file into one variable as string

all_words = 'some string'
all_words = all_words.split()
unique = set(all_words)

for i in unique:
    kw = i
    temp = list of sentences to check against
    rest of existing code
    maybe instead of printing the final statement append the dictionaries created to a list

相关问题更多 >

编程相关推荐

热门问题

热门文章