如何用Pandas切分句子的左右部分

sentence = "lack of association between the promoter polymorphism of the mtnr1a gene and adolescent idiopathic scoliosis" root = "mtnr1a" try: words = sentence.split() n = words.index(root) cutoff = ' '.join(words[n-4:n+5]) except ValueError: cutoff = None print(cutoff)

sentence = data['sentence'] root = data['rootword'] def cutOff(sentence,root): try: words = sentence.str.split() n = words.index(root) cutoff = ' '.join(words[n-4:n+5]) except ValueError: cutoff = None return cutoff data.apply(cutOff(sentence,root),axis=1)

sentence = "mtnr1a lack of association between the promoter polymorphism of the gene and adolescent idiopathic scoliosis" out if root in first position: "mtnr1a lack of association between" out if root in last position: "lack of association between the promoter polymorphism of the gene and adolescent idiopathic scoliosis" "adolescent idiopathic scoliosis mtnr1a"

1条回答

网友

1楼 · 发布于 2024-06-16 10:01:05

代码中的两个小调整应该可以解决您的问题：

首先，对数据帧调用^{}将函数应用于调用它的数据帧的每一行中的值。你知道吗

您不必将列作为输入传递给函数，调用sentence.str.split()也没有意义。在cutOff()函数中sentence只是一个常规字符串（不是列）。你知道吗

将函数更改为：

def cutOff(sentence,root): 
    try: 
        words = sentence.split()  # this is the line that was changed
        n = words.index(root) 
        cutoff = ' '.join(words[n-4:n+5]) 
    except ValueError: 
        cutoff = None 
    return cutoff

接下来您只需指定将作为函数输入的列—您可以使用lambda：

df.apply(lambda x: cutOff(x["sentence"], x["rootword"]), axis=1)
#0    promoter polymorphism of the mtnr1a gene and a...
#dtype: object

相关问题更多 >

编程相关推荐

热门问题

热门文章