Python2.7按行设置列的pandas条件

Name Price Rating URL Notes1 Notes2 Notes3 Foo $450 9 a.com/x NaN NaN NaN Bar $99 5 see over www.b.com Hilarious Nifty John $551 2 www.c.com Pretty NaN NaN Jane $999 8 See Over in Notes Funky http://www.d.com Groovy

df['URL'] = df['URL'].fillna('') df['Notes1'] = df['Notes1'].fillna('') df['Notes2'] = df['Notes2'].fillna('') df['Notes3'] = df['Notes3'].fillna('') to_move = df['URL'].str.lower().str.contains('see over') df.loc[to_move, 'URL'] = df['Notes1']

url_in1 = df['Notes1'].str.contains('\.com') url_in2 = df['Notes2'].str.contains('\.com') to_move = df['URL'].str.lower().str.contains('see-over') to_move1 = to_move & url_in1 to_move2 = to_move & url_in2 df.loc[to_move1, 'URL'] = df.loc[url_in1, 'Notes1'] df.loc[url_in1, 'Notes1'] = df['Notes2'] df.loc[url_in1, 'Notes2'] = '' df.loc[to_move2, 'URL'] = df.loc[url_in2, 'Notes2'] df.loc[url_in2, 'Notes2'] = ''

1条回答

网友

1楼 · 发布于 2024-06-06 21:11:33

我还在学习pandas，所以这段代码的某些部分可能不那么优雅，但总体思想是-获取所有notes列，找到其中的所有url，将其与URL列合并，然后将剩余的notes合并到Notes1列中：

import pandas as pd
import numpy as np
import pandas.core.strings as strings

# Just to get first notnull occurence
def geturl(s):
    try:
        return next(e for e in s if not pd.isnull(e))
    except:
        return np.NaN

df =  pd.read_csv("d:/temp/data2.txt")

dfnotes = df[[e for e in df.columns if 'Notes' in e]]

#       Notes1            Notes2  Notes3
# 0        NaN               NaN     NaN
# 1  www.b.com         Hilarious   Nifty
# 2     Pretty               NaN     NaN
# 3      Funky  http://www.d.com  Groovy

dfurls = dfnotes.apply(lambda x: x.str.contains('\.com'), axis=1)
dfurls = dfurls.fillna(False).astype(bool)

#   Notes1 Notes2 Notes3
# 0  False  False  False
# 1   True  False  False
# 2  False  False  False
# 3  False   True  False

turl = dfnotes[dfurls].apply(geturl, axis=1)

df['URL'] = np.where(turl.isnull(), df['URL'], turl)
df['Notes1'] = dfnotes[~dfurls].apply(lambda x: strings.str_cat(x[~x.isnull()], sep=' '), axis=1)

del df['Notes2']
del df['Notes3']

df
#    Name Price  Rating               URL           Notes1
# 0   Foo  $450       9           a.com/x                 
# 1   Bar   $99       5         www.b.com  Hilarious Nifty
# 2  John  $551       2         www.c.com           Pretty
# 3  Jane  $999       8  http://www.d.com     Funky Groovy

相关问题更多 >

编程相关推荐

热门问题

热门文章