如何替换Pandas中的值？

col_names = ["duration","protocol_type","service","flag","src_bytes", "dst_bytes","land","wrong_fragment","urgent","hot","num_failed_logins", "logged_in","num_compromised","root_shell","su_attempted","num_root", "num_file_creations","num_shells","num_access_files","num_outbound_cmds", "is_host_login","is_guest_login","count","srv_count","serror_rate", "srv_serror_rate","rerror_rate","srv_rerror_rate","same_srv_rate", "diff_srv_rate","srv_diff_host_rate","dst_host_count","dst_host_srv_count", "dst_host_same_srv_rate","dst_host_diff_srv_rate","dst_host_same_src_port_rate", "dst_host_srv_diff_host_rate","dst_host_serror_rate","dst_host_srv_serror_rate", "dst_host_rerror_rate","dst_host_srv_rerror_rate","label"]

duration protocol_type ... dst_host_srv_rerror_rate label 0 0 tcp ... 0.00 normal 1 0 udp ... 0.00 normal 2 0 tcp ... 0.00 neptune 3 0 tcp ... 0.01 normal 4 0 tcp ... 0.00 normal

df.replace(to_replace = ['ipsweep.', 'portsweep.', 'nmap.', 'satan.'], value = 'probe', inplace = True) df.replace(to_replace = ['ftp_write.', 'guess_passwd.', 'imap.', 'multihop.', 'phf.', 'spy.', 'warezclient.', 'warezmaster.'], value = 'r2l', inplace = True) df.replace(to_replace = ['buffer_overflow.', 'loadmodule.', 'perl.', 'rootkit.'], value = 'u2r', inplace = True) df.replace(to_replace = ['back.', 'land.' , 'neptune.', 'pod.', 'smurf.', 'teardrop.'], value = 'dos', inplace = True)

df['label'] = df['label'].replace(['ipsweep.', 'portsweep.', 'nmap.', 'satan.'], 'probe',regex=True) df['label'] = df['label'].replace(['ftp_write.', 'guess_passwd.', 'imap.', 'multihop.', 'phf.', 'spy.', 'warezclient.', 'warezmaster.'], 'r2l',regex=True) df['label'] = df['label'].replace(['buffer_overflow.', 'loadmodule.', 'perl.', 'rootkit.'], 'u2r',regex=True) df['label'] = df['label'].replace(['back.', 'land.' , 'neptune.', 'pod.', 'smurf.', 'teardrop.'], 'dos',regex=True)

After replacing, first 5 rows of df: duration protocol_type ... dst_host_srv_rerror_rate label 0 0 tcp ... 0.00 normal 1 0 udp ... 0.00 normal 2 0 tcp ... 0.00 neptune 3 0 tcp ... 0.01 normal 4 0 tcp ... 0.00 normal

2条回答

网友

1楼 · 编辑于 2024-06-16 09:37:18

也许您正在使用"neptune."而不是"neptune"

我的测试似乎与"neptune"一起工作

>>> df
     label
0  neptune
>>> df["label"].replace(["neptune."], "normal", regex=True)
0    neptune
Name: label, dtype: object
>>> df["label"].replace(["neptune"], "normal", regex=True)
0    normal
Name: label, dtype: object
>>> df["label"].replace(["neptune"], "normal")
0    normal
Name: label, dtype: object

网友

2楼 · 编辑于 2024-06-16 09:37:18

通过使用"neptune."作为带有regex = True的to_replace值，您告诉熊猫寻找"neptune"和任何单个附加字符（例如，“neptuneX”或“neptune！”）。由于该额外字符不存在，因此整个短语不会被替换。相反，您可以只使用"neptune"，或"neptune.?"表示0或1个额外字符，或"neptune.*"表示0或更多额外字符

如果没有regex = True，您是在告诉Pandas查找字面上的"neptune."短语

相关问题更多 >

编程相关推荐

热门问题

热门文章