Skilled in: Python, MySQL, Java
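<pre><code>import pandas as pd

# reorganize data: one indicator column per (tag, list position)
df = pd.get_dummies(
    df.set_index('name').tags
      .apply(pd.Series)
      .stack()
).unstack()
# remove multilevel column and collapse counts per name
df.columns = df.columns.droplevel(1)
# groupby(axis=1) is deprecated in recent pandas, so transpose and
# group the duplicated column labels on the index instead
df = df.T.groupby(level=0).sum().T.add_prefix('tags_')
print(df)
# Output:
#        tags_a  tags_b  tags_c
# name
# Rob         1       0       1
# Erica       0       1       1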
</code></pre>
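The snippet above assumes `df` already exists. As a rough, self-contained sketch, the same tag counts can also be reached with `explode` and `pd.crosstab`. The sample frame below is hypothetical, reconstructed from the output shown above, and this is an alternative route rather than the method used in the answer.

```python
import pandas as pd

# Hypothetical input, inferred from the output shown above:
# each name carries a list of tags.
df = pd.DataFrame({
    "name": ["Rob", "Erica"],
    "tags": [["a", "c"], ["b", "c"]],
})

# One row per (name, tag) pair.
exploded = df.explode("tags")

# Count how often each tag occurs for each name,
# then prefix the resulting columns with 'tags_'.
result = pd.crosstab(exploded["name"], exploded["tags"]).add_prefix("tags_")

print(result)
```

`crosstab` sorts the row index, so the names may come out in a different order than in the input. If the original order matters, reindexing on the original names should restore it.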