生成汇总标签页

+--------------------+--------------------+--------------------+ |Fruit |Shop |Price | +--------------------+--------------------+--------------------+ |Apple |Coles |$1.50 | |Apple |Woolworths |$1.60 | |Apple |IGA |$1.70 | |Banana |Coles |$0.50 | |Banana |Woolworths |$0.60 | |Banana |IGA |$0.70 | |Cherry |Coles |$5.00 | |Date |Coles |$2.00 | |Date |Woolworths |$2.10 | |Elderberry |IGA |$10.00 | +--------------------+--------------------+--------------------+

+----------+----------+----------+----------+ |Fruit |Coles |Woolworths|IGA | +----------+----------+----------+----------+ |Apple |$1.50 |$1.60 |$1.70 | |Banana |$0.50 |$0.60 |$0.70 | |Cherry |NULL |$5.00 |NULL | |Date |$2.00 |$2.10 |NULL | |Elderberry|NULL |NULL |$10.00 | +----------+----------+----------+----------+

2条回答

网友

1楼 · 编辑于 2024-05-15 16:51:27

在python方面，您可以使用一些itertools魔术来重新排列数据：

data = [('Apple',      'Coles',      1.50),
        ('Apple',      'Woolworths', 1.60),
        ('Apple',      'IGA',        1.70),
        ('Banana',     'Coles',      0.50),
        ('Banana',     'Woolworths', 0.60),
        ('Banana',     'IGA',        0.70),
        ('Cherry',     'Coles',      5.00),
        ('Date',       'Coles',      2.00),
        ('Date',       'Woolworths', 2.10),
        ('Elderberry', 'IGA',        10.00)]

from itertools import groupby, islice
from operator import itemgetter
from collections import defaultdict

stores = sorted(set(row[1] for row in data))
# probably splitting this up in multiple lines would be more readable
pivot = ((fruit, defaultdict(lambda: None, (islice(d, 1, None) for d in data))) for fruit, data in groupby(sorted(data), itemgetter(0)))

print 'Fruit'.ljust(12), '\t'.join(stores)
for fruit, prices in pivot:
    print fruit.ljust(12), '\t'.join(str(prices[s]) for s in stores)

输出：

Fruit        Coles      IGA     Woolw
Apple        1.5        1.7     1.6
Banana       0.5        0.7     0.6
Cherry       5.0        None    None
Date         2.0        None    2.1
Elderberry   None       10.0    None

网友

2楼 · 编辑于 2024-05-15 16:51:27

熊猫套餐能很好地处理这个问题。

>>> import pandas
>>> df=pandas.DataFrame(data, columns=['Fruit', 'Shop', 'Price'])
>>> df.pivot(index='Fruit', columns='Shop', values='Price')
Shop        Coles   IGA  Woolworths
Fruit                              
Apple         1.5   1.7         1.6
Banana        0.5   0.7         0.6
Cherry        5.0   NaN         NaN
Date          2.0   NaN         2.1
Elderberry    NaN  10.0         NaN

文件： http://pandas.pydata.org/pandas-docs/stable/reshaping.html

一些学习熊猫的IPython笔记本： https://bitbucket.org/hrojas/learn-pandas

希望这会有帮助。
致意
帕特里克·布罗克曼

相关问题更多 >

编程相关推荐

热门问题

热门文章