Pandas数据帧行更改类型

In [167]: thead = table.head() In [168]: thead Out[168]: Consolidated Balance Sheet (USD $) Sep. 30, 2012 Dec. 31, 2011 0 In Millions, unless otherwise specified None None 1 Current assets None None 2 Cash and cash equivalents $ 3,029 $ 2,219 3 Marketable securities - current 1989 1461 4 Accounts receivable - net 4409 3867 In [170]: def no_comma_or_dollar(num): if isinstance(num, unicode): return float(num.lstrip('$').replace(',','')) else: return num thead[:, 1] = thead[:, 1].apply(no_comma_or_dollar)

In [171]: thead.to_dict() Out[171]: {u'Consolidated Balance Sheet (USD $)': {0: u'In Millions, unless otherwise specified', 1: u'Current assets', 2: u'Cash and cash equivalents', 3: u'Marketable securities - current', 4: u'Accounts receivable - net'}, u'Dec. 31, 2011': {0: None, 1: None, 2: u'$ 2,219', 3: 1461.0, 4: 3867.0}, u'Sep. 30, 2012': {0: None, 1: None, 2: u'$ 3,029', 3: 1989.0, 4: 4409.0}}

2条回答

网友

1楼 · 编辑于 2024-06-16 10:11:55

如果我没听错，您正在寻找apply方法：

In [33]: import pandas as pd

In [34]: table = pd.Series([None, u'$ 3,12', u'$ 4,5'])

In [35]: table
Out[35]: 
0      None
1    $ 3,12
2     $ 4,5

In [36]: def f(cell):
   ....:     if isinstance(cell, unicode):
   ....:         return float(cell.lstrip('$').replace(',',''))
   ....:     else:
   ....:         return cell
   ....:     

In [37]: table.apply(f)
Out[37]: 
0    NaN
1    312
2     45

这会创建一个新对象。要存储新对象而不是旧对象，请执行以下操作：

^{pr2}$

网友

2楼 · 编辑于 2024-06-16 10:11:55

您只需打印这些文件，而不是将它们^{}-发送到数据框中，以下是一种方法：

创建一个函数来执行条带化（如果是unicode），或者如果已经是一个数字，则保留它：

def no_comma_or_dollar(num):
    if isinstance(num, unicode):
        return float(num.lstrip('$').replace(',',''))
    else:
        return num

table[col_name] = table[col_name].apply(no_comma_or_dollar)

例如：

^{pr2}$

更新：

对于您给出的thread，我很想给出一个稍微懒一点的no_comma_or_dollar和{a2}：

def no_comma_or_dollar2(num):
    try:
        return float(num.lstrip('$').replace(',',''))
    except: # if you can't strip/replace/convert just leave it
        return num

In [5]: thread.applymap(no_comma_or_dollar2)
Out[5]: 
        Consolidated Balance Sheet (USD $)  Dec. 31, 2011  Sep. 30, 2012
0  In Millions, unless otherwise specified            NaN            NaN
1                           Current assets            NaN            NaN
2                Cash and cash equivalents           2219           3029
3          Marketable securities - current           1461           1989
4                Accounts receivable - net           3867           4409

相关问题更多 >

编程相关推荐

热门问题

热门文章