基于等价性的Pandas数据拼写长度计算

2条回答

网友

1楼 · 编辑于 2024-04-19 04:06:54

以下内容适用于您的数据集，需要提出一个问题才能将我最初的答案简化为使用list comprehensions and itertools：

In [153]:

def num_spells(x):
    t = list(x.unique())
    return [t.index(el)+1 for el in x]

d1.apply(num_spells, axis=1)

Out[153]:
    1962  1963  1964  1965
c1     1     1     1     2
c2     1     1     2     2
c3     1     1     1     1

In [144]:
from itertools import chain, repeat
def spell_len(x):
    t = list(x.value_counts())
    return list(chain.from_iterable(repeat(i,i) for i in t))

d1.apply(spell_len, axis=1)
Out[144]:
    1962  1963  1964  1965
c1     3     3     3     1
c2     2     2     2     2
c3     4     4     4     4

网友

2楼 · 编辑于 2024-04-19 04:06:54

我已经更新了@EdChum建议的num\u拼写，并考虑了np.nan值的存在

def compute_number_of_spells(wide_df):
    """
    Compute Number of Spells in a Wide DataFrame for Each Row
    Columns : Time Data
    """
    def num_spells(x):
        """ Compute the spells in each row """
        t = list(x.dropna().unique())
        r = []
        for el in x:
            if not np.isnan(el):                
                r.append(t.index(el)+1)
            else:
                r.append(np.nan)            #Handle np.nan case
        return r
    wide_df = wide_df.apply(num_spells, axis=1)
    return wide_df

相关问题更多 >

编程相关推荐

热门问题

热门文章

基于等价性的Pandas数据拼写长度计算

相关问题 更多 >

编程相关推荐

热门问题

热门文章

相关问题更多 >