Grouping similar entries in Python
I have a numpy array:
array([ 2.86656000e+09, 2.86688000e+09, 2.86708000e+09,
2.86860000e+09, 2.86884000e+09, 2.86908000e+09,
2.86920000e+09, 2.87024000e+09, 2.87040000e+09,
2.87056000e+09, 2.87076000e+09, 2.87108000e+09,
2.87120000e+09, 2.87152000e+09, 2.87260000e+09,
2.87272000e+09, 2.87280000e+09, 2.87448000e+09,
2.87464000e+09, 2.87476000e+09, 2.87484000e+09])
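Is there a good way to group similar values? For example, could values that differ by no more than 1000000 be put into the same group? Thanks for any answers!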
1 Answer
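As an alternative to the solution mentioned in the comments, something like this should work (it assumes the array is already sorted in ascending order, as it is here):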
In [1]: import numpy as np
In [2]: arr = np.array([ 2.86656000e+09, 2.86688000e+09, 2.86708000e+09,
...: 2.86860000e+09, 2.86884000e+09, 2.86908000e+09,
...: 2.86920000e+09, 2.87024000e+09, 2.87040000e+09,
...: 2.87056000e+09, 2.87076000e+09, 2.87108000e+09,
...: 2.87120000e+09, 2.87152000e+09, 2.87260000e+09,
...: 2.87272000e+09, 2.87280000e+09, 2.87448000e+09,
...: 2.87464000e+09, 2.87476000e+09, 2.87484000e+09])
In [3]: np.split(arr, np.where(np.diff(arr) > 1000000)[0] + 1)
Out[3]:
[array([ 2.86656000e+09, 2.86688000e+09, 2.86708000e+09]),
array([ 2.86860000e+09, 2.86884000e+09, 2.86908000e+09,
2.86920000e+09]),
array([ 2.87024000e+09, 2.87040000e+09, 2.87056000e+09,
2.87076000e+09, 2.87108000e+09, 2.87120000e+09,
2.87152000e+09]),
array([ 2.87260000e+09, 2.87272000e+09, 2.87280000e+09]),
array([ 2.87448000e+09, 2.87464000e+09, 2.87476000e+09,
2.87484000e+09])]
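
If the input is not guaranteed to be sorted, the same idea can be wrapped in a small helper. This is only a sketch: the name group_by_gap and the max_gap parameter are made up for illustration and are not part of numpy. It sorts first, finds the positions where the difference between neighbours exceeds the threshold, and splits there.

```python
import numpy as np

def group_by_gap(values, max_gap=1_000_000):
    """Split values into groups of sorted neighbours that differ by at most max_gap.

    group_by_gap and max_gap are hypothetical names used for this sketch.
    """
    arr = np.sort(np.asarray(values, dtype=float))
    # np.diff(arr)[i] is arr[i + 1] - arr[i]; a difference above max_gap
    # means element i + 1 starts a new group, hence the "+ 1".
    breaks = np.where(np.diff(arr) > max_gap)[0] + 1
    return np.split(arr, breaks)

# Unsorted sample taken from the first five values of the array above.
groups = group_by_gap([2.86908e9, 2.86656e9, 2.86688e9, 2.86860e9, 2.86884e9])
for g in groups:
    print(g.tolist())
```

Note that the threshold applies to neighbouring values only, so the first and last element of a group can be further apart than max_gap. In the output above, for example, 2.87024e+09 and 2.87152e+09 end up in the same group although they differ by 1280000.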