基于pandas datafram向散点图添加过滤器

2024-06-16 16:56:53 发布

您现在位置:Python中文网/ 问答频道 /正文

我假装用以下数据帧的过滤器做一个分散图(表示整个赛季的球员、球队和赛季,并计算篮球队球员所做的未助攻和未助攻得分):

player          team_name       season          assisted    notassisted
A. DANRIDGE     NACIONAL        Season_17_18    130         445
A. DANRIDGE     NACIONAL        Season_18_19    132         382
D. ROBINSON     TROUVILLE       Season_18_19    89          286
D. DAVIS        AGUADA          Season_18_19    101         281
E. BATISTA      WELCOME         Season_17_18    148         278
F. MARTINEZ     GOES            Season_18_19    52          259
D. ALVAREZ      AGUADA          Season_17_18    114         246
M. HICKS        H. MACABI       Season_17_18    140         245

在x轴上,我想放置辅助点,在y轴上放置非辅助点。但是我还想按赛季、球队和球员进行筛选,所以当我选择一个球队中的一个确定的球员时,我可以看到他们的分数是一种颜色,而其他的分数是灰色的,或者如果我想选择两个或更多的球员,我可以在他们之间进行比较(用不同的颜色),其他的点是可见的,但是是灰色的。我还想比较两个不同球队的球员和过滤器的组合。在

我正在学习数据科学,我可以用图库绘制散点图和按团队筛选,我可以比较两个不同的团队(或季节或球员)。在

但我不能以一种奇特的方式添加多个过滤器,而且我也不知道如何显示选中的过滤器,并将其他过滤器置灰(不消除它们)。在

代码如下:

^{pr2}$

图形结果如下:

Scatter plot resultant

总之,我希望有三个过滤器,一个用于季节,另一个用于团队,另一个用于球员,以便在每个过滤器中有多个选择,并获得不同的颜色,其余的点变灰,以便我可以将结果与其余的进行比较,我不确定plotly express是否可行,或者是否应该使用其他库。在


Tags: 数据过滤器颜色团队分数season球员假装
1条回答
网友
1楼 · 发布于 2024-06-16 16:56:53

所以我不能操作这个图例,但是我可以通过找到的here的下拉窗口小部件添加过滤器。根据您的IDE,您可能需要使用Jupyter来让小部件工作。我遇到了VSCode无法显示小部件的问题。我下面的功能是过滤球队名称,赛季,或球员和比较两个选项的过滤器。我希望这可以扩大,以满足您的需要。在

import pandas as pd
import plotly.express as px
import plotly.graph_objects as go
import ipywidgets as ipy
from ipywidgets import Output, VBox, widgets


# First gather the data I need and choose the display colors
playerData = pd.read_csv("playerData.csv")
teamNames = list(playerData['team_name'].unique().tolist());
seasons = list(playerData['season'].unique().tolist());
players = list(playerData['player'].unique().tolist());
color1 = 'red'
color2 = 'blue'
color3 = 'gray'

# This creates the initial figure.
# Note that px.scatter generates multiple scatter plot 'traces'. Each trace contains 
# the data points associated with 1 team/season/player depending on what the property
# of 'color' is set to.
trace1 = px.scatter(playerData, x='assisted', y='notassisted', color='team_name')
fig = go.FigureWidget(trace1)

# Create all our drop down widgets
filterDrop = widgets.Dropdown(
    description='Filter:',
    value='team_name',
    options=['team_name', 'season','player']  
)
teamDrop1 = widgets.Dropdown(
    description='Team Name:',
    value='NACIONAL',
    options=list(playerData['team_name'].unique().tolist())  
)
teamDrop2 = widgets.Dropdown(
    description='Team Name:',
    value='NACIONAL',
    options=list(playerData['team_name'].unique().tolist())  
)
playerDrop1 = widgets.Dropdown(
    description='Player:',
    value='A. DANRIDGE',
    options=list(playerData['player'].unique().tolist())  
)
playerDrop2 = widgets.Dropdown(
    description='Player:',
    value='A. DANRIDGE',
    options=list(playerData['player'].unique().tolist())  
)
seasonDrop1 = widgets.Dropdown(
    description='Season:',
    value='Season_17_18',
    options=list(playerData['season'].unique().tolist())  
)
seasonDrop2 = widgets.Dropdown(
    description='Season:',
    value='Season_17_18',
    options=list(playerData['season'].unique().tolist())  
)

# This will be called when the filter dropdown changes. 
def filterResponse(change):
    # generate the new traces that are filtered by teamname, season, or player
    tempTrace = px.scatter(playerData, x='assisted', y='notassisted', color=filterDrop.value)
    with fig.batch_update():
        # Delete the old traces and add the new traces in one at a time
        fig.data = []
        for tr in tempTrace.data:
            fig.add_scatter(x = tr.x, y = tr.y, hoverlabel = tr.hoverlabel, hovertemplate = tr.hovertemplate, \
                           legendgroup = tr.legendgroup, marker = tr.marker, mode = tr.mode, name = tr.name)
    # Call response so that it will color the markers appropriately
    response(change)

# This is called by all the other drop downs
def response(change):
    # colorList is a list of strings the length of the # of traces 
    if filterDrop.value == 'team_name':
        colorList = [color1 if x == teamDrop1.value else color2 if x == teamDrop2.value else color3 for x in teamNames]
    elif filterDrop.value == 'season':
        colorList = [color1 if x == seasonDrop1.value else color2 if x == seasonDrop2.value else color3 for x in seasons]
    else:
        colorList = [color1 if x == playerDrop1.value else color2 if x == playerDrop2.value else color3 for x in players]
    with fig.batch_update():
        # Color each trace according to our chosen comparison traces
        for i in range(len(colorList)):
            fig.data[i].marker.color = colorList[i]

# These determine what function should be called when a drop down changes
teamDrop1.observe(response, names="value")
seasonDrop1.observe(response, names="value")
playerDrop1.observe(response, names="value")
teamDrop2.observe(response, names="value")
seasonDrop2.observe(response, names="value")
playerDrop2.observe(response, names="value")
filterDrop.observe(filterResponse, names="value")

# HBox and VBox are used to organize the other widgets and figures
container1 = widgets.HBox([filterDrop]) 
container2 = widgets.HBox([teamDrop1, seasonDrop1, playerDrop1])
container3 = widgets.HBox([teamDrop2, seasonDrop2, playerDrop2])
widgets.VBox([container1, container2, container3, fig])

结果如下:enter image description here

相关问题 更多 >