繁体   English   中英

使用 Pandas 创建“实时”真值表

[英]Using Pandas to create a “live” truth table

我收到来自 API 的警报字典,其中包含有关设备状况变化的更新,例如:

        alert1 = {'equipment': 'equipment1', 'condition1': True}
        alert2 = {'equipment': 'equipment1', 'condition2': True}
        alert3 = {'equipment': 'equipment1', 'condition3': False}
        alert4 = {'equipment': 'equipment2', 'condition1': True}
        alert5 = {'equipment': 'equipment2', 'condition2': False}
        alert6 = {'equipment': 'equipment3', 'condition2': False}
        ...

传入警报将触发 function 以使用预期的 output 更新“实时”真值表:

equipment   condition1  condition2  condition3
equipment1    True        True         False
equipment2    True        False        NaN
equipment3    NaN         False        Nan

如果收到新警报,该表应更新。

使用 Pandas 实现这一目标的最佳方法是什么?

您可以创建一个空的 dataframe,然后在有新数据时更新它。

import pandas as pd


df = pd.DataFrame(
    index=[f'equipment{i}' for i in range(1, 4)], 
    columns=[f'condition{i}' for i in range(1, 4)]
)

print(df)

# update whenever you have the new data
df.loc['equipment1', 'condition1'] = True

print(df)

与首先创建一个空的 dataframe 的解决方案相同。 然后用 alter_list seq by seq 更新 df。

alter_list = [
        {'equipment': 'equipment1', 'condition1': True},
        {'equipment': 'equipment1', 'condition2': True},
        {'equipment': 'equipment1', 'condition3': False},
        {'equipment': 'equipment2', 'condition1': True},
        {'equipment': 'equipment2', 'condition2': False},
        {'equipment': 'equipment3', 'condition2': False},]
# alter_list

df = pd.DataFrame(columns=['condition1', 'condition2', 'condition3'], 
                  index=['equipment1', 'equipment2', 'equipment3'])
for alter in alter_list:
    equipment = alter.pop('equipment')
    for condition,v in alter.items():
        print(equipment, condition, v)
        df.loc[equipment, condition] = v   

结果:

print(df.fillna(''))

          condition1  condition2 condition3
equipment1       True        True      False
equipment2       True       False           
equipment3                  False

可能有点矫枉过正......您可以将 append 转换为df ,然后在信号进入时重新生成真值表df2

cl = [{'condition1': {'equipment1': True}},
 {'condition2': {'equipment1': True}},
 {'condition3': {'equipment1': False}},
 {'condition1': {'equipment2': True}},
 {'condition2': {'equipment2': False}},
 {'condition2': {'equipment3': False}}]

# fully expand list / embedded dict
df = pd.json_normalize(cl, sep="-")
# bring multiple inputs together,  there's only one signal per row
df2 = df.assign(foo=1).groupby("foo").agg({c:"first" for c in df.columns}).reset_index().drop(columns="foo")
# now restructure as required
df2 = df2.T.reset_index().assign(equip=lambda dfa: dfa["index"].apply(lambda r: r.split("-")[1]),
                          cond=lambda dfa: dfa["index"].apply(lambda r: r.split("-")[0]),
                         ).drop(columns="index").set_index(["equip","cond"]).unstack(1).droplevel(0, axis=1).reset_index()

output

cond       equip condition1 condition2 condition3
0     equipment1       True       True      False
1     equipment2       True      False        NaN
2     equipment3        NaN      False        NaN

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM