[英]Using Pandas to create a “live” truth table
我收到来自 API 的警报字典,其中包含有关设备状况变化的更新,例如:
alert1 = {'equipment': 'equipment1', 'condition1': True}
alert2 = {'equipment': 'equipment1', 'condition2': True}
alert3 = {'equipment': 'equipment1', 'condition3': False}
alert4 = {'equipment': 'equipment2', 'condition1': True}
alert5 = {'equipment': 'equipment2', 'condition2': False}
alert6 = {'equipment': 'equipment3', 'condition2': False}
...
传入警报将触发 function 以使用预期的 output 更新“实时”真值表:
equipment condition1 condition2 condition3
equipment1 True True False
equipment2 True False NaN
equipment3 NaN False Nan
如果收到新警报,该表应更新。
使用 Pandas 实现这一目标的最佳方法是什么?
您可以创建一个空的 dataframe,然后在有新数据时更新它。
import pandas as pd
df = pd.DataFrame(
index=[f'equipment{i}' for i in range(1, 4)],
columns=[f'condition{i}' for i in range(1, 4)]
)
print(df)
# update whenever you have the new data
df.loc['equipment1', 'condition1'] = True
print(df)
与首先创建一个空的 dataframe 的解决方案相同。 然后用 alter_list seq by seq 更新 df。
alter_list = [
{'equipment': 'equipment1', 'condition1': True},
{'equipment': 'equipment1', 'condition2': True},
{'equipment': 'equipment1', 'condition3': False},
{'equipment': 'equipment2', 'condition1': True},
{'equipment': 'equipment2', 'condition2': False},
{'equipment': 'equipment3', 'condition2': False},]
# alter_list
df = pd.DataFrame(columns=['condition1', 'condition2', 'condition3'],
index=['equipment1', 'equipment2', 'equipment3'])
for alter in alter_list:
equipment = alter.pop('equipment')
for condition,v in alter.items():
print(equipment, condition, v)
df.loc[equipment, condition] = v
结果:
print(df.fillna(''))
condition1 condition2 condition3
equipment1 True True False
equipment2 True False
equipment3 False
可能有点矫枉过正......您可以将 append 转换为df
,然后在信号进入时重新生成真值表df2
。
cl = [{'condition1': {'equipment1': True}},
{'condition2': {'equipment1': True}},
{'condition3': {'equipment1': False}},
{'condition1': {'equipment2': True}},
{'condition2': {'equipment2': False}},
{'condition2': {'equipment3': False}}]
# fully expand list / embedded dict
df = pd.json_normalize(cl, sep="-")
# bring multiple inputs together, there's only one signal per row
df2 = df.assign(foo=1).groupby("foo").agg({c:"first" for c in df.columns}).reset_index().drop(columns="foo")
# now restructure as required
df2 = df2.T.reset_index().assign(equip=lambda dfa: dfa["index"].apply(lambda r: r.split("-")[1]),
cond=lambda dfa: dfa["index"].apply(lambda r: r.split("-")[0]),
).drop(columns="index").set_index(["equip","cond"]).unstack(1).droplevel(0, axis=1).reset_index()
cond equip condition1 condition2 condition3
0 equipment1 True True False
1 equipment2 True False NaN
2 equipment3 NaN False NaN
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.