要按分组计算两个数据帧之间的差异的标准差,可以按照以下步骤进行:
import pandas as pd
import numpy as np
df1 = pd.DataFrame({'group': ['A', 'A', 'B', 'B', 'C', 'C'],
'value': [1, 2, 3, 4, 5, 6]})
df2 = pd.DataFrame({'group': ['A', 'A', 'B', 'B', 'C', 'C'],
'value': [2, 3, 4, 5, 6, 7]})
merged_df = pd.merge(df1, df2, on='group', suffixes=('_df1', '_df2'))
merged_df['difference'] = merged_df['value_df2'] - merged_df['value_df1']
grouped_df = merged_df.groupby('group')['difference'].std()
完整示例代码如下:
import pandas as pd
import numpy as np
df1 = pd.DataFrame({'group': ['A', 'A', 'B', 'B', 'C', 'C'],
'value': [1, 2, 3, 4, 5, 6]})
df2 = pd.DataFrame({'group': ['A', 'A', 'B', 'B', 'C', 'C'],
'value': [2, 3, 4, 5, 6, 7]})
merged_df = pd.merge(df1, df2, on='group', suffixes=('_df1', '_df2'))
merged_df['difference'] = merged_df['value_df2'] - merged_df['value_df1']
grouped_df = merged_df.groupby('group')['difference'].std()
print(grouped_df)
这将打印出按分组计算的差异的标准差。