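The following example uses Python and the pandas library to compare two tables and retrieve the old and new values of the records that changed: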
import pandas as pd

# Create two sample tables
df1 = pd.DataFrame({'ID': [1, 2, 3],
                    'Name': ['Alice', 'Bob', 'Charlie'],
                    'Age': [25, 30, 35]})
df2 = pd.DataFrame({'ID': [1, 2, 3],
                    'Name': ['Alice', 'Bob', 'Charlie'],
                    'Age': [26, 30, 35]})

# Merge the two tables on ID; the suffixes distinguish old and new columns
merged = df1.merge(df2, on='ID', how='outer', suffixes=('_old', '_new'))

# Compare each non-key column and collect the index of every row that changed
value_columns = [column for column in df1.columns if column != 'ID']
changed_rows = []
for column in value_columns:
    changed_rows.extend(merged[merged[f'{column}_old'] != merged[f'{column}_new']].index)

# For each changed row, print the old and new values of the columns that differ
for index in sorted(set(changed_rows)):
    print(f"ID: {merged.loc[index, 'ID']}")
    for column in value_columns:
        old_value = merged.loc[index, f'{column}_old']
        new_value = merged.loc[index, f'{column}_new']
        if old_value != new_value:  # skip columns that did not change
            print(f"{column}: {old_value} -> {new_value}")
    print()  # blank line between records
The output is as follows:
ID: 1
Age: 25 -> 26
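
If the changes are needed as data rather than printed text, the same comparison can be done without row-by-row printing. The following is only a sketch under stated assumptions: the helper name `diff_tables` is hypothetical (it is not a pandas function), the key column is assumed to be unique within each table, and both tables are assumed to have the same value columns. A row that exists in only one table shows up with NaN on the missing side.

```python
import pandas as pd


def diff_tables(old, new, key="ID"):
    """Return one row per changed cell: key, column, old value, new value.

    Hypothetical helper (not a pandas API). Assumes `key` is unique in
    each table and that both tables have the same value columns.
    """
    left = old.set_index(key)
    right = new.set_index(key)

    # Align both tables on the union of keys; a row missing from one
    # table becomes NaN on that side and is reported as changed
    all_keys = left.index.union(right.index)
    left = left.reindex(all_keys)
    right = right.reindex(index=all_keys, columns=left.columns)

    # A cell changed if the values differ and are not both missing
    changed = left.ne(right) & ~(left.isna() & right.isna())

    frames = []
    for column in left.columns:
        mask = changed[column].to_numpy()
        if mask.any():
            frames.append(pd.DataFrame({
                key: all_keys.to_numpy()[mask],
                "column": column,
                "old": left[column].to_numpy()[mask],
                "new": right[column].to_numpy()[mask],
            }))
    if not frames:
        # No differences: return an empty table with the same layout
        return pd.DataFrame(columns=[key, "column", "old", "new"])
    return pd.concat(frames, ignore_index=True)


# Same sample tables as in the example above
df1 = pd.DataFrame({'ID': [1, 2, 3],
                    'Name': ['Alice', 'Bob', 'Charlie'],
                    'Age': [25, 30, 35]})
df2 = pd.DataFrame({'ID': [1, 2, 3],
                    'Name': ['Alice', 'Bob', 'Charlie'],
                    'Age': [26, 30, 35]})

changes = diff_tables(df1, df2)
print(changes)
```

For the sample tables this yields a single row: ID 1, column Age, old value 25, new value 26. Returning a DataFrame makes it easy to filter the changes or write them to a file, which the print-based loop does not allow.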