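In an AWS Glue ETL job, a missing-column problem can occur when the source data contains columns that are missing from the target schema. The following code example shows one way to address it: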
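# Imports and context setup that the snippet needs in a Glue PySpark job
from pyspark.context import SparkContext
from awsglue.context import GlueContext
from awsglue.transforms import Join
glueContext = GlueContext(SparkContext.getOrCreate())
source_table = glueContext.create_dynamic_frame.from_catalog(database="mydb", table_name="my-table")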
# Keep only the columns that also exist in the target schema
selected_columns = ["col1", "col2", "col3"]
source_table = source_table.select_fields(selected_columns)
target_table = glueContext.create_dynamic_frame.from_catalog(database="mydb", table_name="target-table")
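# Join.apply performs an equality join and takes the key field(s) of each frame; it has no join-type argument.
# 'join_col' is a placeholder and must be present in both frames (so it must be among the selected columns).
merged_table = Join.apply(source_table, target_table, 'join_col', 'join_col')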
glueContext.write_dynamic_frame.from_catalog(frame=merged_table, database="mydb", table_name="target-table")
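
The hard-coded column list above could instead be derived by comparing the two schemas. The following is a minimal sketch in plain Python, not part of the Glue API: `diff_columns` is a hypothetical helper, and the column names are made up for illustration. In a Glue job, the input lists could presumably be obtained with something like `source_table.toDF().columns` and `target_table.toDF().columns`.

```python
def diff_columns(source_cols, target_cols):
    """Compare source and target column names (hypothetical helper).

    Returns three lists:
      to_select - target columns that the source also has (target order)
      missing   - target columns that the source lacks
      extra     - source columns that the target does not have
    """
    source_set = set(source_cols)
    target_set = set(target_cols)
    to_select = [c for c in target_cols if c in source_set]
    missing = [c for c in target_cols if c not in source_set]
    extra = [c for c in source_cols if c not in target_set]
    return to_select, missing, extra


# Illustrative column names only
to_select, missing, extra = diff_columns(
    ["col1", "col2", "col3", "col4"],
    ["col1", "col2", "col3", "col5"],
)
print("select:", to_select)
print("missing in source:", missing)
print("extra in source:", extra)
```

The `to_select` list could then be passed to `select_fields`, while `missing` and `extra` show which columns need a default value or should be dropped before writing.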