Using XCom in Airflow GCS to BigQuery

Airflow's GCSToBigQueryOperator has a parameter, max_id_key. If set, it names a column in the BigQuery table being loaded. After the load completes, the operator selects the MAX value of that column from BigQuery. The result is returned by the operator's execute() method, and Airflow stores that return value in XCom for downstream operators to use. This is helpful for incremental loads: future executions can pick up from the max ID.
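
As a rough sketch of how this might be wired up, the snippet below sets max_id_key on the load task and reads the returned value from XCom in a downstream task. It assumes Airflow 2.x with the Google provider installed and an integer ID column. The bucket, object pattern, table, and column names are hypothetical placeholders, as are the helpers incremental_filter, report_max_id, and build_dag, which are not part of Airflow.

```python
from datetime import datetime


def incremental_filter(id_column: str, last_max_id) -> str:
    """Build a WHERE condition that keeps only rows newer than the last loaded ID."""
    if last_max_id is None:
        # No previous load recorded in XCom: do not filter anything out.
        return "TRUE"
    # Assumes an integer ID column; other key types would need different handling.
    return f"{id_column} > {int(last_max_id)}"


def report_max_id(ti):
    """Downstream callable: read the value the load task returned from execute()."""
    # With only task_ids given, xcom_pull fetches that task's return value.
    max_id = ti.xcom_pull(task_ids="load_gcs_to_bq")
    print("Next run could filter with:", incremental_filter("id", max_id))
    return max_id


def build_dag():
    # Imported lazily so the helpers above can be tried without Airflow installed.
    from airflow import DAG
    from airflow.operators.python import PythonOperator
    from airflow.providers.google.cloud.transfers.gcs_to_bigquery import (
        GCSToBigQueryOperator,
    )

    with DAG(
        dag_id="gcs_to_bq_max_id_example",  # hypothetical DAG name
        start_date=datetime(2024, 1, 1),
        schedule=None,  # older Airflow 2 releases use schedule_interval instead
        catchup=False,
    ) as dag:
        load = GCSToBigQueryOperator(
            task_id="load_gcs_to_bq",
            bucket="my-example-bucket",  # hypothetical bucket
            source_objects=["exports/events_*.csv"],  # hypothetical objects
            destination_project_dataset_table="my_project.my_dataset.events",
            source_format="CSV",
            autodetect=True,
            write_disposition="WRITE_APPEND",
            max_id_key="id",  # column whose MAX value is returned after the load
        )
        report = PythonOperator(
            task_id="report_max_id",
            python_callable=report_max_id,
        )
        load >> report
    return dag
```

In an actual DAG file, build_dag() would be called at module level (for example, dag = build_dag()) so the scheduler can discover it. The lazy imports are only a convenience for trying the helpers in isolation.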