Relative Content

Tag Archive for pythonairflow

Nested operators in Airflow: good practice?

My goal is to pass a file path string from one @task into EmailOperators, so that I can read the dataset at that path and apply logic to build the operators that will send the emails. My code looks like this:
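The poster's code is not included in this excerpt. Separately from it, here is a minimal sketch of one commonly suggested alternative to creating operators inside a task: have a task return one dictionary of email arguments per row, then let Airflow map an EmailOperator over that list. The column names (`recipient`, `name`, `amount`) and the function name `build_email_kwargs` are hypothetical. The keys `to`, `subject` and `html_content` are EmailOperator arguments. The Airflow wiring is shown only as a comment and assumes a version with dynamic task mapping (`expand_kwargs`, Airflow 2.4 or later).

```python
import csv
from typing import Dict, List


def build_email_kwargs(file_path: str) -> List[Dict[str, str]]:
    """Read a CSV file and return one dict of EmailOperator arguments per row.

    Assumed (hypothetical) columns: recipient, name, amount.
    """
    with open(file_path, newline="") as handle:
        rows = list(csv.DictReader(handle))
    return [
        {
            "to": row["recipient"],
            "subject": f"Report for {row['name']}",
            "html_content": f"<p>Amount due: {row['amount']}</p>",
        }
        for row in rows
    ]


# In a DAG file (not executed here) the function would be decorated with
# @task and mapped, instead of instantiating operators inside a task:
#
#   EmailOperator.partial(task_id="send_email").expand_kwargs(
#       build_email_kwargs(path_from_upstream_task)
#   )
```

Keeping the per-row logic in a plain function also makes it testable without a running scheduler.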

Unable to run Apache Airflow

I have been trying to install Apache Airflow on my local Mac for weeks. No tutorial has been helpful. I’ve been reading the Airflow website docs through and through, copying step-by-step processes, and using Claude and ChatGPT, but I can’t seem to get it running.

Non-critical errors in logs cluttering output

I am working with an Airflow environment managed through AWS Managed Workflows for Apache Airflow (MWAA). In my log output, I have been noticing non-critical errors that clutter the logs, which I believe is a code quality issue. These errors are related to the SecretsManagerBackend, specifically when it tries to retrieve variables from AWS Secrets Manager.

Parallel execution of chosen list of tasks

I have a DAG that takes a customizable list of tasks to execute as a parameter, so that I can choose, for example, to execute only the tasks ['prod_1','prod_2','prod_5']. I do this via a BranchPythonOperator() that lets me run only the tasks I pass as input when triggering the DAG:
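The poster's code is not included in this excerpt. For illustration only, a branch callable of the kind described could look roughly like the sketch below. The names `ALL_TASKS` and `choose_tasks` are hypothetical. The relevant Airflow behaviour is that a BranchPythonOperator callable may return a list of task ids, in which case every listed task is followed and the rest are skipped.

```python
from typing import Iterable, List

# Hypothetical list of task ids defined in the DAG.
ALL_TASKS = ["prod_1", "prod_2", "prod_3", "prod_4", "prod_5"]


def choose_tasks(requested: Iterable[str],
                 available: Iterable[str] = ALL_TASKS) -> List[str]:
    """Return the task ids to follow, rejecting ids the DAG does not define."""
    available_set = set(available)
    requested_list = list(requested)
    unknown = [task_id for task_id in requested_list
               if task_id not in available_set]
    if unknown:
        raise ValueError(f"Unknown task ids: {unknown}")
    # dict.fromkeys keeps the first occurrence of each id, in order.
    return list(dict.fromkeys(requested_list))


# As a branch callable (not executed here), the requested list would
# typically come from the run's parameters, for example:
#   BranchPythonOperator(
#       task_id="branch",
#       python_callable=lambda **ctx: choose_tasks(ctx["params"]["tasks"]),
#   )
```

The selected tasks have no dependencies on each other, so the scheduler is free to run them in parallel, subject to the executor and the DAG's concurrency settings. If a later task joins the branches, it probably needs a trigger rule such as `none_failed_min_one_success` so that the skipped branches do not cause it to be skipped too.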

Set which nodes to execute in parallel

I have a list of products, let’s say “product_1”, “product_2”, etc., and for each one I want to run a node, created in a for loop, that runs a Python Jupyter notebook via CustomPythonNotebookOperator(), with no more than 3 nodes running at the same time, as specified by the parameter parallel. This kind of node comes from my organization and cannot be changed.
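One way to cap concurrency without modifying the operator is to split the products into `parallel` "lanes" and chain the nodes within each lane, so that at most `parallel` nodes can run at once. The sketch below covers only the grouping step. The name `assign_lanes` is hypothetical, and the wiring to CustomPythonNotebookOperator is left as a comment because its arguments are not given in the question.

```python
from typing import Dict, List


def assign_lanes(products: List[str], parallel: int) -> Dict[int, List[str]]:
    """Distribute products round-robin over at most `parallel` lanes."""
    if parallel < 1:
        raise ValueError("parallel must be at least 1")
    lanes: Dict[int, List[str]] = {
        lane: [] for lane in range(min(parallel, len(products)))
    }
    for index, product in enumerate(products):
        lanes[index % parallel].append(product)
    return lanes


# In the DAG file (not executed here), each lane would be chained in order:
#   for lane_products in assign_lanes(products, parallel).values():
#       previous = None
#       for product in lane_products:
#           node = CustomPythonNotebookOperator(...)  # arguments not known
#           if previous is not None:
#               previous >> node
#           previous = node
```

The lanes are fixed in advance, so a slow notebook delays everything queued behind it in its lane. If the custom operator accepts the standard `pool` argument inherited from BaseOperator (an assumption, since the operator is internal), assigning all nodes to an Airflow pool with 3 slots would give the same limit with dynamic scheduling.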

Airflow – set which nodes to execute in parallel

I have a list of products, let’s say “product_1”, “product_2”, etc., and for each one I want to run a node, created in a for loop, that runs a Python Jupyter notebook via CustomPythonNotebookOperator(), with no more than 3 nodes running at the same time, as specified by the parameter parallel. This kind of node comes from my organization and cannot be changed.