fosstodon.org is one of the many independent Mastodon servers you can use to participate in the fediverse.
Fosstodon is an invite only Mastodon instance that is open to those who are interested in technology; particularly free & open source software. If you wish to join, contact us for an invite.

Administered by:

Server stats:

9.8K
active users

#apacheairflow

1 post1 participant0 posts today

Big data can be difficult data. 🤖

Your pipeline takes several hours to process terabytes of data... and then something goes wrong. You have to start over. 😩

Incremental loading can help!

Conquer your big data by breaking it down in your Airflow DAGs. You'll reclaim your time, lower your cloud bill, and maybe even lower your cholesterol:

📽️ Watch the video: youtube.com/watch?v=g2dEcRILAT
📖 Read the blog: kpdata.dev/blog/airflow-increm

🍹 C'est l'heure du cocktail de bienvenue proposé par Satya !

- une dose de #ModernDataStack,
- un trait de géo,
- un zeste d'Open Source,
- et beaucoup d'amour 💗.

Cette recette vous est servie dans cet article qui détaille comment le Gard valorise ses géo-données.

:geotribu: geotribu.fr/articles/2025/2025

Relecture 🧐 : @geojulien & Michaël Galien

#PostgreSQL #PostGIS #GDAL #OGR #DBT #Metabase #ApacheAirflow

geotribu.frGeotribu - L'enjeu de la data au département du GardComment le département du Gard valorise son patrimoine de données classiques et de géo-données au travers de différents outils numériques.

Stuck on when to run that pipeline again? I've been there too many times! 🤯

Scheduling data pipelines can be a complex puzzle--time-based, frequency-based, event-driven... there are so many options. Let's unravel the mystery together! 🔍

Discover the methods for scheduling Airflow DAGs and make your data engineering life simpler.

📽️ Watch the video: youtu.be/NZOJZukiX6Y
📖 Read the blog: kpdata.dev/blog/airflow-schedu

Just caught up with the recent Delta Lake webinar,

> Revolutionizing Delta Lake workflows on AWS Lambda with Polars, DuckDB, Daft & Rust

Some interesting hints there regarding lightweight processing of big-ish data. Easy to relate to any other framework instead of Lambda, e.g. #ApacheAirflow tasks

youtu.be/BR9oFD0QMAs

🚀 Apache Airflow: Orchestrierung komplexer Workflows leicht gemacht 🚀

Wenn du eine Lösung suchst, um komplexe Datenpipelines zu verwalten, ist Apache Airflow eine starke Wahl! In unserem neuen Artikel zeigt unser Entwickler, wie Airflow funktioniert, welche Vorteile es bietet und wie es in Bereichen wie maschinellem Lernen eingesetzt wird. 💻✨ Mit Codebeispielen zeigt er, wie du eigene Workflows effizient aufsetzen kannst.

Hast du schon mit Apache Airflow gearbeitet? Welche Erfahrungen hast du gemacht? Lass uns darüber sprechen! 🛠️👇

elinext.de/blog/einsatz-von-ap

Still, #dagster has less dependencies, and after some battling and downgrading version, I managed to start the dev environment...

Building a #Apacheairflow container, is pure chaos. Installing via pip, is another kind of hell, with broken builds all over the place (google-re2)... A tool that has 9 years in the market, being so overwhelming its installation process

At least I can make #dagster run.

I guess we are all ill served with workflow orchestration tools anyway, on the open source world

FlowFixation: AWS Apache Airflow Service Takeover Vulnerability

Date: March 21, 2024
CVE: Not specified
Sources: Tenable Blog

Issue Summary

Tenable Research discovered a vulnerability, named FlowFixation, in AWS Managed Workflows for Apache Airflow (MWAA) that could allow session hijacking leading to a full takeover of the victim's web management panel.

Technical Key findings

FlowFixation combines session fixation and XSS via Amazon AWS domain misconfiguration, enabling attackers to authenticate known sessions and gain control over victim's Apache Airflow management panels.

Vulnerable products

  • AWS Managed Workflows for Apache Airflow (MWAA)

Impact assessment

Potential for remote code execution on underlying instances and lateral movement to other services.

Patches or workaround

AWS has addressed the vulnerability. Users should ensure they are using updated services.

Tags

Tenable® · FlowFixation: AWS Apache Airflow Service Takeover Vulnerability and Why Neglecting Guardrails Puts Major CSPs at RiskTenable Research discovered a one-click account takeover vulnerability in the AWS Managed Workflows Apache Airflow service that could have allowed full takeover of a victim’s web management panel of the Airflow instance. The discovery of this now-resolved vulnerability reveals a broader problem of misconfigured shared-parent domains that puts customers of major CSPs at risk.