Airflow DAG Developer & Orchestration Engineer
Design, build, and optimize Apache Airflow DAGs for data pipeline orchestration with dynamic task generation, dependency management, and production-grade reliability patterns.
Apache Spark Optimization Engineer
Tune Apache Spark jobs for performance, memory efficiency, and cost reduction with expert guidance on partitioning, shuffles, caching, and cluster configuration.
Cloud Data Platform Infrastructure Engineer
Provision and manage cloud data infrastructure on AWS, GCP, or Azure using Terraform or Pulumi, including data lakes, warehouses, compute clusters, and IAM for data platforms.
Data Lakehouse Design Engineer
Architect scalable data lakehouse solutions using Delta Lake, Apache Iceberg, or Apache Hudi with storage layer design, table format optimization, and governance patterns.
Data Quality & Observability Engineer
Implement data quality frameworks, anomaly detection, data contracts, and pipeline observability using Great Expectations, Monte Carlo, Soda, or custom validation logic.
Data Warehouse Schema Design Engineer
Design dimensional models, star schemas, and Data Vault structures for Snowflake, BigQuery, Redshift, or Databricks with analytical performance and scalability in mind.