Senior Data Engineer Databricks
Costa RicaFull-TimeSeniorSoftware Engineering
Important Information
- Location: Costa Rica
- Work Mode: Hybrid
Responsibilities and Duties
- Design, build, and optimize scalable data pipelines and lakehouse architectures in Databricks using the medallion model (bronze, silver, gold)
- Develop and maintain ETL/ELT processes in Python and PySpark, ensuring high data quality, performance, and reliability
- Implement and enforce data governance, security, encryption, and PII protection standards across the platform
- Collaborate with engineering and business teams to translate requirements into dimensional models and support migration from legacy systems to AWS-based Databricks environments
Qualifications and Skills
- Bachelor’s degree in Computer Science, Engineering, or a related technical field; advanced certifications in Data Engineering or Cloud are highly valued
- 5+ years of progressive experience in Data Engineering building scalable, production-grade data platforms
- 3+ years of deep hands-on experience with Databricks and modern Lakehouse architecture in enterprise environments
- Expert-level proficiency in Python and PySpark, designing high-performance distributed data processing pipelines
- Strong expertise designing and implementing dimensional data models (Star Schema, SCDs, fact/dimension load strategies) for analytical workloads
- Proven experience implementing and optimizing medallion architecture (bronze, silver, gold) with strong data quality validation frameworks
- Advanced knowledge of data governance, encryption standards, data lineage, anonymization, and secure handling of highly sensitive PII data (especially in regulated environments)
- Strong experience operating in AWS cloud environments, including performance tuning, cost optimization, and cloud-native data services
- Databricks certification strongly preferred and considered close to mandatory for senior-level candidacy
- Experience leveraging AI-assisted development tools or agentic coding platforms to enhance productivity and data engineering workflows is a strong plus
