Middle Data Engineer
Remote, hubs in Dubai, Yerevan, ... · Full-time · Global
Mid level · Remote
Active · Posted within the last 30 days
Job Description
[AI-summarized by JobStash]
You will build and maintain data pipelines and data-related services that power analytics across products. You will extend SQL-based pipelines, add new data sources to ETL processes, refactor Python scripts into modular production-quality code, configure CI for linting and tests, and update pipeline documentation. You will work closely with senior engineers and analysts, ask clarifying questions, and gradually take ownership of data platform components while learning modern data workflows.
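The core of the role is extending SQL-based pipelines and wiring new sources into ETL. As a minimal sketch of what that can look like, the following uses an in-memory SQLite database with hypothetical table names (`staging_orders`, `orders_by_customer`) chosen purely for illustration; the actual warehouse and schemas are not specified in the posting.

```python
import sqlite3

def load_new_source(conn: sqlite3.Connection, rows: list[tuple[str, float]]) -> int:
    """Load rows from a hypothetical new source into a staging table,
    then merge them into an analytics table with a SQL transformation."""
    conn.execute("CREATE TABLE IF NOT EXISTS staging_orders (customer TEXT, amount REAL)")
    conn.execute(
        "CREATE TABLE IF NOT EXISTS orders_by_customer (customer TEXT PRIMARY KEY, total REAL)"
    )
    conn.executemany("INSERT INTO staging_orders VALUES (?, ?)", rows)
    # SQL-based transformation: aggregate staged rows into the target table,
    # adding to existing totals on conflict (SQLite upsert).
    conn.execute("""
        INSERT INTO orders_by_customer (customer, total)
        SELECT customer, SUM(amount) FROM staging_orders GROUP BY customer
        ON CONFLICT(customer) DO UPDATE SET total = total + excluded.total
    """)
    conn.commit()
    return conn.execute("SELECT COUNT(*) FROM orders_by_customer").fetchone()[0]

conn = sqlite3.connect(":memory:")
n = load_new_source(conn, [("alice", 10.0), ("bob", 5.0), ("alice", 2.5)])
print(n)  # → 2 (two distinct customers)
```

In a production pipeline the same pattern would typically run inside an orchestrated task rather than a script, but the staging-then-merge shape carries over.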
Requirements
- Confident communication and proactive clarification seeking
- Responsibility, ownership, and proactive communication of challenges
- Comfortable with IDEs and version control systems such as Git
- Basic understanding of clean code principles and software delivery workflows
- Essential Python skills, including language fundamentals and data structures
- Confident with SQL basics
- Regular and thoughtful use of AI tools
- Motivation to learn and grow in data engineering
- Knowledge of data engineering fundamentals, including ETL, data modeling, data quality, and storage systems
- Experience using Apache Airflow
- Familiarity with containerization tools such as Docker (build, run, logs)
- Basic experience with cloud platforms (GCP, AWS, or Azure) is a plus
- Experience with BI tools (Superset, Metabase, Power BI) is a plus
- Personal data projects (ETL scripts, dashboards, analytics) are a plus
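Among the fundamentals listed above, data quality is the one most directly expressible in a few lines. Below is a minimal sketch of a record-level quality check, assuming a hypothetical schema with `id` and `amount` fields; the rules shown (required fields present, no negative amounts) are illustrative, not taken from the posting.

```python
def check_quality(records: list[dict]) -> list[str]:
    """Return human-readable data-quality issues found in the records."""
    issues = []
    for i, rec in enumerate(records):
        # Rule 1: required fields must be present and non-null.
        for field in ("id", "amount"):
            if rec.get(field) is None:
                issues.append(f"row {i}: missing {field}")
        # Rule 2: amounts must not be negative.
        amount = rec.get("amount")
        if isinstance(amount, (int, float)) and amount < 0:
            issues.append(f"row {i}: negative amount {amount}")
    return issues

issues = check_quality([
    {"id": 1, "amount": 9.5},
    {"id": 2, "amount": -3},
    {"amount": 1},
])
print(issues)  # → ['row 1: negative amount -3', 'row 2: missing id']
```

Checks like these usually run as a validation step between extraction and load, so bad rows are flagged before they reach analytics tables.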
Responsibilities
- Build and maintain data pipelines and data-related services
- Contribute to shared tools and libraries
- Upgrade data platform components and services
- Communicate with analysts to understand data needs
- Extend SQL-based pipelines with new transformations
- Add new data sources to ETL processes
- Refactor Python scripts into modular code and add logging
- Configure CI for linting and tests for data repositories
- Update pipeline documentation after logic or schema changes
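One of the responsibilities above is refactoring scripts into modular code with logging. As a minimal sketch of that shape, the following pulls the transformation out of a one-off script into a pure, testable function and adds structured logging via the standard `logging` module; the field names (`amount`, `amount_cents`) are hypothetical.

```python
import logging

logger = logging.getLogger("pipeline")

def transform(rows: list[dict]) -> list[dict]:
    """Pure transformation step extracted from a one-off script:
    derive an integer cents column from a float amount."""
    out = [{**row, "amount_cents": int(round(row["amount"] * 100))} for row in rows]
    logger.info("transformed %d rows", len(out))
    return out

def run(rows: list[dict]) -> list[dict]:
    """Thin entry point: configure logging, run the step, surface failures."""
    logging.basicConfig(level=logging.INFO)
    try:
        return transform(rows)
    except Exception:
        logger.exception("transform failed")
        raise

result = run([{"amount": 1.25}, {"amount": 2.0}])
print(result)
```

Keeping `transform` free of I/O and configuration is what makes the CI step in the next bullet cheap: lint and unit-test the pure function without standing up any infrastructure.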
Benefits & Perks
- Remote work setup with access to hubs in Dubai, Yerevan, London and Belgrade
- Compensation for medical expenses
- Provision of necessary equipment
- 20 working days of paid vacation annually
- 11 days off per year
- 14 days of paid sick leave
- Access to internal conferences
- Access to English courses
- Access to corporate events
- Regular performance reviews
Tech Stack
data pipeline · logging · Python · SQL · ETL · CI · data modeling · testing · GCP · Git · project: The Open Platform