×
Register Here to Apply for Jobs or Post Jobs. X

Data Engineer; PySpark

Job in Dubai, Dubai, UAE/Dubai
Listing for: GSS Tech Group
Full Time position
Listed on 2026-02-28
Job specializations:
  • IT/Tech
    Data Engineer, Big Data
Salary/Wage Range or Industry Benchmark: 120000 - 200000 AED Yearly AED 120000.00 200000.00 YEAR
Job Description & How to Apply Below
Position: Data Engineer (PySpark)

We are seeking a highly skilled Data Engineer with strong expertise in Py Spark and the Cloudera Data Platform (CDP). The ideal candidate will design, develop, and maintain scalable data pipelines while ensuring high data quality, performance, and availability across the organisation.

This role requires hands-on experience in big data ecosystems, cloud-native technologies, and advanced data processing frameworks. You will collaborate with cross-functional teams to build reliable and high-performance data solutions that drive business insights.

Key Responsibilities 1. Data Pipeline Development
  • Design, develop, and maintain scalable ETL/ELT pipelines using PySpark on CDP
  • Ensure data integrity, reliability, and performance optimisation
2. Data Ingestion
  • Develop ingestion frameworks to collect data from relational databases, APIs, streaming sources, and file systems
  • Load structured and unstructured data into Data Lake/Data Warehouse environments
3. Data Transformation & Processing
  • Process, cleanse, and transform large-scale datasets using Py Spark
  • Build reusable data processing components
4. Performance Optimisation
  • Tune Spark jobs and Cloudera components for optimal performance
  • Optimise memory, partitioning, and execution plans
  • Reduce ETL runtime and improve cluster efficiency
5. Data Quality & Validation
  • Implement data validation checks and monitoring mechanisms
  • Ensure end-to-end data quality and governance standards
6. Automation & Orchestration
  • Automate workflows using tools such as Apache Oozie, Apache Airflow, or similar orchestration frameworks
  • Maintain CI/CD integration for data pipelines
7. Monitoring & Support
  • Monitor pipeline health and troubleshoot failures
  • Provide production support and continuous improvements
Required Skills & Qualifications
  • 5+ years of experience in Data Engineering
  • Strong hands‑on experience in Py Spark
  • Experience working on Cloudera Data Platform (CDP)
  • Strong knowledge of Hadoop ecosystem (HDFS, Hive, Impala, YARN)
  • Proficiency in SQL and data modelling concepts
  • Experience with workflow orchestration tools (Airflow, Oozie, etc.)
  • Good understanding of data warehousing concepts
  • Experience with performance tuning and optimisation
Good to Have
  • Experience with cloud platforms (AWS, Azure, GCP)
  • Knowledge of streaming tools (Kafka, Spark Streaming)
  • Exposure to Dev Ops practices and CI/CD pipelines
  • Banking/Financial Services domain experience
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary