More jobs:
Hadoop Admin - SRE
Job in
500016, Prakāshamnagar, Telangana, India
Listed on 2026-02-03
Listing for:
Confidential
Full Time
position Listed on 2026-02-03
Job specializations:
-
IT/Tech
Cloud Computing, SRE/Site Reliability
Job Description & How to Apply Below
Key Responsibilities:
Administer, monitor, and support the Hadoop ecosystem including HDFS, YARN, Hive, HBase, Spark, Kafka , and related big data services.
Manage and maintain Spark workloads for both batch and streaming applications, ensuring performance and resource efficiency.
Deploy, manage, and troubleshoot Hadoop/Spark clusters on Kubernetes platforms.
Ensure site reliability by implementing proactive monitoring, alerting, incident response, and root cause analysis (SRE best practices).
Automate infrastructure provisioning, configuration, and application deployments using Infrastructure as Code (IaC) and scripting.
Monitor system performance and plan capacity to optimize cluster resource utilization.
Manage AWS infrastructure components including EC2, S3, EKS (Kubernetes), IAM, VPC, and other cloud services.
Implement security best practices such as Kerberos , access controls, encryption, and service authentication.
Perform cluster upgrades, patching, backups, and disaster recovery planning and execution.
Collaborate with data engineering, Dev Ops, and application teams for seamless integration and operations.
Provide L2/L3 production support and participate in on‑call rotations for critical incident handling.
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×