
Principal Site Reliability Engineer

Job in Atlanta, Fulton County, Georgia, 30383, USA
Listing for: QGenda
Full Time position
Listed on 2026-01-12
Job specializations:
  • IT/Tech
    Cloud Computing, SRE/Site Reliability, Systems Engineer, IT Support
Salary/Wage Range or Industry Benchmark: 125,000 - 150,000 USD yearly
Job Description & How to Apply Below

QGenda is redefining healthcare workforce management everywhere care is delivered. We're on a mission to empower the healthcare industry to better onboard, deploy, and manage their workforce. With more than 4,500 healthcare organizations trusting our unified software platform and over 700 employees across the US, we are united in our vision and culture to make a difference for our customers.

At QGenda, we value our employees and their contributions toward the success of the business. We strive to create a dynamic work environment that fosters growth, innovation, and collaboration, where employees can be proud of the work they do and the impact it has on the healthcare industry.

How You'll Make an Impact

System Reliability and Performance:

  • Design, implement, and manage scalable systems that ensure high availability, fault tolerance, and optimal performance.
  • Continuously monitor and enhance system health and performance through data analysis and metrics.
  • Embed observability (metrics, logs, traces, alerts) with actionable thresholds and up-to-date runbooks.

Automation and Tooling:

  • Eliminate toil by building automation and self-service tools for common operational workflows.
  • Own CI/CD pipelines (build, test, security scans) and enable progressive delivery (blue/green, canary).
  • Manage infrastructure as code via Terraform and configuration management with Git-backed workflows.

Incident Management and Troubleshooting:

  • Participate in on-call; triage, mitigate, and resolve incidents within defined SLAs.
  • Lead incident response and blameless post-incident reviews; document RCAs and drive corrective actions to closure.
  • Maintain runbooks/playbooks and regularly rehearse disaster recovery scenarios.

Infrastructure Management:

  • Operate and secure AWS environments (IAM, VPC, EC2/ECS, RDS, S3, Lambda, etc.) with a focus on resilience and compliance.
  • Optimize cost, performance, and reliability (rightsizing, autoscaling, reservations/savings plans, tagging, spend monitoring, etc.).
  • Serve as a technical advisor to engineering teams on infrastructure and operations best practices.
  • Mentor peers on SRE practices; promote observability, continuous improvement, and a blameless culture.
  • Contribute to roadmaps and capacity planning to align reliability goals with product objectives.
Who You Are
  • Availability for off-hours deployments and upgrades of production systems during release and maintenance windows. This is a rotation, and you would be on duty for two weeks at a time.
  • Strong problem-solving skills and ability to work effectively under pressure.
  • Excellent communication skills for cross-functional collaboration as well as documentation creation.
Experience You Bring
  • B.S. in Computer Science, Computer Information Systems, or Computer Engineering from a major U.S. university or equivalent industry experience
  • 8+ years of experience as a DevOps, SRE, or Systems Engineer
  • Advanced proficiency with at least one scripting or programming language
  • Experience with Docker and container orchestration tools such as AWS ECS
  • Hands-on experience building infrastructure and supporting applications in AWS using services such as Lambda, EC2, ECS, S3, SNS, SQS, RDS, Redshift, and ElastiCache
  • Experience with logging and with creating dashboards and alerts using observability tools such as Datadog and Amazon CloudWatch
  • Strong understanding of networking and DNS
  • Familiarity with configuration management and infrastructure as code (IaC) tools such as Terraform
  • Firm understanding of and experience with Agile and Scrum SDLC processes
  • Experience using a distributed version control system (Git preferred) for check-ins, branching, merging, pull requests, code reviews, etc.
  • Knowledge of CI/CD best practices and tools such as AWS CodeBuild, Jenkins, and/or TeamCity
  • Experience designing and delivering secure, high-performance, and highly available cloud services
Not Required, But Nice to Have
  • Experience with automation tools related to MLOps or AIOps such as AWS Bedrock and/or SageMaker.
What's In It For You

We offer a comprehensive total rewards package to support our full-time employees and their families' day-to-day needs, well-being, and major life events, which includes:

  • Fully company-paid options for medical (both in-person and virtual), dental and vision insurance
  • Generous paid time off (PTO) policy to enjoy periods of uninterrupted rest and relaxation for a healthy work/life balance
  • Paid parental leave for birth, adoption or permanent placement
  • 401(k) with company match
  • Options to work in a hybrid-working model or remotely from home, depending on the position
  • Annual Costco membership, cell phone stipend, commuter benefits, in-office perks and more

QGenda delivers technology solutions to improve how healthcare is delivered and increase access. We are committed to creating a culture of embracing diversity, inclusion and equity for all. We are an Equal Employment Opportunity employer and make all employment decisions without regard to race, color, religion, creed, gender, sex, national origin, age, disability or any…
