Site Reliability Engineer
Job in
500016, Prakāshamnagar, Telangana, India
Listed on 2026-02-04
Listing for:
Confidential
Full Time
position Listed on 2026-02-04
Job specializations:
-
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability, IT Support
Job Description & How to Apply Below
Job Description
Tri Net is a leading provider of comprehensive human resources solutions for small to midsize businesses (SMBs). We enhance business productivity by enabling our clients to outsource their HR function to one strategic partner and allowing them to focus on operating and growing their core businesses. Our full-service HR solutions include features such as payroll processing, human capital consulting, employment law compliance and employee benefits, including health insurance, retirement plans and workers' compensation insurance.
Tri Net has a nationwide presence and an experienced executive team. Our stock is publicly traded on the NYSE under the ticker symbol TNET. If you're passionate about innovation and making an impact on the large SMB market, come join us as we power our clients' business success with extraordinary HR.
Don't meet every single requirement Studies have shown that many potential applicants discourage themselves from applying to jobs unless they meet every single requirement. Tri Net always strives to hire the most qualified candidate for a particular role, ensuring we deliver outstanding results for our small and medium-size customers. So if you're excited about this role but your past experience doesn't align perfectly with every single qualification in the job description, nobody's perfect – and we encourage you to apply.
You may just be the right candidate for this or other roles.
Job Summary
We are seeking a skilled and motivated Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a crucial role in ensuring the reliability, availability, and performance of our systems and applications. Leveraging your technical expertise and knowledge of SRE practices, you will collaborate with cross-functional teams, drive automation initiatives, and implement best practices to enhance system resilience.
If you are a dedicated and detail-oriented SRE professional with a passion for maintaining highly reliable systems, we encourage you to apply for this position.
Essential Duties/Responsibilites
System Monitoring and Incident Response:
Monitor system health, proactively detect issues, and respond to incidents in a timely manner. Participate in incident response activities, including triage, troubleshooting, and resolution, ensuring minimal disruption to services.
Automation and Tooling:
Develop and maintain automation scripts, tools, and utilities to streamline operational tasks, reduce manual effort, and improve system efficiency. Leverage scripting languages and configuration management tools to automate routine tasks.
Performance Optimization:
Identify performance bottlenecks, analyze system metrics, and optimize system performance. Collaborate with Development and Operations teams to implement performance tuning measures and ensure optimal resource utilization.
Infrastructure and Configuration Management:
Manage infrastructure resources, including cloud platforms, servers, and network devices. Implement and maintain configuration management practices to ensure consistency and reliability across environments.
Capacity Planning:
Conduct capacity planning exercises to forecast resource requirements and support scalability. Analyze usage patterns, monitor system performance, and recommend infrastructure adjustments to meet demand.
Incident Analysis and Post-Mortems:
Perform root cause analysis for incidents and contribute to post-incident reviews. Identify areas for improvement, implement preventive measures, and update documentation and runbooks accordingly.
System Documentation:
Contribute to the development and maintenance of system documentation, runbooks, and standard operating procedures (SOPs). Ensure documentation is accurate, up-to-date, and accessible to the team.
Collaboration and Communication:
Collaborate effectively with cross-functional teams, including Development, Operations, and Support, to address system issues, implement changes, and improve system reliability. Communicate updates, findings, and recommendations to stakeholders in a clear and concise manner.
Continuous Improvement:
Identify opportunities for automation, process…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×