Job Description & How to Apply Below
The Role
Grade Level (for internal use):
11
Job Title:
Senior Site Reliability Engineer
Role Overview
As a Site Reliability Engineer at Chart
IQ, you'll play a critical role not only in building, maintaining, and scaling the infrastructure that supports our Development our Development and QA needs, but also in driving new, exciting cloud-based solutions that will add to our offerings.
Your work will ensure that the platforms used by our team remain available, responsive, and high-performing. In addition to maintaining the current infrastructure, you will also contribute to the development of new cloud-based solutions, helping us expand and enhance our platform's capabilities to meet the growing needs of our financial services customers.
You will also contribute to light JavaScript programming, assist with QA testing, and troubleshoot production issues. Working in a fast-paced, collaborative environment, you'll wear multiple hats and support the infrastructure for a wide range of development teams.
This position is in India, and will require working overlapping hours with teams in the US. The preferred working hours will be until 12 noon EST to ensure effective collaboration across time zones.
Key Responsibilities
Design, implement, and manage infrastructure using Terraform or other Infrastructure-as-Code (IaC) tools.
Leverage AWS or equivalent cloud platforms to build and maintain scalable, high-performance infrastructure that supports data-heavy applications and JavaScript-based visualizations.
Understand component-based architecture and cloud-native applications.
Implement and maintain site reliability practices, including monitoring and alerting using tools like Data Dog, ensuring the platform's availability and responsiveness across all environments.
Design and deploy high-availability architecture to support continuous access to alerting engines.
Support and maintain Configuration Management systems like Service Now CMDB.
Manage and optimize CI/CD workflows using Git Hub Actions or similar automation tools.
Work with OIDC (OpenID Connect) integrations across Microsoft, AWS, Git Hub, and Okta to ensure secure access and authentication.
Contribute to QA testing (both manual and automated) to ensure high-quality releases and stable operation of our data visualization tools and alerting systems.
Participate in light JavaScript programming tasks, including HTML and CSS fixes for our charting library.
Assist with deploying and maintaining mobile applications on the Apple App Store and Google Play Store.
Troubleshoot and manage network issues, ensuring smooth data flow and secure access to all necessary environments.
Collaborate with developers and other engineers to troubleshoot and optimize production issues.
Help with the deployment pipeline, working with various teams to ensure smooth software releases and updates for our library and related services.
Required Qualifications
Proficiency with Terraform or other Infrastructure-as-Code tools.
Experience with AWS or other cloud services (Azure, Google Cloud, etc.).
Solid understanding of component-based architecture and cloud-native applications.
10 to 20 years'
Experience with site reliability tools like Data Dog for monitoring and alerting.
Experience designing and deploying high-availability architecture for web based applications.
Familiarity with Service Now CMDB and other configuration management tools.
Experience with Git Hub Actions or other CI/CD platforms to manage automation pipelines.
Strong understanding and practical experience with OIDC integrations across platforms like Microsoft, AWS, Git Hub, and Okta.
Solid QA testing experience, including manual and automated testing techniques (Beginner/Intermediate).
JavaScript, HTML, and CSS skills to assist with troubleshooting and web app development.
Experience with deploying and maintaining mobile apps on the Apple App Store and Google Play Store that utilize web-based charting libraries.
Basic network management skills, including troubleshooting and ensuring smooth network operations for data-heavy applications.
Knowledge of package publishing tools such as Maven, Node, and Cocoa Pods to ensure seamless dependency management and distribution across platforms.
Additional Skills and Traits for Success in a Startup-Like Environment:
Ability to wear multiple hats:
Adapt to the ever-changing needs of a startup environment within a global organization.
Self-starter with a proactive attitude, able to work independently and manage your time effectively.
Strong communication skills to work with cross-functional teams, including engineering, QA, and product teams.
Ability to work in a fast-paced, high-energy environment.
Familiarity with agile methodologies and working in small teams with a flexible approach to meeting deadlines.
Basic troubleshooting skills to resolve infrastructure or code-related issues quickly.
Knowledge of containerization tools such as Container Platforms and Amazon ECS is a plus.
Understanding of Dev Sec Ops and basic…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×