Senior Observability Platform Engineer
Listed on 2026-01-12
-
IT/Tech
Systems Engineer, IT Support
Job Title
Senior Observability Platform Engineer at SS&C Technologies
Company OverviewSS&C Technologies is a leading financial services and healthcare technology company headquartered in Windsor, Connecticut. With over 27,000 employees worldwide, we serve more than 20,000 organizations, providing expertise, scale, and technology across the industry.
Job DescriptionThe Senior Observability Platform Engineer will design, develop, and maintain our comprehensive observability stack. This role focuses on open source observability tooling, Linux, Kubernetes, and cloud-native environments to ensure the reliability, performance, and operational efficiency of our services.
Responsibilities- Design, develop, implement, and maintain the observability stack, including tracing, telemetry, logging, health monitoring, visualization, and dashboards.
- Build a robust observability framework using composable open source solutions such as Prometheus, Alert manager, Open Telemetry, Grafana, Alloy, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar.
- Develop and maintain health monitoring and alerting systems for compute platforms, databases, network infrastructure, and Kubernetes-based platforms (including GPU-supported environments).
- Create and manage visualization dashboards to monitor system performance, resource utilization, and operational health.
- Implement scalable, distributed logging and tracing solutions to diagnose, troubleshoot, and resolve system issues effectively.
- Collaborate with development and operations teams to integrate observability practices into the lifecycle.
- Conduct performance analysis and optimization to ensure system reliability and efficiency.
- Stay updated with the latest trends and technologies in observability and performance monitoring.
- Collaborate with cross‑functional teams (Cloud Engineering, Network, Dev Ops/Solutions Engineering) to troubleshoot and resolve infrastructure issues.
- Proven experience in observability, system and network monitoring, and system performance analysis in cloud or data center environments.
- Expertise in implementing and managing observability tools such as Prometheus, Alert manager, Open Telemetry, Grafana, Alloy, Loki, Promtail, Tempo, Thanos, ELK stack, Zabbix, and similar commercial solutions.
- Hands‑on experience with Kubernetes.
- Experience with infrastructure‑as‑code and configuration management tools such as Consul, Git Hub, Salt Stack, Terraform, etc.
- Proficiency in scripting and automation using Go, Python, Shell.
- Excellent problem‑solving skills and ability to work independently or as part of a team.
- Strong communication skills and ability to work in a fast‑paced, dynamic environment.
- Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.
SS&C offers excellent benefits including health, dental, 401k plan, tuition and professional development reimbursement plan.
Equal Employment OpportunitySS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).