Site Reliability Engineer - Fulltime
Job in
Jersey City, Hudson County, New Jersey, 07390, USA
Listed on 2026-01-12
Listing for:
VBeyond Corporation
Full Time position
Job specializations:
- IT/Tech: Systems Engineer, SRE/Site Reliability, Cloud Computing, Network Engineer
Job Description & How to Apply Below
Site Reliability Engineer - Fulltime Only
- SRE role focused on observability, Kubernetes, and cloud infrastructure (AWS/GCP/EKS)
- Ownership of observability stack: Prometheus, Grafana, OpenTelemetry, ELK/Loki/Splunk, Jaeger, Alertmanager, SLOs
- Build and maintain reliable monitoring pipelines for metrics, logs, tracing, dashboards, and alerts
- Develop Terraform modules for observability infrastructure, Kubernetes components, and cluster add-ons
- Improve cluster reliability through automation, performance tuning, capacity planning, and remediation
- Implement AI-assisted diagnostics for anomaly detection, alert tuning, and noise reduction
- Collaborate with Platform Engineering on Istio/service mesh telemetry and platform health
- Lead SLO reporting, incident management, and root cause analysis
- 4–8 years of experience in SRE, infrastructure, or Kubernetes operations
- Strong expertise in observability tools, Terraform, automation (Python/Go), CI/CD, and cloud networking
Mid-Senior level
Employment type: Full-time