More jobs:
Lead Reliability Engineer
Job in
Irving, Dallas County, Texas, 75084, USA
Listed on 2026-01-12
Listing for:
Citi
Full Time
position Listed on 2026-01-12
Job specializations:
-
IT/Tech
Cybersecurity, IT Support
Job Description & How to Apply Below
1 day ago Be among the first 25 applicants
Overview of the CompanyCiti, the leading global bank, has approximately 200 million customer accounts and operates in more than 160 countries. Citi provides consumers, corporations, governments, and institutions with a broad range of financial products and services, including consumer banking and credit, corporate and investment banking, securities brokerage, transaction services, and wealth management.
Overview of the RoleThe selected candidate will become the key engineer supporting and advancing the platform used for threat‑modeling. Responsibilities include maintaining and supporting the threat‑modeling application, developing relevant tools, and working within a regulated and change‑controlled environment.
Responsibilities- Ensure high availability and optimal performance of the threat‑modeling application through proactive monitoring, incident management, and efficient troubleshooting.
- Perform routine and emergency application and infrastructure maintenance, including patching, upgrades, and configuration management, adhering strictly to change control procedures.
- Conduct root cause analysis for production incidents and implement preventative measures to minimize future occurrences.
- Develop and maintain automation scripts and tools (e.g., using Python, Bash) to streamline operational tasks, improve monitoring, and facilitate efficient deployments.
- Proactively identify, recommend, and implement enhancements to existing application maintenance practices, operational workflows, and system reliability.
- Serve as a technology subject matter expert for internal and external stakeholders, contributing to technology domain roadmaps and firm‑mandated controls and compliance initiatives.
- Appropriately assess and mitigate risk in all technical decisions, ensuring compliance with applicable laws, rules, regulations, and internal policies, while escalating and reporting control issues with transparency.
- Present technical work to senior stakeholders, the team, and other technical teams.
- Mentor and train junior team members, fostering a culture of knowledge sharing and continuous improvement.
- 6+ years of relevant experience in an engineering role, preferably in financial services or a large, complex, and/or global environment.
- Experience managing and troubleshooting Linux operating systems (e.g., Red Hat Enterprise Linux, CentOS, Ubuntu), including system administration tasks such as user management, service restarts, and file system checks – Must Have.
- Proficiency in scripting for automation (e.g., Bash, Python) and configuration management tools (e.g., Ansible, Puppet, Chef) – Must Have.
- Experience with container orchestration using Helm and Kubernetes on platforms like AWS EKS, GCP GKE, or Open Shift – Must Have.
- Working knowledge of relational databases (e.g., Postgre
SQL), including basic querying – Must Have. - Proven track record of maintaining applications and their technology stacks compliant with security and configuration requirements, successfully passing internal and external security audits by demonstrating secure configuration of applications and infrastructure – Must Have.
- Demonstrated adherence to strict change control procedures, executing all changes through a formalized change management process with proper documentation and approvals – Must Have.
- Experience with ticketing systems (e.g., Jira, Service Now) – Must Have.
- Working understanding of middleware components (e.g., Nginx, Tomcat or equivalents).
- Familiarity with development concepts (e.g., Git, CI/CD, pipelines, SDLC).
- Strong communication skills, both written and verbal, for technical and non‑technical audiences.
- Demonstrated analytical and diagnostic skills, with an ability to identify process improvements and best practices.
- Ability to work independently, manage multiple tasks, take ownership of initiatives, and operate effectively in a matrixed environment under pressure and tight deadlines.
- Kubernetes and Cloud Native Associate (KCNA), Certified Kubernetes Application Developer (CKAD), Certified Kubernetes Administrator (CKA), Kubernetes and Cloud…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×