More jobs:
Sr. System Development Engineer, HWEng Acceletor Systems
Job in
Cupertino, Santa Clara County, California, 95014, USA
Listed on 2026-01-13
Listing for:
Amazon Web Services (AWS)
Full Time
position Listed on 2026-01-13
Job specializations:
-
IT/Tech
Systems Engineer, Cloud Computing, Hardware Engineer
Job Description & How to Apply Below
Sr. System Development Engineer, HWEng Acceletor Systems
Amazon Web Services (AWS) seeks a Senior System Development Engineer to lead accelerator server and rack system development, solving complex architectural problems while debugging hardware, software, and network issues across NPI and fleet operations. You will build automation to manage and improve large-scale infrastructure, focusing on operational excellence, reliability, and efficiency at the intersection of hardware, software, networking, and cloud services.
Keyjob responsibilities
- Lead technical solutions for complex accelerator server and rack system architectural challenges
- Own end‑to‑end system reliability, proactively identifying and resolving deficiencies before customer impact
- Write code and implement solutions to address system‑level issues at large scale
- Decompose complex server system problems (testability, reliability, diagnostics) into deliverable tasks and features
- Lead cross‑functional delivery of system improvements through direct execution and team coordination
- Apply expertise across hardware, software, system design, x86 architecture, processes, and operations
- Drive system scalability and performance optimization for accelerator workloads
- Collaborate with hardware and software teams to ensure robust system integration
- Develop and implement diagnostic tools and monitoring solutions for production systems
- Debug complex system failures in time‑sensitive settings
- 3+ years of programming experience with a modern language such as C++, C#, Java, Python, Go, Power Shell, Ruby
- 4+ years of professional software development experience
- 2+ years of designing or architecting production systems
- Experience leading design, build, and deployment of complex and high‑performance software solutions
- Strong analytical skills, attention to detail, and effective communication
- 4+ years of experience in site reliability engineering, systems engineering, or Dev Ops
- Experience leading and influencing teams as a mentor or tech lead
- Knowledge of operating systems, hardware, storage, networking, security, database administration, and cloud infrastructure
- Deep understanding of x86 architecture, server hardware, and system‑level debugging
- Experience with accelerator technologies (GPUs, AI/ML hardware) and/or high‑performance computing systems
- 6+ years of development experience in Python, Java, Bash, or Perl
- Knowledge of engineering practices and patterns across the software/hardware/network development life cycle
- Experience building complex software or computing infrastructure successfully delivered to customers
Amazon is an equal‑opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Job : A3152752
#J-18808-LjbffrTo View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×