Senior Software Engineer- AI Hardware
Listed on 2026-01-02
-
Engineering
Systems Engineer, Software Engineer
Senior Software Engineer – AI Hardware at Bloomberg
Location: New York, NY
Base Pay Range: $160,000 – $240,000 per year
The RoleWe are seeking an engineer to join our hardware management team that is responsible for provisioning, monitoring, and supporting thousands of servers that power the AI stack ideal candidate will have experience designing, implementing, and maintaining system software that enables communication between GPUs, CPUs, and storage in scale‑out AI and HPC systems. The role also includes overseeing ongoing monitoring, support, and maintenance of our HPC/AI clusters to ensure peak performance and reliability.
We'llTrust You To
- Design, build, and maintain highly reliable, scalable, and efficient infrastructure platforms that support our engineering teams and business needs.
- Participate in system design discussions and contribute to architectural decisions.
- Ensure code quality through standard methodologies, code reviews, and alignment to clean‑code principles.
- Produce clear documentation for a wide audience.
- Communicate effectively across diverse teams.
- Participate in scheduled on‑call rotations.
- Be a self‑starter, manage priorities, and work independently.
- Stay updated with the latest infrastructure technologies and evaluate their potential impact on existing and future solutions.
- Hold yourself to high standards.
- Exude our ambitious, collaborative, and empathetic values.
- Have a self‑starter mentality with an eagerness to solve previously unsolved problems.
- Excellent collaboration skills and openness to giving and receiving critical feedback.
- Own scalability and reliability as core principles.
- Have publicly available writing samples, blog posts, demos, or recordings of technical presentations.
- A unique opportunity to be part of a rapidly growing team in one of Bloomberg’s most exciting engineering groups.
- An inclusive and supportive culture that fosters learning and growth.
- Continuous professional development, product training, and clear career pathing.
- In‑department mentor and buddy program for networking.
- Participation in Community Guilds and company‑wide initiatives.
- 4+ years of experience in Kubernetes environments (deployments, storage, services, jobs, ingress, egress, etc.).
- BA/BS/MS/PhD in Computer Science, Electrical Engineering, or related field.
- Hands‑on management of GPU‑based systems, including kernel and driver management, and development of tooling for provisioning and maintenance.
- Design, implementation, and maintenance of system software enabling communication between GPUs, CPUs, and storage in scale‑out AI and HPC systems.
- Oversight of ongoing monitoring, support, and maintenance of HPC/AI clusters.
- Driving system upgrades and customizations, coordinating with software developers, network operations, and data center teams.
- Managing diverse computer systems and application software to meet high standards of functionality and efficiency.
- Expertise in low‑latency/high‑bandwidth, interconnected infrastructure (Infini Band, Ethernet, RDMA/RoCE, etc.).
- Monitoring and evaluating infrastructure service delivery efficiencies.
- Partnering with internal teams to develop capacity‑planning metrics and publish performance reports to senior leaders.
- Benchmarking and recommending infrastructure improvements.
- Expertise with Kubernetes design patterns (operators, Helm charts, kustomize, etc.).
- Experience with data center planning (rack elevations, cabling plans, transceivers).
- Hands‑on data center operations and management experience.
Salary Range: $160,000 – $240,000 USD per year, plus benefits and bonus. Actual compensation may vary based on location, experience, and market conditions.
Benefits include merit increases, paid holidays, paid time off, medical, dental, vision, short‑ and long‑term disability, 401(k) with match, life insurance, and wellness programs. Contingent workers, contractors, and interns are not eligible for benefits.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).