Senior/Principal Infrastructure Engineer, Data Centers
Ann Arbor, Washtenaw County, Michigan, 48113, USA
Listed on 2026-03-04
IT/Tech
Systems Engineer, Cloud Computing
Utilidata is a fast-growing NVIDIA-backed edge AI company enabling greater visibility and control of power utilization in energy-intensive infrastructure, like the electric grid and data centers. Karman, the company’s distributed AI platform powered by a custom NVIDIA module, is transforming the way utility companies operate the grid edge and will enable data centers to unlock more compute for the same provisioned power.
Responsibilities
- Deploy and configure Karman systems in high‑density data center environments, ensuring adherence to best practices and organizational standards
- Monitor, troubleshoot, and resolve technical issues related to Karman applications, networking, and infrastructure components
- Manage and maintain B300 or equivalent rack systems, including configuration and optimization of PDUs (Power Distribution Units) and PSUs (Power Supply Units)
- Perform Linux system administration tasks including installation, configuration, patch management, and performance tuning
- Design and implement network configurations to support Karman deployments, including routing, switching, and connectivity optimization
- Develop a deep understanding of how compute, power, and networking resources are being consumed by internal teams, and proactively identify constraints, risks, and scaling bottlenecks
- Collaborate with cross‑functional teams to plan capacity requirements and scale infrastructure to meet growing demands
- Document deployment procedures, configuration standards, and troubleshooting guides for knowledge sharing and operational continuity
- Provide technical support and training to internal teams on Karman system operations and best practices
- Conduct regular system health checks, performance monitoring, and proactive maintenance to prevent downtime
- Participate in on‑call rotation to ensure 24/7 system availability and rapid incident response
- 8+ years of experience with Linux system administration (RHEL, Ubuntu, CentOS, or similar distributions)
- Proven experience deploying and managing applications in high‑density data center environments
- Strong understanding of data center infrastructure including rack systems (B300 or equivalent), PDUs, PSUs, cooling systems, and power management
- Hands‑on experience with enterprise networking concepts including TCP/IP, DNS, DHCP, VLANs, firewall concepts, routing protocols, switching, and network troubleshooting
- Demonstrated ability to operate effectively in environments with evolving processes and incomplete tooling, using first‑principles debugging and cross‑domain reasoning
- Strong ownership mindset with a bias toward proactive improvement rather than reactive ticket resolution
- Willingness to travel up to 20% of the time, including international travel
- Proficiency with configuration management and automation tools (Ansible, Puppet, Chef, or similar)
- Familiarity with Tailscale/WireGuard
- Experience with monitoring and observability tools (Prometheus, Grafana, ELK stack, or similar)
- Knowledge of containerization and orchestration technologies (Docker, Kubernetes)
$160,000 to $195,000 base compensation, depending on experience, plus stock options. Salary will be commensurate with an individual's skills, training, and years of experience, and in line with internal compensation bands.
Location
This position is based onsite at our company headquarters in Ann Arbor, Michigan, with flexibility for occasional remote work.
Our Commitments
Utilidata values the diversity of our team. We provide equal employment opportunities without regard to race, color, religion, creed, sex, gender, sexual orientation, gender identity or expression, national origin, age, physical disability, mental disability, medical condition, pregnancy or childbirth, genetics, genetic information, marital status, status as a covered veteran, or any other basis protected by applicable federal, state, and local laws.
- Creating a diverse and inclusive workplace that is welcoming, supportive, affirming and respectful
- Empowering employees to solve problems and work together to make a difference
- Providing mentorship and growth opportunities as part of a collaborative team
- A flexible work environment with flexible paid time off
- Competitive compensation and benefits, including health, dental, vision, and employer‑match 401k