AI Infrastructure Engineer
Utilidata
- Karma, Pohjois-Karjala, United States
- Permanent
- Full-time
- Lead the design and build of Utilidata's AI inference platform — establishing architecture patterns, deployment standards, and operational practices that will scale with the company
- Own Utilidata's end-to-end model serving infrastructure (on-prem and datacenter)
- Build and maintain fault-tolerant, high-performance systems for serving AI models at scale, with a focus on low latency, reliability, and cost efficiency
- Collaborate closely with algorithms engineers to integrate AI inference data and configuration with power optimization algorithms
- Optimize GPU utilization and inference performance across our hardware fleet, including NVIDIA accelerators central to Utilidata's edge AI platform
- Establish MLOps best practices including CI/CD pipelines for model deployment, monitoring, and rollback across environments
- Contribute to infrastructure roadmap decisions, including build vs. buy tradeoffs, tooling selection, and platform evolution as the team grows
- 5+ years of software engineering experience with a strong focus on AI infrastructure, backend systems, or distributed systems
- Hands-on experience with AI model serving frameworks (e.g., vLLM, SGLang, Triton, TensorRT, TorchServe, or similar)
- Understanding of container orchestration and cluster management (Kubernetes, Docker)
- Experience deploying and operating infrastructure across both datacenter and on-prem environments
- Strong knowledge of GPU workloads and the tradeoffs that come with them — you understand how inference differs from training, and why it matters
- Proficiency in Python; C++, CUDA, Go, or Rust a plus
- Excellent communication skills and comfort working cross-functionally in a lean, fast-moving environment
- Willingness to travel up to 10% of the time
- Dynamo experience a plus
- Experience with edge AI deployments or constrained compute environments
- Familiarity with infrastructure as code (Terraform, Helm)
- Experience with observability platforms (Datadog, Prometheus, Grafana)
- Background in energy, utilities, or industrial IoT
- Contributions to open-source ML infrastructure projects
Utilidata values the diversity of our team. We provide equal employment opportunities without regard to race, color, religion, creed, sex, gender, sexual orientation, gender identity or expression, national origin, age, physical disability, mental disability, medical condition, pregnancy or childbirth, genetics, genetic information, marital status, or status as a covered veteran or any other basis protected by applicable federal, state and local laws.
We are committed to:
- Creating a diverse and inclusive workplace that is welcoming, supportive, affirming and respectful
- Empowering employees to solve problems and work together to make a difference
- Providing mentorship and growth opportunities as part of a collaborative team
- A flexible work environment with flexible paid time off
- Competitive compensation and benefits, including health, dental, vision, and employer-match 401k