✅ Collaborate on training, reporting & SLA improvements to keep teams sharp and systems resilient
This isn’t a “keep the lights on” role - you’ll be the forensic engineer behind solving failures, preventing repeat incidents, and engineering long-term reliability.
✅ Challenge OEMs and vendors to raise standards and push long-term fixes
✅ Shape maintenance standards & asset strategies for complex systems (power & cooling)
✅ Partner with Operations, Engineering & Construction to close gaps and improve design handoffs
✅ Lead deep-dive root cause analyses and turn failures into system-wide improvements
We’re searching for an Infrastructure Reliability Engineer to own performance, resiliency, and risk reduction across mission-critical infrastructure.
⚡ Loves uncovering root causes (not quick fixes)
✅ Drive uptime & reliability with failure mode mitigation strategies