Lead DevOps Engineer

Trimble Inc.

The Role

Overview

Design, implement, and maintain cloud infrastructure and CI/CD pipelines for the AI platform.

Key Responsibilities

  • Monitoring
  • IaC
  • Containerization
  • Cloud infrastructure
  • CI/CD
  • Architecture

Tasks

- Share knowledge, document designs and processes effectively. May mentor junior engineers on DevOps practices.
- Implement and improve monitoring, logging, and alerting systems to ensure high availability and performance.
- Take ownership of the technical implementation for specific infrastructure components or pipelines.
- Automate infrastructure provisioning and configuration using Infrastructure-as-Code tools (e.g., Terraform).
- Write high-quality, maintainable, and well-tested code for infrastructure automation and services.
- Troubleshoot and debug complex issues in production environments, spanning infrastructure, networking, and service layers.
- Manage containerization and orchestration using technologies like Docker and Kubernetes.
- Design, implement, and maintain scalable and reliable cloud infrastructure on platforms like AWS, GCP, or Azure.
- Collaborate with software engineers and AI researchers to define infrastructure requirements and translate them into effective technical solutions.
- Participate in on-call rotation to address production issues and ensure system stability.
- Develop and manage CI/CD pipelines to automate the build, test, and deployment of our services.
- Contribute significantly to technical design and architecture discussions, considering reliability, scalability, security, and cost-effectiveness.
- Contribute to improving the scalability, performance, security, and cost-effectiveness of the platform.
- Evaluate and prototype new technologies and frameworks relevant to MLOps, platform infrastructure, and DevOps practices.
- Work closely with teammates and cross-functional partners (Product, Research, other Engineering teams) to ensure seamless delivery and operation of our services.

Requirements

  • Bachelor's degree
  • Python
  • Docker
  • Kubernetes
  • Terraform
  • CI/CD

What You Bring

- Bachelor's degree (or equivalent practical experience) in Computer Science, Engineering, or a related field.
- Strong problem-solving skills and attention to detail. Good communication and collaboration skills.
- Experience with security best practices for cloud infrastructure and applications.
- 5+ years (Senior level) of professional DevOps or Site Reliability Engineering (SRE) experience.
- Interest in or practical experience with AI/ML concepts and MLOps practices.
- Hands-on experience with agentic AI concepts/frameworks (e.g., LangChain, LlamaIndex), vector databases (e.g., Pinecone, Weaviate), or RAG techniques.
- Experience with CI/CD tools (e.g., GitLab CI/CD, Jenkins, GitHub Actions).
- Hands-on experience with containerization (Docker) and orchestration (Kubernetes).
- Experience with platform or infrastructure-as-a-service components.
- Experience with software development best practices (testing frameworks, code reviews).
- Extensive experience building, deploying, and operating services in a cloud environment (AWS, GCP, or Azure).
- Strong proficiency in one or more scripting or programming languages (e.g., Python, Go, Bash).
- Solid understanding of Infrastructure-as-Code (IaC) principles and experience with tools like Terraform.
- Solid understanding of computer science fundamentals (data structures, algorithms, operating systems, networking).
- Experience with building and maintaining developer tools or internal platforms.
- Experience with monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, Datadog, New Relic).

The Company

About Trimble Inc.

- Offers integrated solutions across construction, agriculture, and transportation.
- Cutting-edge technology streamlines workflows and improves efficiency.
- Provides both hardware and software solutions, focusing on automation, geospatial data, and real-time analytics.
- Notable projects include smart city infrastructure, autonomous vehicles, and precision farming systems.
- Played a key role in developing GPS technology and transforming resource management.
- Solutions help achieve higher productivity, safety, and sustainability.

Sector Specialisms

Construction

Geospatial

Engineering and Construction

Field Solutions

Mobile Solutions

Advanced Devices

Transportation and Logistics

Field Service Management

Telecommunications

Utilities

Construction Logistics

Forestry

Aerial Survey

Civil Construction

Earthworks

Mining

Military and Defense

Automotive

Mapping and Navigation

Surveying

Mobile Mapping

Enterprise

Water Resources

Infrastructure

Buildings