Want to hear how I work? Hit play.Kablio AI applies for you. You just show up to the interviewKablio AI helps you secure roles in construction, clean energy, facilities management, engineering, architecture, sustainability, environment and other physical world sectors.
Get hired, get rewarded!
Land a job through Kablio and earn a 5% salary bonus.
Exclusive benefits
5%Bonus
Senior ML Ops Engineer (Machine Learning Infrastructure)
Parallel Systems
Developing autonomous electric freight vehicles for more efficient and sustainable transportation.
Lead design and development of scalable MLOps infrastructure for autonomous rail vehicles.
After 90 Days: You have delivered the core features of the MLOps pipeline and successfully integrated key tools (e.g., MLflow, SageMaker, or Kubeflow). You’ve also initiated the implementation of the remaining features, ensuring the infrastructure supports scalable, repeatable workflows for model experimentation and deployment in both R&D and production environments.
Architect, deploy, and manage scalable ML infrastructure for distributed training and inference.
Support the automation of model evaluation, selection, and deployment workflows.
Build and operate cloud-based systems (e.g., AWS, GCP) optimized for ML workloads in R&D, and production environments.
Build scalable ML infrastructure to support continuous integration/deployment, experiment management, and governance of models and datasets.
Design and implement robust MLOps solutions, including automated pipelines for data management, model training, deployment and monitoring.
Collaborate with ML engineers to gather requirements and develop strategies for data management, model development and deployment.
What you bring
ci/cd
mlops
distributed training
python
5+ years
bachelor's
Deep understanding of CI/CD practices applied to ML workflows.
Hands-on experience with MLOps tools (e.g., MLflow, Kubeflow, SageMaker, Airflow, Metaflow, or similar).
Hands-on experience with distributed training tools (e.g., PyTorch DDP, Horovod, Ray).
After 30 Days: You have developed a deep understanding of the product goals, existing infrastructure, and stakeholder requirements. You've conducted technical discovery and proposed a preliminary MLOps architecture—evaluating various ML tools, cloud services, and workflow strategies—clearly outlining pros and cons for each option.
Experience with cloud platforms (AWS, GCP, or Azure) and designing ML architectures in those environments.
Bachelor’s or higher degree in Computer Science, Machine Learning, or a relevant engineering discipline.
Proven experience architecting and deploying production-grade ML pipelines and platforms.
Background in real-time ML systems and batch inference, including CPU/GPU-aware orchestration.
5+ years of experience building large-scale, reliable systems; 2+ years focused on ML infrastructure or MLOps.
Proficiency in Python, Git, and system design with solid software engineering fundamentals.
Experience with deep learning architectures (CNNs, RNNs, Transformers) or computer vision.
Strong knowledge of ML lifecycle: data ingestion, model training, evaluation, packaging, and deployment.
Previous work in autonomous vehicles, robotics, or other real-time ML-driven systems.
Hey there! Before you dive into all the good stuff on our site, let’s talk cookies—the digital kind. We use these little helpers to give you the best experience we can, remember your preferences, and even suggest things you might love. But don’t worry, we only use them with your permission and handle them with care.