Want to hear how I work? Hit play.Kablio AI applies for you. You just show up to the interviewKablio AI helps you secure roles in construction, clean energy, facilities management, engineering, architecture, sustainability, environment and other physical world sectors.
Get hired, get rewarded!
Land a job through Kablio and earn a 5% salary bonus.
Exclusive benefits
5%Bonus
Principal Machine Learning Operations Developer for AEC
Autodesk
Design and make software for architecture, engineering, construction, and entertainment industries.
Develop and scale ML training pipelines and infrastructure for AEC foundation models.
Design efficient data processing workflows for large-scale design datasets and industry-specific file formats
Implement and maintain robust, testable code that is well documented and easy to understand
Optimize distributed training systems and develop solutions for model parallelism, checkpointing, and efficient resource management
Support AI researchers by building scalable ML training pipelines and infrastructure for foundation model development
Collaborate on projects at the intersection of research and product with a diverse, global team of researchers and engineers
Analyze performance bottlenecks and provide solutions to scaling problems
What you bring
python
cloud
pytorch
docker
bsc/msc
autodesk
A self-starter who can solve problems with minimal supervision while collaborating effectively with a global, remote-first team
Adaptable and creative, comfortable building new infrastructure or working within existing codebases
Experience with cloud services and architectures (AWS, Azure, etc.)
Familiarity with version control, CI/CD, and deployment pipelines
Experience with Autodesk or similar products (Revit, Sketchup, Forma)
Excellent communicator who can convey complex technical concepts clearly to diverse audiences
BSc or MSc in Computer Science or related field, or equivalent industry experience
Thrives in ambiguous, rapidly evolving areas where learning and flexibility are essential
Experience with performance optimization, monitoring, and efficiency in large-scale ML systems
Strong knowledge of ML infrastructure and model parallelism techniques, including frameworks like PyTorch, Lightning, Megatron, DeepSpeed, and FSDPProficiency in Python and strong software engineering practices
Excellent written documentation skills to document code, architectures, and experiments
Experience with AEC data formats (e.g., BIM models, IFC files, CAD files, Drawing Sets)
Experience with distributed data processing and ML infrastructure (e.g., Apache Spark, Ray, Docker, Kubernetes)
Experience scaling ML training and data pipelines for large datasets
Experience with distributed systems for machine learning and deep learning at scale
Knowledge of the AEC industry and its specific data processing challenges
Hey there! Before you dive into all the good stuff on our site, let’s talk cookies—the digital kind. We use these little helpers to give you the best experience we can, remember your preferences, and even suggest things you might love. But don’t worry, we only use them with your permission and handle them with care.