Want to hear how I work? Hit play.Find roles with Kablio AI to help build and power the world.Kablio AI helps you secure roles in construction, clean energy, facilities management, engineering, architecture, sustainability, environment and other physical world sectors.
Get hired, get rewarded!
Land a job through Kablio and earn a 5% salary bonus.
Exclusive benefits
5%Bonus
Senior Site Reliability Engineer- Cloud Platform
Baker Hughes
Provides innovative solutions for energy, industrial, and infrastructure sectors globally.
Design, automate, and support cloud infrastructure and DevOps for Baker Hughes.
Working on critical, complex customer problems that may span multiple services
Collaborating with stakeholders to understand requirements, set priorities, and communicate progress and challenges.
Ensuring security best practices are integrated into the development lifecycle, including compliance with data protection regulations.
Providing technical application support for enterprise-level systems
Co-ordinating with Cloud infrastructure partners for Server, Network, Database, service-related incidents, and projects
Deploying application upgrades/patches in production and test environments
Troubleshooting application alerts, Azure and AWS Policy from monitoring tools and code inspection and performing RCAs
Running our infrastructure with Chef, Ansible, Terraform, Github CI/CD, and Kubernetes
Automating deployment of applications and infrastructure
Collaborating with cross functional stakeholders
Providing mentorship and guidance to team members
Participating in 24x7 on-call rotation and working with global teams
Monitoring configuration management, platform layout, and hosting infrastructure.
Following security guidelines to develop secure and compliant Cloud services by working with Risk and Security teams.
Writing tutorials, how-to videos, and other technical articles for the customer community and knowledgebase articles and keep them up to date
Participating in Capacity planning, system performance monitoring, resource utilization trending and incident and change management.
What you bring
python
bsc computer
linux
docker
kubernetes
aws
Have experience in infrastructure optimization in Cloud.
Have Knowledge in automation scripting language like Python/Linux Shell scripting / Windows Powershell
Have bachelor's degree in computer science or “STEM” Majors (Science, Technology, Engineering and Math) with 7-10 years of experience in total.
Have 5+ years of Experience in Linux (RHEL) operating system performance monitoring parameters and their interpretation, commands used for monitoring
Have experience in Change management and Incident management process
Have experience in RDBMS and NoSQL database technologies
Have knowledge of application design patterns, J2EE application architectures, Microservices, Spring boot & Cloud native architectures
Have Mastery in collaborative software development using Git, Jira, Confluence etc.
Have 5+ years of Hands-on experience with Public Cloud-based applications, technologies and tools, deployment, monitoring, and operations, such as Docker, Kubernetes, etc.
Have proficiency in Java runtimes, Core Java, Garbage collection, JVM parameters tuning
Be able to work independently and in a team environment managing a range of customers and technical situations.
Have 5-8 years of experience with cloud infrastructure platforms such as AWS and Azure. Have prior experience in setting up, running and configuring Cloud applications.
Have deep understanding of operating and monitoring Java applications and Dockerized containers
Be an expert in performance monitoring and capacity management of enterprise systems using various tools.
Have experience in Observability - APM tools (Dynatrace, AppDynamics etc.), metrics / log consolidation (Splunk) and logging tools such as Prometheus, Grafana, and the ELK stack is essential.
Have hands-on experience in CI-CD (AWS CodePipeline, Azure DevOps, GitLab CI/CD, Jenkins) and IaC tools (Terraform, AWS CloudFormation, Ansible etc.)
Demonstrating best practices pertaining to Cloud DevOps development along with a willingness to continually learn Cloud native technologies.
Benefits
Safety net of life insurance and disability programs
Working flexible hours - flexing the times when you work in the day to help you fit everything in and work when you are the most productive
Contemporary work-life balance policies and wellbeing activities
Hey there! Before you dive into all the good stuff on our site, let’s talk cookies—the digital kind. We use these little helpers to give you the best experience we can, remember your preferences, and even suggest things you might love. But don’t worry, we only use them with your permission and handle them with care.