Skip To Main Content
backgo to search

senior site reliability engineer

bullets
Site Reliability Engineering, Azure DevOps, Go Language, Kubernetes, Terraform, Bash, Google Cloud Platform, Helm, Istio, Microsoft Azure, Prometheus, Python
warning.png
Sorry the job is no longer available.

We are seeking a highly skilled Senior Site Reliability Engineer to join our remote team and work on exciting projects with cutting-edge technologies.

As a Senior SRE, you will be responsible for designing, modifying, and troubleshooting modules to deploy resources using Terraform and Infrastructure as Code. You will also work on stories assigned in Azure DevOps as per agile processes, create new observability and monitoring capabilities, and program application monitoring and alerting. Additionally, you will prepare and perform stress tests, automate manual processes in the CI/CD pipelines, and create and automate Playbooks and Alerts for Auto Healing.

responsibilities
  • Design, modify, and troubleshoot modules to deploy resources using Terraform and Infrastructure as Code principles
    • Work on Stories assigned in Azure DevOps as per agile process
      • Create new observability and monitoring capabilities and visualizations
        • Program application monitoring and alerting
          • Prepare and perform stress tests
            • Automate manual processes in the CI/CD pipelines
              • Create and automate Playbooks if they don't exist
                • Automate Alerts for Auto Healing as defined with each program
                  • Collaborate with cross-functional teams to deliver high-quality software solutions in line with project goals and timelines
                    • Ensure the implementation and maintenance of infrastructures using Infrastructure as Code principles and tools
                      • Guide and mentor junior team members, fostering a culture of growth and continuous learning within the team
                        requirements
                        • Minimum of 3 years of experience in Site Reliability Engineering, managing complex cloud and microservices environments
                          • Proficient in Azure DevOps as the main CI/CD tool
                            • Expertise in Kubernetes, with a good understanding of Helm, Istio, and Google Cloud Platform
                              • Experience with Terraform, ARM, and Infrastructure as Code principles for efficient and scalable infrastructure management
                                • Strong understanding of Linux OS and scripting languages such as Bash and PowerShell
                                  • Experience with Observability toolsets such as Prometheus and Grafana, with an understanding of SLI/SLO concepts
                                    • Strong experience with at least one programming language, such as Go or Python
                                      • Excellent problem-solving and analytical skills, enabling effective decision-making in complex environments
                                        • Advanced English language proficiency (Upper-Intermediate level) enabling clear communication and collaboration with the team and stakeholders
                                          nice to have
                                          • Working knowledge of Golang and Angular for efficient application development
                                            • Experience with Google Cloud/OpenShift for cloud infrastructure design, deployment, and management
                                              • Knowledge of Jaeger, Kiali, and Loki for efficient observability and monitoring

                                                These jobs are for you

                                                benefits for locations

                                                colombia.svg
                                                For you
                                                • Prepaid Medicine with Colsanitas for you and your legal dependents 
                                                • MetLife Life Insurance for you 
                                                • Thousands of projects for top brands
                                                • Stable income
                                                For your comfortable work
                                                • 100% remote work forever
                                                • Free licensed software
                                                • Possibility to work on your own device (BYOD)
                                                • Stable workload
                                                • Flexible engagement models
                                                For your growth
                                                • Free trainings for technical and soft skills
                                                • Free access to LinkedIn Learning platform
                                                • Support from a personal Skill Advisor
                                                • Language courses
                                                • Free access to internal and external e-Libraries
                                                • Access to internal communities and competency centers
                                                • Certification opportunities
                                                get job alerts in your inboxHundreds of open jobs for Software Engineers, QA, DevOps, Business Analysts and other tech professionals
                                                a smiling man wearing sunglasses