Trabajos

    Senior Site Reliability Engineer - Buenos Aires, Argentina - Techunting

    Techunting
    Techunting Buenos Aires, Argentina

    Encontrado en: Talent AR C2 - hace 25 minutos

    Techunting background
    De jornada completa
    Descripción

    Senior Site Reliability Engineer who supports the reliability of the production environments of the highly-loaded critical business application that operates 24/7 for one of our key clients in the US.

    The Senior Site Reliability Engineer is responsible for ensuring our Client's SaaS products are fast, stable and optimized for our customers. Our SRE's take on availability, performance, managing change, monitoring, response and are guardians of non-functional requirements.

    You either have an infrastructure background with a programmatic, automated mindset or are someone that comes with a software engineering background with infrastructure experience. The SRE goal is to build automated systems that reduce or eliminate manual work to keep our products up and running and performing optimally. We are looking for someone who thrives on collaboration within the team and across other groups and can operate independently to deliver solutions.

    If it sounds like you, welcome to join us

    Responsibilities

    • Champion and implement a culture of SRE to maintain a high-quality platform infrastructure
    • Champion and implement application and infrastructure monitoring and alerting to prevent client impacting issues by ensuring system availability, performance and scalability to maintain SLOs and SLAs
    • Optimize application performance at scale
    • Automate everything including system operational runbooks
    • Define and support continuous integration and deployment pipelines (CI/CD) aligned to branching and quality assurance strategies
    • Dive deep into technology and stay on the forefront of the latest tools, technologies, and strategies; help evaluate, prototype, and integrate them into work processes
    • Perform with broad independence and deliver on project milestones and tasks on schedule while communicating progress regularly
    • Build strong relationships with SRE team members and software engineering teams to hold each other accountable for quality expectations
    • Learn continuously and apply lessons learned
    • Evangelize best practices, eliminate bottlenecks, and improve process




    Requirements:

    Requirements

    • Azure infrastructure and IaC provisioning with Terraform
    • Experience instrumenting Application Performance Monitoring using New Relic within Azure infrastructure, services, and application functionality (three tiers)
    • Continuous Integration and Delivery and Continuous Deployment experience
    • 2+ years experience creating automated deployments with tools such as Azure DevOps Pipelines, Ansible, Jenkins or other scripting languages to manage infrastructure, software build and deployment in a continuous integration (CI) environment
    • 2+ years experience writing scripts in PowerShell or Python/Bash to automate system operations as runbooks for Windows and Linux environments.
    • 2+ years experience implementing production performance, availability, and scalability monitoring and alerting best practices using a tool such as New Relic
    • 2+ years experience as a global admin of Azure
    • Strong DevOps focus and experience building and deploying Infrastructure as Code with Terraform or similar technology
    • Experience with Solarwind DPA for database monitoring
    • Experience planning, coordinating, developing and executing all stages of test scripts
    • Experience securing Windows or Linux systems in 24x7 production environment
    • Experience with common networking, firewall and load balancing protocols

    Nice to have

    • Experience with Harness is a big plus
    • Dev experience with C#.NET, Java, Javascript, , React is a plus (you will need it when you work on TerraForm and infrastructure)
    • Experience with containerization and managing Kubernetes clusters is a value-add