SRE Engineer - Buenos Aires, Argentina - SA Technologies

    SA Technologies
    SA Technologies Buenos Aires, Argentina

    hace 3 semanas

    Default job background
    Descripción

    Role : SRE Engineer --REMOTE

    Locations :Remote

    Duration : Long Term Contract

    Position Summary
    Our client is looking for an experienced SRE Engineer to work in their platform reliability team. The Platform is composed of Mobile SDKs, Web Apps, Ad and Back-End services. Their tech stack is composed of a mix of .NET, Kotlin, Java, and Golang microservices running on Kubernetes in Azure. Understanding how to scale multi-tenant mobile applications and high-performance infrastructure is critical to a successful performance, we are interested in engineering talent that is versatile and has the intellectual curiosity to learn new things and find the best tools to solve our problems.


    Responsibilities Include

    • Design and drive consensus for the operational infrastructure of our platform team. This includes how our microservices are provisioned, how they will run in test and production, how configurations and networking is managed per environment, and how all systems are monitored, supported, and scaled in production.
    • Partner with Platform and Professional Services engineering stakeholders to identify operational needs and deliver solutions as part of the infrastructure.
    • Write, deliver, and maintain infrastructural services that can be used by directly by development teams, including documentation and support materials.
    • Establish and reinforce healthy software engineering practices – including code quality, continuous delivery, and automated testing.
    • Aggressively automate all the things and make it easy to do the right thing when it comes to security, reliability, and resource management using GitOps principles. Drive the automation from build Pipelines (Azure DevOps/ Github actions), Kubernetes operators (Flux/Argo) and scaling via KEDA.
    • Mentor and force multiply other members of the engineering team.
    • Ensure the uptime and scalability or systems through best practices such as auto-healing and circuit breaking.
    • Be a trusted on call support engineer as part of a rotation
    • Other related duties as assigned.

    Required Qualifications:

    • BS or MS in Computer Science / Engineering or relevant work experience
    • 8+ years of professional experience in software development
    • Experience with Kubernetes, GitOps, and Infrastructure as Code
    • Experience operating distributed systems and highly available backend services.
    • Demonstrated ability to be analytical and have strong problem-solving skills.
    • Excellent communication, collaboration skills, and a strong teamwork ethic with both technical and non-technical audiences
    • Proficient in building non trivial applications in Python or Go for infrastructure concerns.
    • Experience working with Azure, and preferably AKS.
    • Strong understanding of modern software engineering practices, including logging, monitoring, continuous integration/deployment, infrastructure as code, and automated testing practices
    • Proven Success operating in production Azure or similar cloud-native deployment and operational contexts, including Kubernetes, Pulumi/Terraform, and Open Telemetry/Grafana/New Relic/Datadog

    Preferred Qualifications:

    • Azure Kubernetes Service (AKS)
    • Kubernetes Service Mesh
    • Terraform
    • E-commerce or Retail experience
    • Experience designing and deploying infrastructure automation for use by multiple application development teams.
    • Experience running Kubernetes workloads in multiple cloud regions