Back to jobs

Site Reliability Engineer

Job description

Full-time, Permanent Opportunity
Location- Cork

Force recruitment have partnered with a leading German automation manufacturing company who are currently looking for a Site Reliability Engineer or Dev Ops Engineer. The profiles within their Software team include: Software Architects, Software Developers, Software Testers, Requirements Engineers, Product Owners, Scrum Masters and Agile Coaches. This is an opportunity for an experienced SRE or DevOps engineer to join a new team and to enter at an early stage, with the ability to shape the future of the systems and cloud platform services in EMEA.

Responsibilities:

  • Design, implementation and maintainability for robust, scalable, high-quality software and systems within the SRE domain.

  • Management and support of the in-house development systems, CI/CD pipelines and tools, monitoring and alerting, leaning on automation to streamline activities and to reduce toil.

  • Work closely with developers and architects to ensure designed solutions meet non-functional requirements such as availability, performance, security and maintainability.

  • Incident management, post-mortem reviews and continuous improvement initiatives, contributing to the evolution of processes and systems within the organization.

  • Contribute to the definition of key metrics and technical decisions driving products and their delivery: SLOs and SLAs, architecture, best practices, cost optimization.

  • Take responsibility for complex project tasks, strive for and achieve higher standards of individual and team performance.

  • Build relationships external to the team.

  • Drive and achieve knowledge sharing across all product.

  • Identify personal development opportunities, set goals and deliver against them, turn learning into impactful on-the-job contributions.

  • Mentor and train other engineers throughout the company and drive company-wide improvement.

 

What are we looking for:

  • 3-5 years using AWS platform and services.

  • Experience with common tools and technologies used within CI/CD and Build pipelines: Terraform, Jenkins, Gitlab, Nexus, Ansible, Maven, Docker, Helm.

  • BSc in an IT related field (e.g Computer Science, Cloud Computing, Engineering) or 3-5 years’ professional experience on cloud operations and/or cloud platforms in a DevOps engineering or SRE role.

  • Familiarity with commonly used.

  • Experience with, and high-level understanding of, common operating systems including Ubuntu and Windows.

  • Experience scripting in Bash, Python or Powershell.

  • Experience troubleshooting issues in a cloud environment.

  • Experience working with multiple teams to facilitate orderly project and release plans

  • Familiarity with VMWare vSphere (ESXi, vCenter) desirable.

Essential Criteria:

  • Experience on as many AWS services as possible: compute (EC2, Lambda) and containerization (EKS, ECR), storage (S3), database (RDS, Dynamo), networking (ELB, VPC), automation (CloudFormation), IAM (Cognito), security (Security Hub, Shield, GuardDuty, Control Tower, KMS…), monitoring and logging (Prometheus, CloudWatch, Cloudtrail…), backup and configuration management (Backup, Config).

  • Experience in an SRE team and understanding of SRE principles.

  • Experience in backup and restore processes and procedures.

  • Experience in cloud based multi tenancy.

  • Experience in emergency response & on call.

  • Understanding of the current best practices around Security Management and patching, branching strategy, release management, Linux administration.

  • Experience working in an agile development team using SCRUM.