Site Reliability Engineer
We are forming a team of rapid response to resolve business impacting technical incidents and to shore up processes and build automation to reduce or mitigate downtime. This role involves being the first point of technical escalation of issues within our infrastructure both in cloud and on-prem. It also includes participating in stand-ups with development teams and informing your squad of updates and changes to our platform. The role focuses on automating everything, including workflow and tool automation, such as deployments of distributed applications and infrastructure using various scripting languages to allow 24/7 Incident Engineers to mitigate incidents without escalation. The Site Reliability Engineer will be able to analyse, diagnose and solve issues in the production environment with minimal escalations to supporting 3rd Level support teams. This position also involves participating in the Change Management process via review of RFC’s to ensure “Definition of Done” as well as executing and supporting software and hardware deployments. Developing and documenting ways-of-working between the LiveOps (NOC) Team and the development teams to improve efficiencies in diagnostics and impact mitigation is also a key aspect of this role.
- Being the first point of technical escalation of issues within our infrastructure both in cloud and on-prem.
- Participating in stand-ups with the development teams and informing your squad of updates and changes to our platform.
- Automating everything – Workflow and tool automation - such as deployments of distributed applications and infrastructure using various scripting languages to allow our 24/7 Incident Engineers to mitigate incidents without escalation.
- Able to analyse, diagnose and solve issues in the production environment with minimal number of escalations to supporting 3rd Level support teams.
- Participate in Change Management process via review of RFC’s to ensure “Definition of Done” as well as executing and supporting software and hardware deployments.
- Developing and Documenting ways-of-working between the LiveOps(NOC) Team and the development teams to improve efficiencies in diagnostics and impact mitigation.
- Supporting and troubleshooting.
- Using Automation and configuration management tools (Octopus, Team City, Terraform) (required).
- AWS Cloud infrastructure, CDNs, and other various systems running in multiple data centres and environments (required).
- Cloud Application Load Balancer, preferably with experience on AWS ALB (required).
- Cloud DNS support such as AWS Route 53, GCP Cloud DNS, or Azure DNS (required).
- Serverless Computing such as AWS Lambda (required).
- Cloud Firewall such as AWS WAF (required).
- Server virtualisation such as VMware, IaaS and PaaS cloud such as AWS and Azure (required).
- Open-source monitoring and alerting tools (Prometheus, Loki, Grafana and Jaeger) (required).
- Scripting in Python, Bash, Powershell or others (required).
- Microsoft SQL databases via Stored Procedures, Locking/Unlocking tables and running select statements to assess impact and diagnose problems (required).
- Bachelors degree or equivalent experience, technical degree beneficial (preferred).
- Aws Cloud practitioner or equivalent would be beneficial (preferred).
- You will be working on 24/7 shift basis with opportunity for remote working on limited basis.
From a single slot machine in 1963 to a Nasdaq Stockholm-listed organisation with licences across multiple jurisdictions, Betsson has evolved into a diversified, multinational business. Today we employ around 3,000 people representing more than 75 nationalities across +20 locations. Betsson AB is headquartered in Stockholm, while our operational headquarters in Ta’ Xbiex, Malta, drive the day-to-day business under what we refer to as Betsson Group. Our vision is to deliver the best customer experience in the industry. Through a portfolio of leading brands such as Betsson, Betsafe and NordicBet, we offer casino, sportsbook and other gaming products in regulated markets across Europe, South America, North America and Central Asia. Our proprietary technology underpins a scalable model that serves both B2C customers and B2B partners. Sustainability is embedded in our strategy. Responsible growth, customer protection and a commitment to our people and the communities we operate in remain central to how we create long-term value.
