Our Client is seeking a Site Reliability Engineer to join their passionate development team focused on creating engaging games. This role involves improving the service lifecycle, implementing monitoring, and optimizing deployments to ensure reliable and scalable solutions for players and developers. Candidates should have strong experience with GoLang, gRPC, monitoring tools, and build pipelines, alongside excellent communication skills in English.
Key Responsibilities
Engage in improving the whole lifecycle of services from inception through deployment, operations, and refinement
Implement service monitoring and alerting solutions to track service health and performance
Define service level objectives and gather the required metrics to track these objectives
Improve the software lifecycle with automated tests, deployments and builds, as well as performing load testing
Analyse and resolve issues in software, systems, tools, and services to minimize down time and interruption to development
Identify and mitigate risks with both current infrastructure, systems, and technologies as well as potential future risks with scalability and reliability
Work with software engineers to develop new game features with a high standard of code quality, stability, maintainability, readability, and performance.
Work with electrical engineers to integrate custom hardware interfaces and implement tools for thoroughly testing hardware.
Requirements
Able to thrive in a diverse, collaborative team working productively and autonomously
Strong communication skills in English, both written and verbal.
A passion for games is critical as you'll be collaborating with the games development team and contributing to our products.