At NetworkGuard, our mission is to help everyone protect their online privacy, security, and digital rights.
As a Site Reliability Engineer (SRE), you’ll be at the forefront of the next phase of our evolution, applying SRE principles to maximise the engineering velocity of developer teams while improving overall reliability through automation. You’ll also face and solve unique challenges that we’re confident you won’t encounter in most other industries.
If you’re looking to have a significant impact on the lives of people all over the world by helping preserve the right to freedom of speech, while solving interesting engineering problems at a global scale, then drop us a line and come join us!
What you’ll be doing
- Analysing and troubleshooting a variety of complex operational problems surrounding our VPN and other services, including speed and quality issues, resilience, and situational awareness
- Providing engineering support to other operational teams, including advice, technical reviews, and architecture analysis
- Leading or joining project teams to execute specific operational changes or deployments
- Specialising in one or more of our core services in order to provide in-depth analysis, escalation support, and insights into service operations and issues
- Acting as the conduit for other teams’ operational tooling needs, translating them into technical requirements our development teams can execute on
- Mentoring and coaching junior engineers and engineers/analysts from other operational departments
- Building and maintaining scalable monitoring capabilities for operational metrics and alerting
- Developing an in-depth understanding of the operational technologies and metrics that help us measure quality, speed, and user experience, and explaining how data flows through our operational systems from customer to destination
- Providing escalation support to our Operations Service Desk
What you’ll need to succeed
- DevOps or Site Reliability Engineering experience
- Strong understanding and demonstrated experience of automation concepts and deployment at scale
- Solid understanding of the full network stack
- Proficiency in two or more programming languages
- Proven capability in guiding and mentoring junior engineers
- Good verbal and written communication skills in English, with the ability to create clear, concise documentation and to advocate for your ideas among engineering peers
- Experience supporting production operations at scale, investigating and resolving intricate and diverse problems
- Ability to work with minimal direction and adapt to unique challenges