Senior Site Reliability Engineer @ Cabify
Site Reliability Engineers at Cabify work on improving all aspects of our platform and have an impact across the whole organisation. They are a blend of systems engineers and software developers who solve scalability issues with software and implement the best production engineering and security practices.
Cabify
Do you want to change the world? At Cabify, that's what we're doing. We aim to make cities better places to live by improving mobility for the people living in them, connecting riders to drivers, providing mobility alternatives such as scooters and mopeds and many others to come, all at the touch of a button. Maybe one day cities will be places where nobody needs a private car. But we've still got a long way to go… Fancy joining us?
As a Site Reliability Engineer, you will be:
- Evolving our infrastructure platform by building self-service components that will be used by the whole engineering team and by millions of users around the world.
- Working closely with our Product and Infrastructure teams to architect and develop world-class infrastructure components.
- Designing and implementing tooling to improve the availability, scalability, observability and latency of our services, which are used by internal customers to deploy and operate their services.
- Increasing reliability awareness with other teams, helping with the adoption of reliability principles and reviewing observability implementations or software architectures.
- Defining SLIs, SLOs and SLAs as part of the services' lifecycle.
- Sharing an on-call schedule for the platform services you own.
- Solving problems in our highly available platform together with other teams, then building automation to prevent incidents from happening again.
- Participating in our recruiting process to help grow our engineering team.