Backend Engineers in Infrastructure work alongside SREs. They are experienced Ruby developers who work in the product with a focus on reliability, observability, performance, and scalability at the application level, as well as on resource optimization from an Infrastructure perspective and on operationally relevant features.
The Scalability team is responsible for optimising GitLab.com performance by improving the reliability, availability, and performance of individual GitLab services and the application as a whole. The Projections Team is responsible for forecasting and projection systems that enable development engineering to understand system growth (planned and unplanned) for their areas of responsibility. We manage the capacity planning and Error Budget tools and processes. We also support the Frameworks Team when needed. Ultimately, this role has a huge scope of influence, as the team works with the whole of engineering on performance and availability concerns.
GitLab.com is a unique site and it brings unique challenges: it’s the biggest GitLab instance in existence. In fact, it’s one of the largest single-tenancy open-source SaaS sites on the internet. The experience of our team feeds back into other engineering groups within the company, as well as to GitLab customers running self-managed installations.
You may be a fit for this role if you:
Have strong programming skills as a backend engineer with Ruby and/or Ruby on Rails.
Have strong experience with profiling and metrics analysis
Have strong experience with observability and monitoring tools such as Prometheus and Thanos
Have experience regularly interacting with customers, and a focus on resolving their requests with a sense of urgency
Are able to use GitLab as your day-to-day go-to tool
Are able to reason about large systems - how they work and can be operated on a large scale, edge cases, failure modes, behaviors
Have experience in collaborating and communicating asynchronously
Have an urge to document all the things so you don't need to learn the same thing twice
Have an enthusiastic, go-for-it attitude. When you see something broken, you contribute to fixing it
Have a strong sense for action and know how to iterate through a problem quickly
Share our values, and work in accordance with those values
Bonus points if you:
Have a background in working with Elastic and Kibana
Have strong programming skills in Golang
Have experience with Sidekiq and Redis
Have good knowledge of SQL and/or PostgreSQL
Working from the Scalability team’s issue tracker, driving changes required to reliably support GitLab at GitLab.com size.
Working on projects in Capacity Planning, Error Budgets, and Infrastructure Cost Data.
Improving our Prometheus monitoring or building new metrics.
You share our values, and work in accordance with those values.