Site Reliability Engineer - (100% remote)

Permanent employee, Full-time · remote

General info
We are looking for a Site Reliability Engineer

Giant Swarm is a fast-growing open-source infrastructure management platform used by modern enterprises. Our vision is to empower developers around the world to ship great products.
We're a distributed, diverse, and growing team, spread across Europe. The company is based in Cologne, Germany, where we have a small office in a coworking space. However, only a few people work there. All workflows are created to function remotely - but of course, if you want to visit Cologne, you are more than welcome!

While we are remote-first, we appreciate quality time with our co-workers, so we meet in person twice a year to work and have fun together.

Work-life integration

  • Flexible working hours, and working from home or anywhere you prefer but please note that your permanent location should be somewhere in Europe or on the US east coast.
  • Currently, the number of kids from our team members outnumbers the number of employees.
  • We don’t only care about the kids “within” the company, but also about all children - for example, we compensate the carbon of all our flights.
  • As an international company, we want to create similar standards for everyone, regardless of location. So, additional perks (for example, a location-aware, fixed amount paid each month to cover costs like co-working, phone contracts or gym memberships), paid parental leave and healthcare compensation are compulsory.
Description

Your Job

  • You maintain, operate and upgrade our own and our customer’s Kubernetes clusters.
  • You will design, configure, build, and maintain our core infrastructure, from kernel parameters to the cloud provider templates.
  • You understand how servers and systems work and you tweak their behaviour to your needs.
  • You will be responsible for our monitoring, logging and alerting.
  • You will help resolve incidents on our own and our customer’s clusters.
  • You participate in the on-call support schedule (~ one 24 hours shift every 2 weeks)
  • You are a go-to person in case our developers need advice regarding infrastructure.
  • You will automate all the things.
Your profile

Requirements

  • You must have deep, hands-on knowledge of Kubernetes from both the end-user and the operation side.
  • You have wide experience with and are able to debug Networking, Security, Linux (Kernel, Namespaces, cgroups).
  • You have great debugging skills and you are not afraid to deep dive into thousands of lines of logs.
  • You have decent coding skills, preferably in Go. You have experience with maintaining infrastructure with code.
  • You know the good and bad parts of various automatization tools (Terraform, Chef, Puppet, Ansible or Saltstack).
  • You are fluent with CNCF products running on top of Kubernetes (prometheus, grafana, ingress controller, …) you know how to use them and how to configure them.
  • You have a decent knowledge of storage including software-defined storage.
  • You like reverse and performance engineering.
  • You automate all the things by writing code. Using bash scripts for it makes you sad :)
  • We (and our customers) are currently mostly distributed around Europe (around UTC), thus, your main timezone should be somewhere between -2UTC to +2UTC to ensure better communication.
Why us?

Why we think this job is worth applying for (challenge us!)

Impact, Impact, Impact! We are a remote-first organization with a growing team from 15+ European countries. Every new team member changes the team. This is great! People who know things we don’t are highly welcome.

“It's easier to ask forgiveness than it is to get permission” (Grace Hopper) - sure, it’s not 100% like this, but we have a strong culture of failure which, is part of our agile mindset. We don’t do things like in the guidebook. You can try things out! Our default to 100% transparency will help you here.
We play a key role in our customers' digital transformation. We have partnered up with Amazon and Microsoft to provide our solution on their cloud platforms - more will follow.
We have been in this ecosystem from the get-go and as part of the CNCF family, we feel at home in the community. As a part of Giant Swarm, you will also join this extended family.
We serve some of Europe's leading organizations and are talking to many more.

WHY Giant Swarm?

We like to give you a glimpse on how working with is like:

Self-organization

Creative work needs freedom and openness. We encourage you to do your work wherever and whenever you want. We expect passion and encourage sustainability. If you need rest, take it. We don't count holidays - but we are also aware that this combined with remote work can also lead to working too much. So we encourage you to take holidays and help you to manage the freedom and flexibility.

Teamwork

We are a growing company with team members distributed all over Europe and plans on expanding to the US. Our ambitious goals are only achievable as a team. Everybody’s input is highly welcome and appreciated. Although sometimes rules and processes are necessary, we try to keep them as lean as possible. Always question the status quo and find new ways of collaboration and teamwork.

Learning

Learning is mandatory and fun at the same time. If you realize you want to expand your knowledge in a specific area, we support you with conferences, books etc.

Basics

We offer fair (transparent and open) salaries with benefits like choosing your own laptop, additional perks (for example, a location-aware, fixed amount paid each month to cover costs like co-working, phone contracts or gym memberships), paid parental leave and healthcare compensation are compulsory. And you will participate in our stock options program. Currently, our team members have more children than we are employees. So family-friendliness is a must.

Interested? Questions? Coffee? Contact Larissa (Larissa@giantswarm.io) or apply directly!
About us
Giant Swarm is a fast growing open source infrastructure management platform used by modern enterprises. Our vision is to empower developers around the world to ship great products.

We're a distributed, diverse, and growing team, spread across Europe. The company is based in Cologne, Germany, where we have a small office in a coworking space. However, less than 5% of us actually work there. All workflows are designed to function remotely - but of course, if you want to visit Cologne, you are more than welcome!

Yes, all positions are remote! But we (and our customers) are currently mostly distributed around Europe (around UTC), thus, your main timezone should be somewhere between -2UTC to +2UTC to ensure better communication.

What we offer on top: 
  • Choose the hardware you like the most!
  • Family first - we have more kids than employees!
  • Join our team at conferences all over the globe!
  • Internal Hackathons - we love to challenge ourselves!
  • 2 Off-Sites per year (check our photos on instagram)!

We are not hiring job descriptions. We hire humans. :) We welcome applications from everybody, regardless ethnic or national origin, religion, gender identity, sexual orientation or age.
We are looking forward to hearing from you!
Thank you for your interest in Giant Swarm. Please fill out the following short form. Should you have difficulties with the upload of your data, please send an email to jobs@giantswarm.io.

Please upload your CV, recent certificates and a short cover letter (max. 20 MB in total).

Click to select multiple files or use drag-and-drop
Uploading document. Please wait.