Posted March 19, 2024

Systems Engineer

Location: Berlin, Germany

About Us:

Sauce Labs is the leading provider of continuous testing and error reporting solutions that give companies the confidence to develop, deliver, and update high-quality software at speed. The Sauce Labs Continuous Testing Cloud identifies quality signals in development and production, accelerating the ability to release and update web and mobile applications that look, function, and perform exactly as they should on every browser, operating system, and device, every single time. Sauce Labs is a privately held company funded by TPG and Riverwood Capital.

The Role:

We’re looking for a Systems Engineer to join our Ops Team. In this role you will be responsible for operating and scaling the infrastructure and software that power Sauce Labs and launch over 10 million VMs a month.

Responsibilities:

  • Install, configure and debug hardware and systems in our data center
  • Deploy new services to cloud-based providers via Terraform
  • Write tools and scripts to provide automation and self-service solutions for ourselves and other teams
  • Design new systems to support production services
  • Creatively solve scaling challenges in a rapidly expanding cloud environment
  • Work with real hardware: Dell, SuperMicro Twin-Pro, storage (NAS and SAN), Apple, custom appliances for mobile devices, load balancers, and beyond
  • Help improve monitoring and identify key performance metrics
  • Conduct proactive R&D: discover and implement new tools and emerging technologies
  • Design, implement, and maintain disaster recovery
  • Create NOC runbooks, procedures, documentation, and diagrams of the environments you manage
  • Troubleshoot and resolve server, application, and network issues
  • Help build out new data centers around the globe
  • Manage the infrastructure lifecycle
  • Participate in a 24x7 on-call rotation

Here are some examples of the kinds of projects you might work on:

  • Optimize hardware and configuration to improve hypervisor performance
  • Automate the deployment of operating systems to bare-metal servers
  • Build and optimize a Prometheus/Alertmanager/Grafana stack for our development team to monitor and analyze production system usage
  • Migrate an on-prem service to a cloud provider
  • Build new pipeline-based tooling to manage system lifecycles

We have a lot of big projects to deliver and decisions to make, and in this role you would be a key part of that process. Sound like fun? Here’s what we’re looking for:

Required Skills:

  • Able to execute on high-level goals independently and with cross-functional teams
  • 2+ years of recent experience working as a Linux administrator/engineer at scale (hundreds of systems) and designing/deploying highly available solutions
  • 1+ years of recent professional experience designing, developing, and operating configuration management solutions (Ansible preferred) at scale
  • Solid experience in Linux tuning, profiling, and monitoring
  • Strong skills in at least one language: Python (preferred) or Go
  • Experience deploying/managing KVM/QEMU
  • Experience with Kubernetes, Docker, and their ecosystems
  • Experience deploying services with Terraform
  • Solid understanding of cloud, networking, and distributed computing concepts, including TCP/IP connections, firewalls, VLANs, etc.
  • Familiarity with running services in GCP and AWS
  • Familiarity with ZFS on Linux and storage appliances (iSCSI and NFS)
  • Experience with and understanding of contemporary metrics, monitoring, and logging solutions, especially Prometheus, Grafana, Sumo Logic, Stackdriver, and Nagios
  • Highly organized, able to multi-task, able to work individually, as well as within a team, and across teams
  • Excellent communication skills, both verbal and written across all user levels
  • Experience with deployment automation in physical and virtual environments (PXE; Foreman preferred)
  • Experience with InSpec, Ansible Molecule, or a similar tool for testing configuration management
  • Working knowledge of load balancing technologies (hardware and software)
  • Proven experience collaborating in a cross-functional team environment
  • Familiarity with software engineering practices, configuration management, and development methodologies (e.g. Kanban, Agile, waterfall, spiral, prototyping)

We are a hybrid workplace that recognizes the importance of flexibility while valuing in-person collaboration and relationship building. As a result, Saucers located near an office location must be able and willing to come into the office. Those hired remotely must be able and willing to travel to an office as required by the specific role. 

Please note our privacy terms when applying for a job at Sauce Labs.

Sauce Labs is proud to be an Equal Opportunity employer and values diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender identity/expression/status, sexual orientation, age, marital status, veteran status or disability status.

Security responsibilities at Sauce

At Sauce, we commit to supporting the health and safety of employees and properties, partnering with internal stakeholders to learn and act on ever-evolving security protocols and procedures. You’ll be expected to fully comply with all security policies and procedures at the department and org-wide level and to take a ‘security first’ approach to how we design, build, and run our products and services.

© 2023 Sauce Labs Inc., all rights reserved. SAUCE and SAUCE LABS are registered trademarks owned by Sauce Labs Inc. in the United States, EU, and may be registered in other jurisdictions.