Senior Site Reliability Engineer (remote Europe)

Forbach (57600), Moselle Temps-plein Mise à jour le 26 juillet 2024

Entreprise: Booming Games

About the role

Join our team at Booming Games as a Site Reliability Engineer and ensure the peak performance and reliability of our systems across multiple geographical locations! As a key player in troubleshooting and resolving complex issues, you will collaborate with engineers to drive automation, standardization, and optimization efforts. Your expertise in operating systems, networking, and distributed systems, combined with your passion for problem-solving, will make you an invaluable asset. If you are ready to revolutionize the reliability and scalability of our services while working with cutting-edge technologies, this role is perfect for you.

Responsibilities

Daily interactions ensuring the health and maintenance of systems in different geographical locations: hardware, software, application and network are operating at peak performance.Perform deep dives into both systemic and latent reliability issues; partner with software and systems engineers across the organization to produce and roll out fixes.Troubleshoot issues across the entire stack: hardware, software, application and network.Drive standardization efforts across multiple disciplines and services in conjunction with SREs throughout the organization.Identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management and visibility of our services.Represent the SRE organization in design reviews and operational readiness exercises for new and existing services.Work with software engineers to improve upon deployment processes.Participate in the on-call rotation for production systems.3rd line support in the networks and infrastructure team being the last line of defense in Engineering Support EscalationManage the server and network infrastructure, assist in the development of security strategies and their implementation and participate in global network infrastructure upgrades with upstream providersWork with both SRE & Development teams on new projects and technologies such as: New Infrastracture Setup, Kubernetes Migrations, New GeographicLocations, Monitoring & Upgrades and moreDaily interactions ensure the health and maintenance of stacks in different geographical locations: hardware, software, application and network are operating at peak performance.Perform deep dives into both systemic and latent reliability issues; partner with software and systems engineers across the organization to produce and roll out fixes.Troubleshoot issues across the entire stack: hardware, software, application and networkPromote openness, diversity of opinions and inclusive discussions at all times to evaluate a wide variety of ideas and perspectives in solving challenging problemsDemonstrate clear decision making and good trade-offs in complex situations comprising multiple opinions, needs, teams, technologies, cloud providers, and architectural settingsCommunicate effectively with stakeholders ranging from executives to junior engineers across the breadth and depth of the engineering organizationEnable the engineering organization to innovate and deliver with greater speed and safetyAny other tasks or responsibilities that may be given in the due course of role.

Daily interactions ensuring the health and maintenance of systems in different geographical locations: hardware, software, application and network are operating at peak performance.

Perform deep dives into both systemic and latent reliability issues; partner with software and systems engineers across the organization to produce and roll out fixes.

Troubleshoot issues across the entire stack: hardware, software, application and network.

Drive standardization efforts across multiple disciplines and services in conjunction with SREs throughout the organization.

Identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management and visibility of our services.

Represent the SRE organization in design reviews and operational readiness exercises for new and existing services.

Work with software engineers to improve upon deployment processes.

Participate in the on-call rotation for production systems.

3rd line support in the networks and infrastructure team being the last line of defense in Engineering Support Escalation

Manage the server and network infrastructure, assist in the development of security strategies and their implementation and participate in global network infrastructure upgrades with upstream providers

Work with both SRE & Development teams on new projects and technologies such as: New Infrastracture Setup, Kubernetes Migrations, New Geographic

Locations, Monitoring & Upgrades and more

Daily interactions ensure the health and maintenance of stacks in different geographical locations: hardware, software, application and network are operating at peak performance.

Perform deep dives into both systemic and latent reliability issues; partner with software and systems engineers across the organization to produce and roll out fixes.

Troubleshoot issues across the entire stack: hardware, software, application and network

Promote openness, diversity of opinions and inclusive discussions at all times to evaluate a wide variety of ideas and perspectives in solving challenging problems

Demonstrate clear decision making and good trade-offs in complex situations comprising multiple opinions, needs, teams, technologies, cloud providers, and architectural settings

Communicate effectively with stakeholders ranging from executives to junior engineers across the breadth and depth of the engineering organization

Enable the engineering organization to innovate and deliver with greater speed and safety

Any other tasks or responsibilities that may be given in the due course of role.

Requirements

Sound fundamentals in operating systems, networking, and distributed systems.Exemplify high accountability, integrity, and resilience to maintain focus on both big-picture goals and milestones to get thereStrong familiarity with Linux systems administration and management best practices.Familiarity with container technologies: Kubernetes, CRI, Docker, namespaces, cgroups.Strong understanding of: Ethernet, VLANs, IPv4/IPv6, ARP, DHCP, DNS, and TCP.Familiarity with distributed system problems: leader election, Raft consensus, etc.Solid understanding of systems and application design, including the operational trade-offs of various designs.Expert level understanding with at least one public or private cloud technology such as Amazon AWS, Google GKE, or OpenStack.Practical knowledge of various aspects of service design, including messaging protocols and behavior, caching strategies and software design practices.Practical intermediate knowledge of shell scripting, some Ruby is a plus.Demonstrable knowledge of TCP/IP, HTTP, web application security, and experience supporting multi-tier web application architectures.Excellent knowledge of Linux/UNIX systems administration and performance tuning.Comfortable configuring DNS, DHCP, and LAN/WAN technologies.Minimum 5 years of managing services in an internet scale *nix environment.Must be able to communicate well with technical as well as non-technical colleagues to achieve business goals.Must be adaptable and able to focus on the simplest, most efficient and reliable solutions.Track record of successful practical problem solving, excellent written and interpersonal communication in English, and documentation skills.Curiosity and an interest in networking, systems software, and distributed systems.Experience as a systems administrator or operations engineer.Experience with a 24/7 production environment.Experience with managed deployments providing software, platforms, or infrastructure as a service.Experience with SuperMicro server and storage gear is a plus.

Sound fundamentals in operating systems, networking, and distributed systems.

Exemplify high accountability, integrity, and resilience to maintain focus on both big-picture goals and milestones to get there

Strong familiarity with Linux systems administration and management best practices.

Familiarity with container technologies: Kubernetes, CRI, Docker, namespaces, cgroups.

Strong understanding of: Ethernet, VLANs, IPv4/IPv6, ARP, DHCP, DNS, and TCP.

Familiarity with distributed system problems: leader election, Raft consensus, etc.

Solid understanding of systems and application design, including the operational trade-offs of various designs.

Expert level understanding with at least one public or private cloud technology such as Amazon AWS, Google GKE, or OpenStack.

Practical knowledge of various aspects of service design, including messaging protocols and behavior, caching strategies and software design practices.

Practical intermediate knowledge of shell scripting, some Ruby is a plus.

Demonstrable knowledge of TCP/IP, HTTP, web application security, and experience supporting multi-tier web application architectures.

Excellent knowledge of Linux/UNIX systems administration and performance tuning.

Comfortable configuring DNS, DHCP, and LAN/WAN technologies.

Minimum 5 years of managing services in an internet scale *nix environment.

Must be able to communicate well with technical as well as non-technical colleagues to achieve business goals.

Must be adaptable and able to focus on the simplest, most efficient and reliable solutions.

Track record of successful practical problem solving, excellent written and interpersonal communication in English, and documentation skills.

Curiosity and an interest in networking, systems software, and distributed systems.

Experience as a systems administrator or operations engineer.

Experience with a 24/7 production environment.

Experience with managed deployments providing software, platforms, or infrastructure as a service.

Experience with SuperMicro server and storage gear is a plus.

Good to know

We kindly ask for your understanding that we can only consider applicants within the the Central European Timezone +/-2To be considered for the role, we kindly ask that you submit your resume/CV in EnglishThis full-time position can be a permanent employment in Malta or on a freelance basis for contractors in the other countries

We kindly ask for your understanding that we can only consider applicants within the the Central European Timezone +/-2

To be considered for the role, we kindly ask that you submit your resume/CV in English

This full-time position can be a permanent employment in Malta or on a freelance basis for contractors in the other countries

Why Work for Booming Games

Being part of Booming Games' growing team provides the chance to work with market-leading products and services, while enjoying freedom and flexibility. Joining our modern, rapidly expanding company means contributing to the ultimate gaming experience for clients and players. We are proud of our one team mentality that allows us to foster a collaborative environment, where everyone works together towards a common goal. So, why consider working for us? We stand on a solid foundation of core values that guide every action. Moreover, we offer fantastic employee benefits, including fully remote work, flexible hours, engaging team events, and thoughtful gifts. Don't hesitate — apply today and become part of our exciting journey!

Our perks

We’re a 100% remote-first company, offering a best-in-class remote experienceCompetitive packageEducational allowance per yearDog friendly offices in Berlin and MaltaFamily friendly policiesCareer growth opportunitiesZestful working environmentKnowledge sharing opportunitiesDynamic culture surrounded by industry expertsCompany social events for team bonding

We’re a 100% remote-first company, offering a best-in-class remote experience

Competitive package

Educational allowance per year

Dog friendly offices in Berlin and Malta

Family friendly policies

Career growth opportunities

Zestful working environment

Knowledge sharing opportunities

Dynamic culture surrounded by industry experts

Company social events for team bonding

Contact

HR Team

Booming Games

Booming Games delivers high-end gaming solutions to the online casino slots market. Our uniquely themed games with innovative features are our key to success.