Site Reliability Engineer

Localisation: Worldwide

Sound is hiring a Site Reliability Engineer to help shape the future of a new music economy that values artists and their music while connecting fans more closely to the music they love.

Sound is a suite of web3-native music and economic tools powering the next generation of artists and their communities. We’re passionate about helping artists capture more value from their art, and connecting fans more closely to the music they love. Since launch, we’ve onboarded over 200 artists (including Snoop Dogg, Pussy Riot, Salem Ilese, RAC, Soulection, and more) and generated over $3.5 million in proceeds that have gone directly to those artists.

As a Site Reliability Engineer, you’ll be responsible for providing a world class experience to our users and artists by detecting and resolving incidents, minimizing outages, and collaborating with product owners to understand user engagement on Sound. We’re looking for a software engineer with a curious mind and the passion to build the next generation of SRE tools and processes within Sound.

What you'll be doing:

Expand and keep customer-facing services available at top performance by maintaining the health of supporting systems
Work closely with our engineering team to define best practices and goals around availability and resiliency
Act in key response roles during major incidents and participate in the technical review of each incident
Contribute to technical design and architecture discussions and decisions as well as technical troubleshooting across our stack
Design, build and operate core infrastructure that enables scaling to support hundreds of thousands of concurrent users
Setup and perform regular load and stress testing, interpreting the results and leading the implementation of improvements to address bottlenecks, increase resiliency and improve scalability
Develop, manage and operate real-time production monitoring, instrumentation and telemetry
Ability to operate in a fast paced environment and troubleshoot complex issues quickly while successfully juggling multiple priorities

Who we're looking for:

4+ years of experience in a senior hands-on site/system reliability role
Proficiency in TypeScript and GraphQL
Experience with building and scaling services using technologies from AWS and CloudFlare
Experience deploying and operating EKS services and databases such as Postgres and Redis
Ability to participate in an on-call rotation and to work independently with minimal supervision
Excellent problem solving skills with a systematic and thorough approach and a bias for action
Track record for being able to diagnose problems within complex systems

Nice-to-haves:

Experience with message queues and caching infrastructure at-scale
Experience designing and implementing microservices and event-driven architectures
Experience with modern frameworks (React, Relay, Next.js)
Understanding of Ethereum, Arweave, IPFS architectures
History of open source contributions

Benefits at Sound:

We offer top-of-the-line benefits, including health, mental health, dental, and vision insurance.
Remote-first teamwork with team and community members around the world
Work-from-home/remote office stipend
Team offsites for periodic collaborative strategy sessions in person
Passionate, supportive team dedicated to learning and growing together in web3

Sound is an equal opportunity employer. We do not discriminate based on gender, ethnicity, sexual orientation, religion, age, civil or family status, disability or race.

POSTULER POSTULER

D'autres postes #sre

RECRUT-INFO

DevOps / SRE / Cloud Engineer [Full Remote possible] F/H

Rattaché·e au CTO, vous interviendrez sur toutes les tâches DevOps / Cloud / Systèmes et en deviendrez la personne référente sur ces sujets au sein de l'équipe constituée de 6 personnes. Vous serez a…

Salaire: 45 - 60 k€ brut annuel
Localisation: Marseille 09 - 13

Seyos

Senior SRE DevOps / Full remote - F/H

Notre client est un éditeur de logiciels RH qui compte plus de 1 500 clients et 250 000 utilisateurs. Leur métier : automatiser les process administratifs et RH des PME et ETI : gestion des congés, n…

Salaire: 55 - 90 k€ brut annuel
Localisation: Paris 13 - 75

Skill Hunter

Backend / SRE Node JS F/H

Vous souhaitez évoluer dans l’équipe Française d’un leader mondial dans le monde de la blockchain ? Cette entreprise qui existe depuis 2018 compte aujourd’hui plus de 7 000 personnes à travers le mo…

Salaire: 120 - 150 k€ brut annuel
Localisation: Paris 06 - 75

SAS RECHERCHES ET HORIZONS

Site Reliability Engineer F/H

Au sein d'une équipe agile, vous travaillez sur un produit générant un grand nombre de requêtes quotidiennes. Vous fournissez l'outillage et êtes en charge de l'automatisation afin d'optimiser le Run…

Salaire: 40 - 55 k€ brut annuel
Localisation: Lambesc - 13

En voir d'autres