Senior Site Reliability Engineer
Moonpay
Estimated Salary: $120K-$200K
Location: Lisbon / Madrid / Barcelona / London / Cape Town
Hi, we’re MoonPay. We’re here to onboard the world to the decentralized economy.
Why?
Because crypto and blockchain aren’t just technologies—they’re tools for global financial empowerment. They give people control over their money, their digital assets, and their future, unlocking opportunities that traditional systems have kept out of reach.
What we do
At MoonPay, we’re building the infrastructure that powers this new financial system. We make it easy for anyone, anywhere, to buy, sell, and trade crypto using everyday payment methods like cards, Apple Pay, PayPal, Revolut and Venmo. We provide simple tools to send, receive, and manage stablecoins, so anyone can participate in the crypto economy confidently.
Trusted by nearly 30 million customers and over 500 companies, our secure, enterprise-grade platform is driving mainstream crypto adoption worldwide.
We collaborate with innovative brands and projects to build secure, scalable solutions for a blockchain-powered future. And we’re committed to doing it right—fully licensed in the U.S. and regulated across the UK, EU, Canada, and Australia—because trust and compliance are non-negotiable.
But we’re just getting started. We’ve launched a consumer app that makes crypto accessible, intuitive, and usable for everyone, and it’s growing fast. We’re iterating every day to make it the best it can be.
If you believe financial freedom should be for everyone—if you believe in building a fairer, more open financial system—we want you with us. To build systems that benefit all, we need contributions from all, regardless of background.
Come build the future of payments and the decentralized economy with MoonPay. Let’s make financial freedom and autonomy the new normal.
🌔 About the Opportunity
Site Reliability Engineering at MoonPay is responsible for providing a resilient, secure, production-ready platform that enables MoonPay to safely deploy applications and services in a self-serve, repeatable manner. We believe that SRE should support both our product delivery and operational teams by surfacing data from our production environment and driving meaningful change based upon what we learn from it.
🚀 What you will do
In the
short term
we need to increase the resiliency and reliability of our current PaaS solution with things such as:
·
Improving the maintainability of our infrastructure as code
·
Building dashboards, monitoring & alerting mechanisms with Datadog
·
Load testing and performance tuning our production services
·
Lifecycling and maintenance of our Kubernetes clusters
In the
medium to long term
you’ll get to:
·
Implement
new and shiny technologies on top of Kubernetes as you see fit to ensure our tech can scale with the business.
·
Develop
and integrate solutions with a bias for automation in order to improve and maintain reliability across the production estate and make recovery easier.
·
Design
and track metrics for site uptime and performance ensuring high levels of visibility are maintained.
·
Own
the deployment pipelines and continuously improve our monitoring and alerting capabilities.
·
Collaborate
closely with all other engineering functions to provide timely feedback from our environments.
·
Support
Engineering on their journey to deliver better software, faster and more safely (think “It’s OK to deploy on Fridays” 😎).
💻 What you will be working with
·
Typescript
·
Node.js
·
TypeORM
,
TypeDI
,
TypeGraphQL
and
routing-controllers
·
React
and
NextJS
hosted on
Vercel
·
Google Cloud Platform
·
Postgres
·
Redis
·
Bull
, BullMQ
·
DataDog
·
ArgoCD
·
Kubernetes
·
GitHub
·
Jes
t
🧑🚀 About You
· S
trong systems administration skills, know the difference between a container and a virtual machine, and know your way around a Linux terminal
· P
latform engineering/SRE experience at leading startups or fast growing tech companies
· E
ither experience with some of our tech stack or are confident you can cross train and up skill quickly
· E
xperience working in a regulated industry
· C
onfident working with and guiding developers on monitoring and logging of complex systems at scale
· W
orked on complex projects
· W
ork collaboratively with different teams i.e. Security, Data, Engineering
· W
ant to forge and own MoonPays reliability & recovery processes
· Have
at least a basic understanding of complex reliability structures, theories, principles, and best practices
· W
orked with JavaScript codebases and frameworks e.g Typescript,
Node.JS
and React