All Gravy is the all-in-one employee app designed for frontline, hospitality, and retail workers. 🥞
The next generation of employees is used to apps like Instagram, TikTok, and ChatGPT in their personal lives, but when they go to work at a restaurant chain or retail store, they're met with software created before they were born. We are transforming the industry through AI: modernizing the tools for frontline employees and making the work experience engaging and motivating, so their lives get easier - from coordinating work to developing their careers.
We've grown 3x over the last year and already work with some of the most exciting brands in the Nordics and the UK (Pizza Pilgrims, Honest Burgers, Ottolenghi, Dishoom, Q8, 7-Eleven). Now we're looking to scale even faster.
🤩 About the role
We're looking for an ambitious Site Reliability Engineer to join our Platform team. Our products already power operations for some of the most recognizable brands in the Nordics and the UK, and the bar for uptime, performance, and security keeps rising. You'll join a Platform team that owns the foundations the rest of engineering builds on, and within that team you'll be the main person responsible for application security, performance, and production health.
As a Site Reliability Engineer on the Platform team, you'll work primarily in the codebase - not in infrastructure configs. Your focus is on how our application behaves in production: whether it's secure, whether it's fast, and whether we know about it when it isn't.
You'll work closely with the rest of the Platform team and with engineering leadership to shape strategy, but this is a hands-on role. You'll be reading code, profiling queries, investigating security vulnerabilities, building monitoring, and responding to incidents - not configuring cloud resources or writing Terraform.
🎯 What You Will Do
Security: Finding, Investigating, and Resolving Issues: Own application security across our stack. Proactively hunt for vulnerabilities in our codebase, triage and resolve security alerts, manage dependency patching, and run point on security incidents. You'll review code and architecture for security risks, and work with product teams to fix issues at the source rather than papering over them at the perimeter.
Performance: Query Optimization and Application Speed: Make our product fast. Profile and optimize database queries, identify slow endpoints and bottlenecks, and work with product teams to fix what matters. From MongoDB query patterns to API response times to mobile app cold starts - you'll dig into the code and data to understand where time is being spent and eliminate waste.
Monitoring & Observability: Define what "good" looks like for our services. Build and maintain the metrics, logs, traces, and alerts that tell the story of what the system is doing. Create dashboards that surface real problems rather than noise, and ensure that when something goes wrong, the right people know about it quickly and have the context to act.
Incident Response: Build and run our incident management practice. Lead incidents when they happen, drive blameless postmortems, and turn findings into durable improvements. Make on-call a reasonable experience for every engineer.
Reliability Culture: Be the person who makes reliability feel like a partner rather than a blocker. Teach engineers to own their services, help them understand production behavior, and level up the whole engineering org on how their code actually runs.
Our Ideal Candidate has
Code-First SRE Mindset: You solve reliability problems by reading and improving application code, not just by tuning infrastructure. You're comfortable diving into a TypeScript codebase, understanding query patterns, and profiling performance at the application layer.
Security Instinct: You can look at a codebase and spot problems - injection risks, auth gaps, insecure defaults, dangerous dependencies. You stay current on common vulnerability patterns and know how to systematically find and fix them.
Database & Query Expertise: You understand how queries translate to performance. You can read an explain plan, spot a missing index, recognize an N+1 problem, and reason about data access patterns at scale.
Pragmatism at Startup Pace: You understand that perfect is the enemy of shipped. You can identify what actually matters, right-size solutions to the stage of the company, and revisit decisions as we grow.
Influence Without Authority: Owning reliability and security across a product means your impact depends on bringing engineers along with you rather than policing them. You teach, you explain the "why," and you build trust with product teams so that reliability and security become shared values.
AI-Native Development: A versatile understanding of how to leverage AI in your day-to-day work. The best candidates are top-tier at working with AI tools, can demonstrate concrete ways they've integrated them into how they operate, and actively contribute back to how the wider team adopts and improves its AI-assisted ways of working.
We have a full-fledged TypeScript-based tech stack at All Gravy. We are looking for exceptional people with great energy and a good sense of humour. The ideal candidate will love to think about how to solve
🙌 You'll thrive here if you
Have solid hands-on experience in SRE, backend engineering, or a security-focused engineering role.
Are comfortable working in application code daily; this is not a pure infrastructure role.
Have a startup or scale-up background. You know what it means to move fast, make pragmatic calls, and revisit them when the time is right.
Have a good sense of humour and are willing to contribute to this being a fun place to work.
Are excited about building a big company and want to grow personally on the way there.
Are passionate, organized, and self-driven.
Speak and write fluent English.
Are based within European time zones, whether working from our Copenhagen or London office or remotely.
The ideal candidate has experience with:
Application security: vulnerability discovery, dependency auditing, secure coding practices.
Database performance: query optimization, indexing strategies, profiling slow queries.
Observability tooling: metrics, logs, traces, and alerting (e.g. Datadog, Grafana, or similar).
Incident response and on-call practice.
Working in production TypeScript / Node.js codebases.
MongoDB or other document databases.
An advantage if you have experience with:
Multi-tenant SaaS products.
Performance engineering on mobile or web products.
Penetration testing or security auditing.
Cost optimization at the application layer (query efficiency, caching strategies).
💎 Why All Gravy
Grow fast, for real. We promote from within, move quickly, and give you real responsibility from day one.
Top equipment. You'll receive all-new equipment to do your best work.
A culture that actually develops you. Honest feedback, active coaching, and the autonomy to experiment. We invest in you because your growth is our growth.
Work that matters. Your work will directly impact the daily lives of millions of frontline workers.
You'll work with people who've built, scaled, and won. Ex-founders and operators, all here because they want to do it again, bigger.
The good stuff too. Prime office location, team lunches, Friday bars, padel tournaments, wine tastings, offsites, top-tier equipment, and a fridge that's always stocked. We work hard and celebrate harder.
💡 Even if you do not tick every box on this page, you might be the perfect fit for the job! We treasure our learning culture and encourage all humble people to apply.
