
What jobs are available for DevOps Engineers in London?

Showing 143 DevOps Engineer jobs in London

Site Reliability Engineer

£75000 - £90000 annum orbit

Posted 655 days ago


Job Description

Permanent


Site Reliability Engineer (SRE)

Farringdon, London £80000 - £95000 Annually Charles Simon Associates Ltd

Posted 5 days ago


Job Description

permanent

Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote

Location: Remote (occasional travel to Nottinghamshire HQ)
Salary: Up to £95,000 per annum + benefits
Start Date: ASAP

Charles Simon Associates are working with a global organisation who are looking to recruit a Site Reliability Engineer (SRE) on a permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery.

Responsibilities include:

Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance.

Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics).

Managing Infrastructure as Code (Terraform, Pulumi, CloudFormation) for scalable, repeatable deployments.

Automating with PowerShell, Python, or Bash to drive efficiency.

Supporting Kubernetes and AKS environments in production.

Leading incident response, postmortems, and continuous improvement processes.

Driving cost optimisation, capacity planning, and load testing.

Championing best practices in cloud security and resilience.

Key Skills & Experience Required:

Proven Site Reliability Engineering background.

Strong Terraform skills with live environment deployment.

Kubernetes / AKS expertise.

Scripting in PowerShell, Python or Bash.

Monitoring experience (Datadog preferred, Azure or Grafana considered).

Background in web applications and distributed systems.

Desirable Skills:

Knowledge of Microservices Architecture.

Familiarity with Kanban.

Experience with Puppet or Chef.

If you’re passionate about Site Reliability Engineering and want to work in an environment where “that will do” is never good enough, this role is for you.



Site Reliability Engineer (SRE)

Farringdon, London Charles Simon Associates Ltd

Posted 10 days ago


Job Description

full time

Site Reliability Engineer – (SRE, Terraform, AKS, Azure, Kubernetes, PowerShell, Python, Bash, Datadog, Monitoring Tools) – Permanent – Remote

Location: Remote (occasional travel to Nottinghamshire HQ)
Salary: Up to £95,000 per annum + benefits
Start Date: ASAP

Charles Simon Associates are working with a global organisation who are looking to recruit a Site Reliability Engineer (SRE) on a permanent basis. This is an exciting opportunity to join a forward-thinking business where reliability, scalability, and automation are at the heart of technology delivery.

Responsibilities include:

Designing and enforcing SLOs, SLIs, and SLAs to ensure high reliability and performance.

Building and maintaining monitoring/observability solutions (Datadog, Grafana, Azure Application Insights, Log Analytics).

Managing Infrastructure as Code (Terraform, Pulumi, CloudFormation) for scalable, repeatable deployments.

Automating with PowerShell, Python, or Bash to drive efficiency.

Supporting Kubernetes and AKS environments in production.

Leading incident response, postmortems, and continuous improvement processes.

Driving cost optimisation, capacity planning, and load testing.

Championing best practices in cloud security and resilience.

Key Skills & Experience Required:

Proven Site Reliability Engineering background.

Strong Terraform skills with live environment deployment.

Kubernetes / AKS expertise.

Scripting in PowerShell, Python or Bash.

Monitoring experience (Datadog preferred, Azure or Grafana considered).

Background in web applications and distributed systems.

Desirable Skills:

Knowledge of Microservices Architecture.

Familiarity with Kanban.

Experience with Puppet or Chef.

If you’re passionate about Site Reliability Engineering and want to work in an environment where “that will do” is never good enough, this role is for you.



Site Reliability Engineer II

London, London American Express Global Business Travel

Posted today


Job Description

Amex GBT is a place where colleagues find inspiration in travel as a force for good and - through their work - can make an impact on our industry. We're here to help our colleagues achieve success and offer an inclusive and collaborative culture where your voice is valued.
**What You'll Do on a Typical Day:**
+ Design and implement next-generation, highly scalable, and reliable applications using SaaS technology.
+ Translate functional specifications into logical, component-based technical designs.
+ Own delivery of application features end to end by working with internal and external teams.
+ Innovate and implement new ideas to solve complex software problems.
+ Work closely with geographically distributed team members.
**What We're Looking For:**
+ Amex GBT Egencia's Technology organisation is looking for a highly motivated, self-driven self-starter with high growth potential to be part of a growing team of technologists. You are well-versed in SDLC and Agile methodologies.
+ You have at least 1-3 years of experience in software development and troubleshooting.
+ An independent thinker, who works around problems and who isn't shy of trying new technologies. You have validated experience working in parallel technologies apart from your core technology area (Java).
+ Prior experience in working harmoniously with a cross-geography team will be an added advantage. You should be equally comfortable in development, test, and debugging roles and be ready to wear many hats. This team values "fail-fast" learners and technology enthusiasts who view learning new technology as a fun experience.
+ Strong knowledge of Object Oriented Programming, Data Structures, and Algorithms
+ Good proficiency in any of the programming languages from Java, Golang, Python, or Bash
+ Proven ability to develop and support large-sized highly scalable software systems
+ Experience in AWS Services
+ Good knowledge of container orchestration frameworks primarily Kubernetes
+ Basic understanding of logging and monitoring frameworks
+ Knowledge of cloud computing concepts along with an understanding of application communication and routing is a plus
+ Good experience in developing and deploying AWS cloud-based platforms
+ Good understanding of network topologies with experience in hybrid cloud architecture will be a plus
+ Experience with the Agile Tool set and Programming Practices
+ Knowledge of CI-CD principles
+ Knowledge of server-side design patterns is a plus
+ Ability to quickly pick up new technologies and languages with ease
+ A standout colleague who collaborates and incorporates feedback from all partners
+ Excellent written and verbal communication skills
+ BS or MS in Computer Science or equivalent degree
#GBTJobs
**Location**
London, United Kingdom
**The #TeamGBT Experience**
Work and life: Find your happy medium at Amex GBT.
+ **Flexible benefits** are tailored to each country and start the day you do. These include health and welfare insurance plans, retirement programs, parental leave, adoption assistance, and wellbeing resources to support you and your immediate family.
+ **Travel perks:** get a choice of deals each week from major travel providers on everything from flights to hotels to cruises and car rentals.
+ **Develop the skills you want** when the time is right for you, with access to over 20,000 courses on our learning platform, leadership courses, and new job openings available to internal candidates first.
+ **We strive to champion Inclusion** in every aspect of our business at Amex GBT. You can connect with colleagues through our global INclusion Groups, centered around common identities or initiatives, to discuss challenges, obstacles, achievements, and drive company awareness and action.
+ And much more!
All applicants will receive equal consideration for employment without regard to age, sex, gender (and characteristics related to sex and gender), pregnancy (and related medical conditions), race, color, citizenship, religion, disability, or any other class or characteristic protected by law.
Click Here for Additional Disclosures in Accordance with the LA County Fair Chance Ordinance.
Furthermore, we are committed to providing reasonable accommodation to qualified individuals with disabilities. Please let your recruiter know if you need an accommodation at any point during the hiring process. For details regarding how we protect your data, please consult the Amex GBT Recruitment Privacy Statement.
**What if I don't meet every requirement?** If you're passionate about our mission and believe you'd be a phenomenal addition to our team, don't worry about "checking every box;" please apply anyway. You may be exactly the person we're looking for!

Site Reliability Engineer - Azure

London, London Vitesse PSP

Posted 17 days ago


Job Description

Permanent

Formed in 2014 by a team of proven FinTech entrepreneurs, we are an FCA-regulated business providing global claim funds management and payment solutions. Operating one of the largest banking and payment settlement networks in the world, we give our customers direct access to 200 countries and currencies. Through a single integration, insurers can use this network to pay claims in as little as 45 seconds and deliver a superior claimant experience. Our market-leading treasury proposition provides insurers with transparency and control over their claim funds, even when delegated to third parties, allowing them to have their money in the right place, at the right time, to make that all-important payment when customers need it most.

With over 260 employees across our London headquarters, Europe, and the US, $93m Series C funding secured, and exceeding £15bn in processed transactions, we are only just getting started.

We are collaborative, customer-centric and work with integrity, whilst partnering with some of the biggest insurance leaders including Lloyd's of London and Many Pets. We take huge pride in our company culture, ensuring that everyone has a part to play, an opportunity to be heard, be involved, and the ability to make a real difference. As we continue to scale up, we want like-minded humans to join us on this exciting journey.
Are you ready?

Your mission:
As a Site Reliability Engineer (SRE), you will play an important role in designing, building, and maintaining the infrastructure and tools necessary to support our software applications and services. You will collaborate closely with the product engineering squads, technical operations, and security teams to ensure the reliability, scalability, and security of our platform. Your responsibilities will include automating infrastructure provisioning, configuration management, and deployment pipelines, utilizing best practices and modern technologies to streamline processes and improve efficiency. You will also be responsible for monitoring system performance, identifying bottlenecks, and implementing solutions to enhance system reliability and performance.

Your responsibilities
Cloud Platform Management: Using Azure/AWS to manage and optimize infrastructure components, ensuring scalability, reliability, and cost management.

Infrastructure Design and Implementation: Designing, building and maintaining the cloud-based infrastructure that supports our software applications and services.

System Reliability: Ensuring the reliability, availability, and performance of systems and services by designing, implementing, and maintaining robust infrastructure.

Infrastructure as Code (IaC): Implementing and maintaining tools for automation, monitoring, and deployment to improve efficiency and reduce manual intervention.

Collaboration and Support: Working closely with product engineering to ensure efficient workflows and support continuous integration and delivery pipelines (CI/CD).

Capacity Planning and Scalability: Assessing system capacity requirements and planning for future growth to ensure the system can scale and is cost efficient.

Incident Response and Management: Monitoring system health, promptly responding to incidents, and assisting with the resolution process.

Risk Management: Identifying potential risks and vulnerabilities in systems and implementing measures to mitigate these risks effectively.

Monitoring and Observability: Implementing and overseeing monitoring tools to proactively detect and mitigate issues, ensuring high application and system availability.

Documentation and Knowledge Sharing: Maintaining documentation and sharing knowledge with the team to ensure transparency and facilitate cross-functional collaboration.

Requirements
3+ years of experience in an SRE, Platform/Cloud Engineer, or similar role.

Strong knowledge and experience in cloud platforms; we primarily host in Azure and AWS but recognize that skills are transferable.

Experience in running and maintaining highly available and scalable platforms.

Expertise in containerisation tools like Docker and orchestration tools such as Kubernetes.

Experience with infrastructure as code (IaC) tools such as Terraform, Ansible, or Chef for automation and configuration management.

Strong understanding of monitoring and observability tools.

Knowledge of networking, security principles, and best practices in a cloud environment. Cloudflare experience would be a bonus.

Demonstrated experience of CI/CD tools like GitHub Actions, GitLab CI/CD, or Azure DevOps for continuous integration and delivery.

Problem-solving mindset and meticulous attention to detail.

Strong collaboration and communication skills to work effectively with cross-functional, internationally distributed teams.

Comfortable working in a fast-paced environment, handling incidents, and participating in on-call rotations.

Adaptability to evolving technologies and eagerness to learn new tools and methodologies.

Benefits

25 days Holiday per year + Bank Holidays 

Hybrid working arrangements. 

Contributory pension scheme 

Enhanced parental leave.  

Cycle to Work Scheme 

Private Medical Insurance through Vitality 

Access to Oliva, our mental health therapy partner

Discounted Gym membership  

Financial Coaching with Octopus Wealth 

2 days of volunteering leave per year 

Sabbatical after 5 years’ service  

Ongoing Learning and Development to help you reach your career goals.

WE ARE AN EQUAL OPPORTUNITY EMPLOYER
We are committed to creating an inclusive environment that enables everyone to perform at their best, where we recognise the rights of all individuals to mutual respect and where there is an unbiased acceptance of others. Our policies and practices aim to promote an environment that is free from all forms of unfair discrimination and values the diversity of all people. At the heart of our policy, we seek to treat people fairly and with dignity and respect.


Site Reliability Engineer (UK)

London, London £30000 - £55000 annum WALT Labs

Posted 18 days ago


Job Description

Permanent

WALT Labs, a leading managed service provider, is dedicated to empowering businesses by harnessing the power of cloud technology. Our team specializes in delivering customized solutions tailored to meet the unique needs of our clients, driving growth and operational efficiency across industries. From supporting small businesses with seamless data migration to enabling large corporations to manage complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements.
We are seeking a skilled Site Reliability Engineer - UK with a strong focus on Google Cloud Platform (GCP) to join our dynamic team. In this role, you’ll be responsible for maintaining cloud infrastructure, managing incidents, and ensuring seamless operations for our clients. You’ll use tools like incident.io and JIRA to manage and resolve support requests efficiently.
This is an in-office role: Monday - Friday, 9 AM - 6 PM GMT / BST
Requirements
Qualifications for Site Reliability Engineer:

Proven experience with Google Cloud Platform (GCP) services - 3+ years. (Kubernetes a must!)

Understanding of Google Workspace (admin experience a plus)

Familiarity with incident.io for incident tracking and management (or equivalent)

Proficiency in using JIRA for task management and support workflows.

Strong experience working with observability tools (Grafana and Datadog)

Strong troubleshooting and problem-solving skills in cloud environments.

Understanding of cloud security and performance optimization best practices.

Knowledge of scripting or automation tools (e.g., Python, Terraform) is a plus.

Excellent written communication and customer service skills.

Certifications in GCP (e.g., Google Cloud Associate or Professional certifications) are highly desirable.

Ability to work under pressure and prioritize tasks effectively.

Bachelor’s degree in Computer Science, Information Technology, or related field (or equivalent experience).

Role responsibilities

Provide technical support and resolve issues related to Google Cloud Platform (GCP) services and AWS.

Provide client support for Google Workspace

Manage and respond to cloud incidents using incident.io, ensuring timely resolution.

Use JIRA to log, track, and prioritize support tickets and workflow tasks.

Monitor and maintain cloud infrastructure for performance, reliability, and security.

Collaborate with teams to identify and implement solutions to technical challenges.

Assist in deploying, configuring, and optimizing GCP resources.

Create and maintain documentation for troubleshooting processes and best practices.

Proactively identify opportunities to improve cloud environments and support processes.

Support clients and stakeholders by providing clear communication and updates during incident resolution.

Stay up-to-date with the latest GCP developments and contribute to team knowledge sharing.

Benefits

Private Medical Insurance

Paid Time Off that increases with longevity (additional 1.5 days every 2 years)

Professional development and advancement opportunities

Pension

Growth opportunities


Site Reliability Engineer - Remote

London, London ESL FACEIT Group

Posted 341 days ago


Job Description

Permanent

At EFG (ESL FACEIT Group) we create worlds beyond gameplay where players and fans become community. We pride ourselves in having a corporate social responsibility which is that “IT’S NOT GG (Good Game), UNTIL IT’S GG FOR ALL”. We are passionate about the culture we foster that ultimately helps to create and shape the world of esports, gaming tournaments, leagues, events and holistic ecosystems staged for our millions of players, fans and heroes.
The Team:
As a Site Reliability Engineer at EFG, you will be designing, analyzing, and troubleshooting large-scale distributed systems. You will demonstrate a systematic problem-solving approach, and the ability to debug and optimize code and to automate routine tasks. You will ensure that EFG’s services and systems are reliable, that they have uptime appropriate to users' needs and they have a fast rate of improvement.
Apart from monitoring our systems' capacity and performance, you will also focus on optimizing existing systems, on building infrastructure and on eliminating work through automation. You will work collaboratively with the software engineering teams to deploy and operate our systems, and you will help to automate and streamline our operations and processes. Within this role, you will be given real responsibilities, and you have the opportunity to drive change and have a big impact on our products and platform.

What you will do:
Maintaining and improving the monitoring and observability tools (Grafana/Prometheus/Thanos/Jaeger);

Working closely with your team and with other cross-functional teams to help design, maintain and operate systems at scale;

Developing and driving adoption of SRE best practices across the company;

Leading on incident management process and adoption;

Using your troubleshooting skills to help identify and fix operational issues;

Working with Cloud Native technologies such as Kubernetes, Envoy, Istio, Prometheus and Helm;

Working with the “Hashi Stack” (Terraform, Packer, Vault);

Experimenting with and introducing cutting edge technologies.

Requirements
Proven experience as a Site Reliability Engineer, DevXP Engineer or Software Engineer, focusing on building and maintaining scalable infrastructures;

Excellent working knowledge of at least one of the major cloud providers (GCP/AWS/Azure);

You have experience with cluster management systems (Kubernetes);

Knowledge of incident management: ability to investigate, troubleshoot, recover and prevent the recurrence of incidents that interfere with the normal delivery of IT services;

Proficient in Go, with some level of proficiency in at least one other language: Java, Python, Rust…;

You have knowledge of GitOps practices;

You have production-scale experience with one of the following: MongoDB, Redis, MySQL;

Experience contributing to open source technologies would be an added bonus.


Site Reliability Engineer, Region Services

London, London Amazon

Posted today


Job Description

Description
Would you like to help implement innovative cloud computing solutions and solve the most complex technical problems? Are you excited by the prospect of building and running the world's largest cloud computing infrastructure to provide a better world for future generations?
Amazon Web Services (AWS) builds and operates some of the largest internet infrastructure on the planet; providing companies of all sizes with an infrastructure web services platform in the cloud. With AWS, customers provision compute power, storage, database, and other cloud resources as their business demands them. To meet the growing demand for AWS Services around the globe, we need exceptionally motivated people who are driven by learning and innovation.
AWS Utility Computing (UC) provides product innovations - from foundational services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS's services and features apart in the industry. As a member of the UC organization, you'll support the development and management of Compute, Database, Storage, Internet of Things (IoT), Platform, and Productivity Apps services in AWS, including support for customers who require specialized security solutions for their cloud services.
If you join us, you'll be part of a world-class team in a dynamic environment that has the entrepreneurial feel of a start-up. This is an opportunity to operate and engineer systems on a massive scale, and to gain world class experience in cloud computing. You'll be surrounded by people who are passionate about cloud computing, believe that first class service is critical to customer success, and are committed to improvement.
Top reasons to join our team:
- Be a catalyst to deliver truly disruptive products that are growing rapidly
- Define, build, own, and run services in high growth environments
- Solve unique and first-order problems to enable our internal teams to deliver for our customers
- Build and operate distributed systems
- Design and build the tools and utilities that are part of the AWS fleet running our internal services
Key job responsibilities
The Systems Development Engineer will be a key member of a new team pioneering automated build and deployment of Windows-based services. The team is adopting a code-first, hands-off, CI/CD-based approach to drive operational excellence and cross-environment parity. This will involve building Ansible-based Infrastructure as Code as well as custom integrations with existing and new Windows services.
About the team
About AWS
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the preferred qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn't followed a traditional path, or includes alternative experiences, don't let it stop you from applying.
Why AWS?
Amazon Web Services (AWS) is the world's most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating - that's why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Inclusive Team Culture
AWS values curiosity and connection. Our employee-led and company-sponsored affinity groups promote inclusion and empower our people to take pride in what makes us unique. Our inclusion events foster stronger, more collaborative teams. Our continual innovation is fueled by the bold ideas, fresh perspectives, and passionate voices our teams bring to everything we do.
Mentorship & Career Growth
We're continuously raising our performance bar as we strive to become Earth's Best Employer. That's why you'll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why we strive for flexibility as part of our working culture. When we feel supported in the workplace and at home, there's nothing we can't achieve.
Basic Qualifications
- Knowledge of systems engineering fundamentals (networking, storage, operating systems)
- Experience (non-internship) in professional software development
- Experience designing or architecting (design patterns, reliability and scaling) of new and existing systems
- Experience in networking, storage systems, operating systems and hands-on systems engineering
- Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby
Preferred Qualifications
- Experience with Ansible (preferred), PowerShell or JavaScript/TypeScript
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice to learn more about how we collect, use and transfer the personal data of our candidates.
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.


Senior Site Reliability Engineer I

Farringdon, London RELX INC

Posted today


Job Description

Senior Site Reliability Engineer
Are you enthusiastic about designing and managing cloud platforms? Do you find satisfaction in ensuring the reliability and performance of complex systems?
About the Team:
The LexisNexis Intellectual Property (IP) division provides international patent content and a suite of online and analytic tools that meet the evolving needs of the intellectual property market. We deliver data to support LexisNexis IP search and analytics applications, empowering our customers with actionable insights and metrics for critical business decisions.
Our corporate culture thrives on excellence, innovation, and a strong dedication to our customers, employees, and communities. Working here means joining a vibrant, diverse, and collaborative team where you are free to grow and contribute actively.
About the Role:
We are a high-performing systems engineering team operating in a fast-paced enterprise environment, focused on modernising our infrastructure while upholding strict security and compliance standards. This position provides assistance and input to management, develops and leads large multifunctional development activities, solves complex technical problems, writes complex code for computer systems, and serves as a senior source of expertise.
Requirements:
+ Deep knowledge of cloud services (e.g., EC2, S3, RDS, Lambda, Azure VMs, Azure Functions).
+ Good experience in Cloud Engineering with a strong focus on Azure and/or AWS
+ Experience with Infrastructure as Code (Terraform, ARM/BICEP).
+ Proficiency in containerization and orchestration tools (Docker, Kubernetes/EKS).
+ Skilled in scripting languages (Python, Bash, TypeScript, PowerShell).
+ Strong understanding of Linux/UNIX/Windows systems and storage.
+ Experience with monitoring tools (Datadog, Coralogix, CloudWatch, Azure Monitor).
+ Familiarity with SRE and DevOps practices.
+ Knowledge of networking and security best practices.
+ Excellent problem-solving and stakeholder management skills.
+ Databricks Knowledge is an added advantage.
Responsibilities:
+ Leading Kubernetes deployment and management, including orchestration, architecture, networking, CI/CD, storage, and security.
+ Collaborating with cross-functional teams to design and implement high-quality cloud solutions.
+ Administering and supporting Databricks environments, including permissions, storage, and networking.
+ Troubleshooting complex technical issues using observability tools and root-cause analysis.
+ Implementing infrastructure management best practices and automating repetitive tasks.
+ Supporting program installations, system configurations, and user modifications.
+ Refining system monitoring and reporting in collaboration with support teams.
+ Operating across Agile and Waterfall methodologies to deliver timely solutions.
+ Mentoring junior team members and contributing to a culture of continuous learning.
Why Join Us?
Join our team and contribute to a culture of innovation, collaboration, and excellence. If you are ready to advance your career and make a significant impact, we encourage you to apply.
Work in a way that works for you
We promote a healthy work/life balance across the organisation. We offer an appealing working prospect for our people. With numerous wellbeing initiatives, shared parental leave, study assistance and sabbaticals, we will help you meet your immediate responsibilities and your long-term goals.
+ Working flexible hours - flexing the times when you work in the day to help you fit everything in and work when you are the most productive.
Working for you
We know that your well-being and happiness are key to a long and successful career. These are some of the benefits we are delighted to offer:
+ Annual Profit Share Bonus
+ Comprehensive Pension Plan
+ Home, office or commuting allowance
+ Generous vacation entitlement and option for sabbatical leave
+ Maternity, Paternity, Adoption and Family Care leave
+ Internal communities and networks
+ Recruitment introduction reward
About Our Business
The LexisNexis Intellectual Property (IP) division provides international patent content and a suite of online and analytic tools that meet the evolving needs of the intellectual property market. We deliver data to support LexisNexis IP search and analytics applications, empowering our customers with actionable insights and metrics for critical business decisions.
We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form.
Criminals may pose as recruiters asking for money or personal information. We never request money or banking details from job applicants. Learn more about spotting and avoiding scams here.
Please read our Candidate Privacy Policy.
We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law.
USA Job Seekers: EEO Know Your Rights.
RELX is a global provider of information-based analytics and decision tools for professional and business customers, enabling them to make better decisions, get better results and be more productive.
Our purpose is to benefit society by developing products that help researchers advance scientific knowledge; doctors and nurses improve the lives of patients; lawyers promote the rule of law and achieve justice and fair results for their clients; businesses and governments prevent fraud; consumers access financial services and get fair prices on insurance; and customers learn about markets and complete transactions.
Our purpose guides our actions beyond the products that we develop. It defines us as a company. Every day across RELX our employees are inspired to undertake initiatives that make unique contributions to society and the communities in which we operate.


Senior Site Reliability Engineer (SRE)

SW1A 0AA London, London £70000 Annually WhatJobs Direct

Posted 2 days ago

Job Description

full-time

Our client, a rapidly growing technology firm, is seeking a highly skilled and experienced Senior Site Reliability Engineer (SRE) to join their engineering team. This role is critical for ensuring the availability, performance, and scalability of their production systems. As an SRE, you will bridge the gap between software development and IT operations, applying software engineering principles to infrastructure and operations problems. This is a fully remote position, offering the opportunity to work with cutting-edge technologies and contribute to a culture of continuous improvement.

Responsibilities:
Design, build, and maintain scalable and reliable infrastructure on cloud platforms (e.g., AWS, Azure, GCP).
Develop and implement automation tools and scripts to improve system efficiency and reduce manual intervention.
Monitor system performance, identify bottlenecks, and implement solutions to enhance stability and speed.
Respond to and resolve production incidents, performing root cause analysis and implementing preventative measures.
Collaborate with development teams to ensure applications are designed for reliability and scalability.
Implement and manage CI/CD pipelines for seamless software deployments.
Develop and maintain comprehensive documentation for infrastructure and operational procedures.
Participate in on-call rotations to provide 24/7 support for critical systems.
Contribute to the design and implementation of disaster recovery and business continuity plans.
Mentor junior engineers and share best practices in SRE and DevOps.
Champion a culture of reliability and operational excellence throughout the engineering organization.

Qualifications:
Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
5+ years of experience in SRE, DevOps, or a related infrastructure engineering role.
Strong proficiency in at least one programming language (e.g., Python, Go, Java).
Extensive experience with cloud platforms (AWS, Azure, or GCP) and containerization technologies (Docker, Kubernetes).
Deep understanding of networking concepts, protocols, and security.
Experience with monitoring and alerting tools (e.g., Prometheus, Grafana, Datadog).
Proven experience with CI/CD tools and practices.
Strong problem-solving and debugging skills.
Excellent communication and collaboration skills, with the ability to work effectively in a remote team.
Experience with infrastructure as code (IaC) tools like Terraform or Ansible is a plus.
Familiarity with distributed systems and microservices architectures.

This is a remote position, offering the chance to work on challenging projects from London, England, UK or any location within the UK. We are looking for talented individuals who are passionate about building highly available and performant systems.

