CM Group Logo

CM Group

Principal DevOps/SRE Engineer

Job Posted 21 Days Ago Posted 21 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in United Kingdom
Senior level
Remote
Hiring Remotely in United Kingdom
Senior level
The Principal DevOps/SRE Engineer will enhance SRE culture, automate tasks, troubleshoot issues in cloud-based environments, monitor application performance, and collaborate with engineering teams to uphold non-functional requirements. Responsibilities include documentation, maintenance of backend infrastructure, and participation in an on-call rotation.
The summary above was generated by AI

The Company:
Marigold helps brands foster customer relationships through the science and art of connection. Marigold Relationship Marketing is a suite of world-class martech solutions that help marketers create long term customer love and loyalty. Marigold provides the most comprehensive set of use cases for marketers at any level. Headquartered in Nashville, Tennessee, Marigold has offices globally across the United States, Europe, Australia, New Zealand, South America and Central America, as well as in Japan.

What You’ll Do

  • Help build a Site Reliability Engineering culture by sharing your best practices, approaches, documentation, and code with other engineering teams

  • Apply automation and software to any tasks or parts of the system that would benefit from it or are performed manually

  • Troubleshoot complicated issues handling OS, Networking, Database in a cloud-based SaaS environment/on-premises environment and handle live production incidents, debug/troubleshoot application and infrastructure issues, follow and implement SRE best practices

  • Monitor application performance, take steps to improve overall application performance and stability and follow through with implementation

  • Conduct system analysis, configuration management and develop improvements for system software performance, availability and reliability

  • Work closely with software and QA engineers to ensure the system is responding properly to non-functional requirements such as performance, security, and availability

  • Document your system knowledge as you acquire it over time, create runbooks, and ensure critical system information is readily available to those who need it

  • Maintain and monitor deployments, orchestration, databases, and general backend infrastructure

  • Keep up-to-date with security and proactively identify, diagnose, and solve complex security issues.

  • Be part of an on-call rotation to support the global platform providing an excellent customer experience

Ideal Qualifications:

  • Degree in Computer Science or equivalent combination of education and experience

  • 7+ yrs experience in DevOps or SRE role

  • 7+ yrs Linux experience

  • 5+ years managing production environments in AWS

  • 5+ years experience in Kubernetes preferably EKS

  • 3+ years creating and maintaining infrastructures with Terraform

  • Experience using infrastructure as code principles to design, build and maintain cloud platforms using Terraform/OpenToFu

  • Experience working with database and data store technologies such as RDS/MySQL, Elasticache/Redis or equivalent

  • Knowledge of core server-side concepts and experience working with cloud networking, load balancers, HTTP or GRPC protocols, and large scale microservice environments

  • Experience with observability stacks, instrumenting environments for logging and monitoring and building and designing dashboards and alerts

  • Knowledge of DevOps methodologies, basic programming and the tools involved in CI/CD automation

Nice to Have:

  • Experience managing high scale web application platforms or SaaS platforms

  • Strong Kubernetes, EKS or ECS/Fargate experience 

  • Deep understanding of security principles

  • History of contributing to FOSS projects

  • Experience with AWS networking concepts such as VPC peering, Transit Gateway

  • Experience with multi-geography, multi-tenant applications 

  • Experience designing and performing disaster recovery

  • Experience programming with Go or Python

  • Experience with cost management

  • Experience with NoSQL databases such as ScyllaDB.

  • Experience working with Stream processing and big data technology stacks such as  Kafka or Trino

What We Offer:

  • The competitive salary and benefits you’d expect!

  • Generous time off (we call it Open Time Away) as well as paid holidays and a birthday benefit day off.

  • Retirement contributions. 

  • Employee-centric and supportive remote work environment with flexibility.

  • Support for life events including paid parental leave.

Top Skills

AWS
Ci/Cd
DevOps
Eks
Elasticache
Go
Kafka
Kubernetes
Linux
MySQL
Python
Rds
Redis
Sre
Terraform
Trino

Similar Jobs

8 Days Ago
Remote
Hybrid
Belfast, County Antrim, Northern Ireland, GBR
Senior level
Senior level
Artificial Intelligence • Cloud • Information Technology • Sales • Security • Software • Cybersecurity
As a Senior DevOps Engineer, you will enhance the security and reliability of our microservices infrastructure. Responsibilities include automating operations, managing production infrastructure (Kubernetes, Cassandra), mentoring team members, and developing security measures for customer data while ensuring best practices in AWS.
Top Skills: AnsibleArgoAWSBashCassandraElasticsearchGitGroovyIstioJavaJenkinsKarpenterKongKubernetesLinuxPodmanPythonRubySpinnakerTerraform
17 Days Ago
Easy Apply
Remote
29 Locations
Easy Apply
Mid level
Mid level
Cloud • Security • Software • Cybersecurity • Automation
As an Intermediate Site Reliability Engineer for FinOps at GitLab, you will ensure reliability and cost-efficiency of services by integrating FinOps principles. You will automate cost management tasks, collaborate with finance and engineering teams to optimize cloud costs, and ensure financial compliance while contributing to high-quality service delivery.
Top Skills: AnsibleAWSElkGCPGitlabHelm ChartsOmnibus GitlabPrometheusTerraform
21 Days Ago
Remote
Hybrid
United Kingdom
Senior level
Senior level
Artificial Intelligence • Cloud • Sales • Security • Software • Cybersecurity • Data Privacy
As a Senior DevOps Engineer at SailPoint, you'll build and maintain secure SaaS applications within the Non-Employee Risk Management team, automating deployment and monitoring, troubleshooting issues, and collaborating with developers to enhance operational efficiency while ensuring compliance with security standards.
Top Skills: AWSCi/CdDevOpsGithub ActionsHashistackKafkaKubernetesLinuxMongoDBPythonRedisRuby On RailsSQL

What you need to know about the Edinburgh Tech Scene

From traditional pubs and centuries-old universities to sleek shopping malls and glass-paneled office buildings, Edinburgh's architecture reflects its unique blend of history and modernity. But the fusion of past and future isn't just visible in its buildings; it's also shaping the city's economy. Named the United Kingdom's leading technology ecosystem outside of London, Edinburgh plays host to major global companies like Apple and Adobe, as well as a growing number of innovative startups in fields like cybersecurity, finance and healthcare.
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account