Iron Mountain Logo

Iron Mountain

DevOps Engineer

Posted 2 Days Ago
Be an Early Applicant
Scotland
Entry level
Scotland
Entry level
As a DevOps Engineer at Iron Mountain, you'll provide technical support for applications and hardware, manage observability strategies, and automate infrastructure provisioning. Your responsibilities will include optimizing application performance, ensuring system uptime, and implementing security best practices. You'll also troubleshoot incidents and collaborate with teams to resolve system issues efficiently.
The summary above was generated by AI

At Iron Mountain we know that work, when done well, makes a positive impact for our customers, our employees, and our planet. That’s why we need smart, committed people to join us. Whether you’re looking to start your career or make a change, talk to us and see how you can elevate the power of your work at Iron Mountain.

We provide expert, sustainable solutions in records and information management, digital transformation services, data centers, asset lifecycle management, and fine art storage, handling, and logistics. We proudly partner every day with our 225,000 customers around the world to preserve their invaluable artifacts, extract more from their inventory, and protect their data privacy in innovative and socially responsible ways. 

Are you curious about being part of our growth stor​y while evolving your skills in a culture that will welcome your unique contributions? If so, let's start the conversation.

THE OPPORTUNITY

Title:  DevOps Engineer

Location: UK, 100% remote

Full time, permanent role

Global Technology and Innovation:

Driving performance and growth through people, innovation, security, and new ways of working, Global Technology and Innovation provides secure and stable infrastructure, competitively differentiated solutions, innovative technology platforms, and business operations for Iron Mountain.

Job summary:

As a DevOps Engineer Professional at Iron Mountain, you will play a crucial role in providing technical support for our computer applications and hardware, including PCs, servers, and mainframes.

You will be responsible for answering system-related queries, collaborating with network services, software systems engineering, and application development teams to restore service and identify issues.

Your role will also involve managing the observability strategy, application sustainment, and global service
delivery to enhance the performance and reliability of our application.

Your role in our mission:

  • Design, build, and maintain cloud infrastructure (e.g., AWS, GCP, Azure)
  • Automate infrastructure provisioning using tools like Terraform, Ansible, or
    CloudFormation
  • Develop and maintain continuous integration/continuous deployment pipelines
  • Work with developers to streamline deployment processes using tools like Jenkins,
    GitLab
  • Analyze system performance and plan for future capacity needs
  • Optimize application performance by fine-tuning system configurations and identifying
    bottlenecks
  • Write scripts and tools to automate routine tasks (e.g., system updates, scaling)
  • Develop self-healing systems that can automatically resolve issues
  • Design and implement backup and disaster recovery strategies
  • Ensure systems are built with redundancy to handle failures and maintain uptime
  • Implement security best practices for systems and applications
  • Regularly perform security audits and manage patching for vulnerabilities

Troubleshooting:

  • Respond to incidents, outages, and degraded performance in real-time
  • Investigate root causes and implement corrective actions to prevent future issues
  • Participate in on-call rotations to handle production issues, escalating as needed
  • Document issues and incidents in postmortems, identifying key learnings
  • Diagnose and resolve complex infrastructure and application issues (e.g., network latency, database performance)
  • Use debugging tools (e.g., tcpdump, strace) to analyze system-level problems.
  • Analyze logs and metrics to identify patterns and root causes of failures
  • Use log aggregation tools (e.g., ELK stack, Splunk) to efficiently search for issues
  • Diagnose and troubleshoot network issues, such as DNS resolution failures, packet loss,
    or connectivity problems
  • Understand the interplay between network architecture and application performance.

Observability & Performance Management:

  • Implement and maintain monitoring systems (e.g., Prometheus, Grafana,
    Datadog)
  • Ensure proper logging and metrics collection for infrastructure and applications.
  • Manage the observability strategy with Engineering/Development and SRE teams
    to enhance application availability, performance, and reliability
  • Define and manage log-based metrics, alerts, and dashboards using Datadog.
  • Support applications built with Google Cloud logging, Identity & Access Management, Cloud network, and projects
     

Application Sustainment & Global Service Delivery:

  • Ensure continuous availability of critical applications, monitoring uptime and
    performance against SLOs
  • Collaborate with development teams to ensure that new releases are stable and
    do not introduce regressions
  • Lead efforts in troubleshooting and resolving application-related incidents
  • Oversee the lifecycle management of applications, including upgrades, patches,
    and version rollouts
  • Ensure compatibility between application versions and underlying infrastructure
  • Maintain thorough documentation of application architectures, configurations, and
    known issues
  • Develop and update a knowledge base to provide guidelines for resolving
    common application issues
  • Plan and implement strategies for scaling services to meet growing demand in
    specific regions
  • Work with product and engineering teams to ensure that infrastructure is capable
    of supporting future growth globally
     

Required Skills and Experience:

  • Bachelor's Degree in Computer Science, Engineering, or related field (4 years
  • degree)
  • Strong experience in Information Technology
  • Experience with Agile Scrum methodologies
  • Cloud Platforms: Amazon Web Services (AWS), Google Cloud Platform (GCP),
    Microsoft Azure
  • Motivated individual, who learns quickly, has pride in building a new product and
    can engage others to accelerate technical solutions
  • Familiarity with implementation design patterns and performance challenges
    involved in supporting a globally available Saas product

Minimum Skills & Qualifications:

  • Experience in working with remote distributed teams
  • Experience with CI/CD tooling such as Terraform, Helm, Jenkins, ArgoCD,
    GitLab CI/CD, Maven, Artifactory
  • Experience utilizing, building and optimizing observability stack with
    Grafana/Datadog/Prometheus and alerting and notification stacks like, OpsGenie
  • Experience with Monitoring and Logging: Prometheus, Grafana, ELK Stack
    (Elasticsearch, Logstash, Kibana), Kafka, Loki, Cloudwatch
  • Experience interfacing with and deploying services to cloud platforms
  • Extensive experience working with managed Kubernetes services in the Cloud
    (AWS, GCP and Azure)
  • Experience in developing scalable micro services and API gateway
  • Experience with server oriented architectures and web platform applications with
    ability to define and integrate with APIs and REST services
  • Deep Understanding of cloud networking concepts such as VPC, ingress,
    subnetting, TCP/IP, DNS, Load Balancers, network topologies, subnetting, and
    CIDR notation
  • Experience in Software Application Development using Python, Java, or .Net
  • Familiarity with secret management platforms like Thycotic/Delinea Secret Server, IKeyless, and/or Thycotic DevOps Secret Vault.

Additional Requirements:

  • Strong problem-solving skills and ability to work under pressure
  • Excellent communication and collaboration skills
  • Ability to analyze complex data and situations to make informed decisions

Certifications:  optional: Scrum Master/PMP Certification / Agile SAFe certification.

Discover what awaits you:

  • Discover Limitless Possibilities: Embark on an exciting journey with Iron Mountain, a global organization that embraces transformation and innovation.
  • Empowering Inclusion: Join a supportive environment where everyone's voice is heard, opinions are valued, and feedback is encouraged, fostering an atmosphere of inclusion and belonging.
  • Global Connectivity: Connect with 26,000+ talented individuals from 59 countries, opening doors to diverse cultures and fostering global learning opportunities.
  • Championing Individuality: Be part of a winning team that celebrates diversity and encourages individual differences to drive greatness.
  • Competitive Total Rewards: supporting your career at Iron Mountain, family, personal wellness, and wellbeing. (Local benefits may vary based on country-specific policies.)
  • Embrace Flexibility: Experience the freedom of remote/hybrid work, enabling a harmonious work-life balance (dependent on role).
  • Unleash Your Potential: Access abundant opportunities for personal and professional growth, preparing you for a digitalized future.
  • Valuing Every Contribution: Join a workplace that actively encourages and supports all talents, recognizing the unique impact of each individual.
  • Pioneering Sustainability: Contribute to our vision of fostering a sustainable and thriving workforce, leaving an enduring legacy for generations to come.

 #LI-Remote

Category: Information Technology

Top Skills

AWS
Azure
GCP

Similar Jobs

Be an Early Applicant
9 Days Ago
Glasgow, City of Glasgow, Scotland, GBR
Hybrid
289,097 Employees
Senior level
289,097 Employees
Senior level
Financial Services
As a DevOps Engineer III, you will enhance reliability, monitoring, and automation within the infrastructure, collaborating with development teams to improve application performance and ensuring quality deployments via CI/CD pipelines. You will also manage cloud infrastructure and respond to incidents, contributing to organizational goals.
Be an Early Applicant
7 Days Ago
Glasgow, City of Glasgow, Scotland, GBR
113 Employees
Mid level
113 Employees
Mid level
Software • Energy
As a DevOps Engineer, you'll support and maintain CI/CD pipelines on AWS and Azure, provide training to teams, manage DevOps tools, and drive automation. You'll collaborate with the technology team to enhance infrastructure and integrate security practices into development processes.
Be an Early Applicant
2 Days Ago
Glasgow, City of Glasgow, Scotland, GBR
Hybrid
289,097 Employees
Senior level
289,097 Employees
Senior level
Financial Services
As a Fullstack Java Software Engineer III, you will design and implement features for a strategic data platform, ensuring the stability and security of cloud-based applications. You'll be responsible for producing architecture artifacts, troubleshooting technical issues, and adhering to best practices in software development. This role requires collaboration within an agile team and a focus on continuous learning and improvement.

What you need to know about the Edinburgh Tech Scene

From traditional pubs and centuries-old universities to sleek shopping malls and glass-paneled office buildings, Edinburgh's architecture reflects its unique blend of history and modernity. But the fusion of past and future isn't just visible in its buildings; it's also shaping the city's economy. Named the United Kingdom's leading technology ecosystem outside of London, Edinburgh plays host to major global companies like Apple and Adobe, as well as a growing number of innovative startups in fields like cybersecurity, finance and healthcare.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account