DevOps Monitoring Engineer

Type Other
Seniority Mid-Level
Posted Mar 12, 2026

Richemont is hiring a DevOps Monitoring Engineer in Moscavide, Portugal to design and operate production observability and incident response systems.

Overview

Richemont is a leading international luxury goods group that develops, manages and distributes a portfolio of prestigious maisons including names such as Cartier, Montblanc and Van Cleef & Arpels. The group combines craftsmanship, heritage and contemporary retail and digital operations across global markets, and maintains regional technology and support centres to enable its maisons' commercial and digital ambitions.

Role & Responsibilities

  • Design, implement and maintain scalable monitoring and observability platforms to ensure service reliability and performance across cloud and on-premise environments.
  • Build and maintain dashboards, alerting rules and SLOs to enable rapid detection and resolution of production incidents.
  • Collaborate with platform, application and security teams to instrument services, define meaningful metrics, and improve system observability.
  • Automate deployment and management of monitoring agents, exporters and integrations using IaC and configuration management.
  • Lead incident triage, root-cause analysis and post-mortems; own remediation and continuous improvement actions to reduce recurrence.
  • Evaluate and introduce new monitoring, logging and tracing tools and best practices to advance the observability maturity of the estate.
  • Contribute to capacity planning, cost optimisation and operational runbooks for monitoring services.

Qualifications

  • Proven track record implementing and operating monitoring/observability solutions in production environments.
  • Strong working knowledge of metrics, logs and distributed tracing concepts, and how they inform SRE practices.
  • Experience automating infrastructure and configuration changes using Infrastructure-as-Code and configuration management.
  • Demonstrated ability to lead incident response and perform thorough root-cause analysis.
  • Excellent communication skills and ability to collaborate with engineering teams and non-technical stakeholders.

Skills

Prometheus Grafana Elastic Stack (Elasticsearch, Logstash, Kibana) Datadog Splunk OpenTelemetry Kubernetes Docker AWS Azure Terraform Ansible Python Bash Git Jenkins/GitLab CI

Experience

Minimum 3 years of hands-on experience in DevOps, site reliability or monitoring engineering roles, with demonstrable ownership of production observability platforms and incident response.

Education

Bachelor's degree in Computer Science, Software Engineering, Information Systems or equivalent practical experience.

Culture

Richemont combines heritage craftsmanship with a fast-evolving digital and retail organisation; teams work cross-functionally to support high standards of product quality, client experience and operational excellence. Technology teams are expected to be collaborative, pragmatic and oriented toward continuous improvement while respecting the maisons' brand values.