We are looking for a highly skilled and experienced Senior Observability Engineer to join one of our leading clients across the EU.
The ideal candidate will have a deep understanding of observability tools, automation development, and cloud frameworks, with a specific focus on OpenTelemetry, Dynatrace, Grafana, Honeycomb, Gremlin, and cloud platforms such as AWS and Azure. This role is perfect for someone who thrives in an environment where both on-premises and cloud-based solutions are implemented and optimized.
You will play a key role in ensuring the performance and reliability of complex systems by implementing observability solutions, automating processes, and addressing intricate technical challenges.
Key Responsibilities:
* Lead the design, implementation, and management of observability solutions both on-prem and in the cloud, with a focus on OpenTelemetry frameworks.
* Collaborate with development teams to integrate observability practices into the software development lifecycle.
* Automate manual tasks and processes to optimize workflow and reduce human intervention.
* Develop custom instrumentation, metrics, and logs to enhance the monitoring capabilities of systems and applications.
* Troubleshoot and resolve complex system issues using observability tools to ensure high availability and optimal performance.
* Establish governance and standards for implementing observability across the organization.
* Stay current with the latest developments in observability tools, particularly those compatible with OpenTelemetry.
* Work across cloud platforms (AWS and Azure) to deliver robust observability solutions.
Requirements:
* Minimum of 7 years of experience in cloud, observability, or a related field.
* Proven expertise in implementing and managing OpenTelemetry solutions.
* Strong hands-on experience with cloud-based monitoring, troubleshooting, and observability tools.
* Proficiency in Kubernetes and cloud-based architecture, particularly within AWS and Azure environments.
* Hands-on experience in scripting, automation, and integrating observability tools within cloud and on-prem environments.
* Degree in Computer Science, Information Technology, or a related field.
* Strong communication skills, both written and verbal, with the ability to collaborate effectively in remote and on-prem settings.
Essential Qualifications:
* Expertise in Terraform and experience with infrastructure as code.
* Strong knowledge of DevOps practices and CI/CD pipelines.
* Experience in troubleshooting complex technical issues using observability tools.
* Demonstrated ability to lead and manage a team of developers.
* Ability to think innovatively and provide solutions to complex challenges.
Technologies/Skills:
* OpenTelemetry, Dynatrace, Grafana, Honeycomb, Gremlin
* Kubernetes
* AWS and Azure cloud platforms
* Terraform
* Scripting and automation languages (e.g., Python, Bash)
* CI/CD and DevOps practices
* Strong problem-solving and troubleshooting skills
Languages & Levels:
* English (C1)
Minimum Experience:
* At least 7 to 10 years