Storage (HPC) DevOps Engineer
Location: Madrid, Spain - Hybrid
The position will involve working within product teams to deliver the best class scientific computing platforms, in partnership with our scientists and service providers. Knowledge of parallel file systems and high performance storage platforms, Linux system administration, scripting and a DevOps approach to platform administration is essential.
Your day-to-day activities involve solving issues and user requests, ensuring everything runs smoothly. You immerse yourself in the creative process of developing cutting-edge features, enhancing existing functionalities, and tackling technical debt to keep our systems robust.
Daily stand-ups and collaborative product planning sessions fuel your innovation and teamwork, empowering you to drive impactful projects and shape the future of our technology landscape.
Job Responsibilities
1. Contribute to activities passionate about availability, tuning, performance, efficiency, change management, monitoring, emergency response and capacity planning.
2. Engage in and improve, with low mentorship, the whole lifecycle of services—from inception and design through deployment, operation and refinement.
3. Monitor and resolve incidents/problems with platform operations, suggesting priorities and collaborating in the resolution when required.
4. Contribute to support services before they go live through activities such as infrastructure design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
5. Scale systems sustainably through mechanisms like automation, and evolve systems by proposing changes that improve reliability and velocity.
6. Contribute to the maintenance of services once they are in production by measuring and supervising availability, latency and overall system health.
7. Look for continuous improvement activities both in technical, teamwork, collaboration and processes areas. Propose and contribute to continuous improvement activities.
Who are you?
Curious and willing to learn, experience with Parallel Filesystem (e.g. Lustre, GPFS, BeeGFS), or Object storage (NetApp StorageGrid, S3), background in Linux Server technologies, proficiency in infrastructure as code, scripting and automation (AWX, Ansible, GitLab, Python, YAML), good communication and problem-solving skills, and the ability to work optimally in a fast-paced environment.
A Bachelor's or Master's degree with relevant work experience is preferred. At least 2 years experience of working in one or more multinational work environments (e.g. healthcare industry experience is a plus) as a systems or software engineer. Ability to work across multiple time zones, including on-call and occasionally travel.
About Roche
Roche employs 100,000 people in 100 countries who are pushing the boundaries in healthcare. Working together, we have become one of the world leaders in research groups. Our success is built on innovation, curiosity, and diversity.
#J-18808-Ljbffr