Site Reliability Engineer
Oracle Health Applications Infrastructure (OHAI) is hiring in its OHAI Platform Production Engineering organization!
Are you a creative person who loves a challenge? Solve the complex puzzles youve been dreaming of as our Site Reliability Engineer. If you have a passion for innovation in tech we want you on our team! Thrive in this crucial role. Oracle is a technology leader thats changing how the world does business. Were looking for an experienced and self-motivated person. We appreciate you taking the time to review the list of qualifications and to apply for the position. Come and join us!
A unique opportunity to join a rapidly growing world-class team to engineer groundbreaking Oracle Cloud technologies and infrastructure that make up the Oracle Cloud solutions.
The ideal candidate for this engaging and visible technical leadership role would have the experience of a developer the wits of a systems and infrastructure whiz and the courage of a spirited closer. All these qualities bundled up in an affable communicator in order to make our Oracle Cloud customers successful.
As a global leader were looking for a Site Reliability Engineer to drive success as part of our OHAI team. Join us and create the future.
Responsibilities
What you will do
As a Site Reliability Engineer (SRE) you will solve exciting technical challenges by defining designing deploying and troubleshooting key Oracle Health Hosting and Cloud services platforms and infrastructure always thinking about reliability scalability resilience security and performance.
You will be part of a team of SREs whose mission is the shared full stack ownership of a collection of cloud services and technologies areas integral to the support of medical institutions across the world.
* Ensure System Reliability Monitor and maintain the health of our production environments implementing strategies to achieve high availability and minimal downtime.
* Incident Management Quickly identify and resolve incidents conducting thorough post-mortem analyses to prevent future occurrences and improve our response processes.
* Performance Optimization Analyze system performance metrics to identify bottlenecks and develop solutions that enhance system efficiency and scalability.
* Automation Tooling Create and maintain automated processes for deployment monitoring and incident response to streamline operations and reduce manual intervention.
* Infrastructure as Code Utilize tools like Terraform or CloudFormation to define and manage infrastructure ensuring consistency and repeatability across environments.
* Capacity Planning Collaborate with teams to forecast future system demands and implement scalable solutions that meet growing user needs.
* Collaboration Communication Work closely with development and product teams to integrate reliability into the software development lifecycle advocating for best practices and sharing insights.
* Security Compliance Implement security best practices and ensure compliance with industry standards to protect our systems and data.
* Continuous Improvement Contribute to a culture of continuous improvement by identifying areas for enhancement sharing knowledge and mentoring junior engineers.
* On-Call Duties Participate in an on-call rotation to provide after-hours support and ensure operational excellence around the clock.
Required experience
2+ Years of Experience Managing Complex IT Systems/Managing IT Systems
Fluent in English (C1) - All work is conducted in English
Willing to be on on-call duty
Hybrid working environment -1-2 days per week in the Barcelona Office
Methodical approach to troubleshooting complex problems
Scripting languages such as Python PowerShell Bash JavaScript etc.
DevOps toolchain (general understanding)
Knowledge In
Windows Server Roles (AD GPO Certificates File Servers/Storage Management)
Linux
Networking and TCP/IP
Preferred Experience
* 5+ year experience of running large scale customer facing web services
* Cloud infrastructure Knowledge (AWS/OCI)
* Citrix
* Kubernetes Experience
* Configuration management tools such as Chef Ansible etc...
About Us
As a world leader in cloud solutions Oracle uses tomorrows technology to tackle todays problems. True innovation starts with diverse perspectives and various abilities and backgrounds.
When everyones voice is heard were inspired to go beyond whats been done before. Its why were committed to expanding our inclusive workforce that promotes diverse insights and perspectives.
Python, PowerShell, Bash, Windows Server, linux