4 days ago
only 25 days until close

Site Reliability Developer NSC


Oracle
Location: UK
Job type: Permanent
Category: Design / Development Jobs
Site Reliability Developer NSC-20000R8A

Applicants are required to read, write, and speak the following languages: English

Preferred Qualifications

As a Site Reliability Engineer, you will be focused on improving service reliability, performance and operability of Oracle Cloud Services. You will have your hand on the pulse of the services and will play a key role in responding to live service issues. Additionally, you will have the opportunity to create automation and tooling that will allow us to continuously improve our services.

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to new regions and improve the availability, scalability, and efficiency of Oracle products and services using the right architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.

You will work with the other engineers on the team on the shared full-stack ownership of a collection of services and/or technology areas, and understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. You will be responsible for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance, and with authority for end-to-end performance and operability. Partnering with development teams in defining and implementing improvements in service architecture, you will articulate technical characteristics of services and technology areas and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. You will need the ability to understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. You will demonstrate a clear understanding of automation and orchestration principles and act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). A deep understanding of service topology and dependencies is required to troubleshoot issues and define mitigations. Professional curiosity and a desire to develop a deep understanding of services and technologies are required.

Knowledge, Skills, Abilities, and Background

* College diploma or university degree in computer science or an equivalent field, or 5+ years of equivalent work experience in an infrastructure, systems, engineering, or development environment.

* Excellence in verbal and written communication. Ability to communicate with all levels during critical events and be a bridge for technical discussion with non-technical people.

* 3+ years of experience running large-scale, customer-facing web services in a DevOps/SRE environment

* Strong technical background with an ability to troubleshoot issues impacting large-scale service architectures and application stacks.

* Demonstrable experience in one or more scripting/programming languages such as Java, Bash, Go, Python, Ruby, etc.

* Familiarity with large scale cloud networking infrastructure, including network architectures, TCP/IP protocols, firewall management, routing, switching, ACLs, SSL/TLS

* Experience utilizing Cloud Infrastructure such as Oracle Cloud Infrastructure, Azure, AWS, OpenStack, GCP

* Experience installing, configuring, and maintaining Linux, Linux services, and Linux networking, preferably Oracle Linux.

* Configuration and maintenance of applications such as Kafka, HDFS clusters

* Configuration and change management and automation

* Cloud Service monitoring and incident management

* Mastery of source code management for large teams using tools such as Git, TeamCity, Terraform, Artifactory, Bitbucket, Docker (some combination of these is helpful)

Detailed Description and Job Requirements

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.

Work with the Site Reliability Engineering (SRE) team on the shared full-stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission-critical stack, with a focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate a clear understanding of automation and orchestration principles. Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Use a deep understanding of service topology and dependencies to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to develop a deep understanding of services and technologies.

A BS or MS in Computer Science, or equivalent. Identifies solutions using knowledge of server hardware and software configuration, networking, standard internet services, scripting languages, cloud computing patterns, and technology security and compliance. Experience running large-scale, customer-facing web services. Identifies solutions using an understanding of load balancing technologies and experience with development in programming languages, databases and big data stores, and container technologies. Work involves defining and documenting the technical architecture of complex and highly scalable products. A minimum of 5 years' experience running large-scale, customer-facing web services.

As part of Oracle's employment process, candidates will be required to successfully complete a pre-employment screening process. This will involve identity and employment verification, professional references, education verification, and verification of professional qualifications and memberships (if applicable).

Job: Product Development

Location: United Kingdom

Job Type: Regular Employee Hire

Organization: Oracle