Service Reliability Engineer (Sre)

Service Reliability Engineer (Sre)
Company:

1058 Amadeus It Pacific Pty Ltd


Details of the offer

Service Reliability Engineer (SRE) page is loaded
Service Reliability Engineer (SRE) Apply locations Sydney, New South Wales time type Full time posted on Posted 2 Days Ago job requisition id R20813 Job Title
Service Reliability Engineer (SRE) PURPOSE OF THE ROLE The goal of a Service Reliability Engineer will be to accelerate Application teams' ability to reliably and consistently deliver applications by developing standardized automation to form a common continuous deployment pipeline for functional engineering teams as a whole.
Other responsibilities include ongoing issues such as change management, problem management, incident management, performance improvement, and automation/tool development. 
The Service Reliability Engineer is expected to excel under pressure, work well with others, be self-motivated, and be able to manage short and long term projects. Implementing automation for kick starting, monitoring, management, and support will be a key component of the position.
The Service Reliability Engineer will actively interface with software developers, network engineers, systems, storage, project management and database administrators on projects and provide support as required.
The Service Reliability Engineer will troubleshoot and resolve issues quickly and effectively. Good communication and teamwork is extremely important. The role also involves participating in the 24 x 7 pager rotation of the team.
Main Responsibilities: - Application Support Proactive incident management in synchronization with frontline services and Incidents Response Team
Incident response: Monitor and build/define alerting to enable auto-recovery. Provide automation to ensure auto-recovery. In case of not-automatically recovered issue, ensure first full recovery.
Once solved, analyze the root-cause of the issue, liaising with the development teams if needed, implement a specific monitoring and automate a response that will manage auto-recovery if the same issue happens again
Assist developers in debugging application & performance issues
Support application deployments, building new systems and upgrading and patching existing ones.
Operate the platform within our security and privacy guidelines.
- Service Automation Participate in the design and building of tools and processes to support operations. Leverage scripting to build required automation and tools on an ad-hoc basis.
Build and develop automation to enable quick & safe instance deployment
Design, drive, develop and use monitoring tools to find problems, resolve and/or escalate to development and ensure that we exceed our Service Level Agreements
- Continuous Improvement Be accountable for an applicative platform according to SLA, NFR and operability criteria
Contribute to the definition of SLAs, OLAs and NFR
Adopt and ensure usage of monitoring tools to find problems, raise alert, and ensure that we meet our SLAs/OLAs
Ensure process reengineering and optimization
Proactive thought leadership for creative and efficient technology solutions.
Drive continuous improvement to the service delivered to customer (agility, stability).
Drive the enforcement and definition of operational requirements / non-functional requirements in collaboration with application owners and middleware organizations
Document configuration processes and policies
Relevant Work Experience: Good knowledge of Operating Systems such as Linux / Unix: Operational support and Infrastructure support experience + minimum 1+ years of Hands-on experience in Linux administration
Good scripting skills for automation in one or more of the following languages: Ansible, Python
Good understanding of ITIL processes
Knowledge of standard automation tools: Jenkins, Ansible, Python, Git, HP Fortify
Knowledge and practical exposure to IT operations, ideally in mission-critical environments.
Ability to identify improvement potential in a structured manner (ex: following Lean Management, Six Sigma or similar approaches)
Project Management Fundamentals: Knowledge of the different aspects of a project and how they are applied
Knowledge of DevOps fundamental concepts and tooling.
Knowledge on CI/CD
Experience in implementing measurements and alerting in complex environments using standard tools like: ElasticSearch, Kibana, Splunk Grafana, Promotheus, Thanos, Argos, ServiceNow
Well versed with the cloud concepts and offerings of major cloud providers like AWS/Azure or equivalent (AZ900 certified is a plus).
Knowledge on Kubernetes, OpenShift
Basic knowledge of relational databases such as MySQL and Oracle
Basic knowledge of NoSQL (MongoDB, Couchbase) (nice-to-have)
Basic knowledge of how to work on processes (ie six sigma yellow belt)
Diversity & Inclusion Amadeus aspires to be a leader in Diversity, Equity and Inclusion in the tech industry, enabling every employee to reach their full potential by fostering a culture of belonging and fair treatment, attracting the best talent from all backgrounds, and as a role model for an inclusive employee experience.
Amadeus is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to gender, race, ethnicity, sexual orientation,age, beliefs, disability or any other characteristics protected by law.
About Us #J-18808-Ljbffr


Source: Jobrapido_Ppc

Requirements

Service Reliability Engineer (Sre)
Company:

1058 Amadeus It Pacific Pty Ltd


Sysadmin / Data Base Administrator / 4Gl Developer

We are seeking a tertiary qualified professional who can work in a team environment, with the ability to take ownership of projects as well as the capacity t...


From De Bortoli Wines - New South Wales

Published 21 days ago

Canobolas Rural Technology High School - Canteen Manager

Canobolas Rural Technology High School - Canteen Manager 35 hours per week 7.30-3pm School term only Cook/Chef Barista experience essential Manage a small te...


From Canobolas Rural Technology High School - New South Wales

Published 21 days ago

Director Of Information Technology

The Australian Human Rights Commission has an ongoing role for an EL2 Director of Information Technology.Working closely with the COO, this role provides vis...


From Clearcompany - New South Wales

Published 21 days ago

Director Cyber Security Operations

About the Agency The Australian Digital Health Agency (the Agency) is responsible for national digital health services and systems, with a focus on engagemen...


From Australian Digital Health Agency - New South Wales

Published 21 days ago

Built at: 2024-07-07T20:26:52.706Z