Infrastructure Support Engineer - Senior Consultant

Details of the offer

As a senior Infrastructure Support Engineer, you play a vital role in maintaining technical excellence and operational efficiency, with a primary focus on cloud environments. You'll help clients through the transition to agile, value-focused practices, emphasizing shared responsibility and continuous improvement. You will monitor infrastructure performance, respond to incidents promptly and maintain resources in line with modern standards, incorporating sustainable practices.
Job responsibilitiesYou will keep a vigilant eye on the operations of shipped products and services following the agreed upon "Eyes on glass/Follow the sun" engagement models.You will monitor product/service operations against key performance indicators defined by the business and take necessary actions in response to detected deviations.You will define and document the appropriate responses to various kinds of incident scenarios in collaboration with the Service Reliability Engineering (SRE) team and client stakeholders, and prepare runbooks.You will reduce the human effort in day-to-day operations by automating operations, using the latest tech stacks befitting the task and improving the overall efficiency of the entire team as time progresses.The team you are part of will be responsible for responding to incidents in production and other high-value environments and execute the appropriate response as established by runbooks or based on your judgment of incidents.You will be involved in Level 2 and Level 3 support tasks, troubleshooting and resolving incidents. Setting up war rooms for incident response, collaborating with tech leads, SRE leads, and development teams to address incidents and identify root causes.You will prepare incident Root cause analysis (RCA) and postmortem reports, explaining analyses and outlining preventive measures to clients; Collaborating with SRE, development teams or independently, your role is to ensure clear communication and proactive steps for future incident prevention.You will implement service/product reliability improvement in collaboration with service reliability engineers by writing infrastructure/observability configuration code.Job qualificationsTechnical SkillsYou have hands-on experience in using any CI/CD tools such as Jenkins, CircleCI or Gitlab for executing deployments.You have knowledge of Infrastructure as Code (IAC) tech stacks such as Terraform, Ansible, ARM or Cloudformation to provision and manage infrastructure.You have working experience in using observability tools for logging, monitoring, tracing and alerting, e.g.: Datadog/Prometheus/Grafana, ELK/EFK/Splunk.Experience and understanding across a range of AWS products.You have hands-on experience executing most common operations in managing workloads on any container ecosystem tech stacks. e.g.: Docker, Kubernetes, Openshift, etc.You understand system performance tuning and scaling to handle common heavy load scenarios along with concepts of highly available systems and basics of disaster recovery solutions, and are familiar with failover, backup and recovery concepts.You have experience operating a Linux OS such as RHEL or a Debian-Based OS and are familiar with most common Linux OS operations and commands, reading and tweaking Bash scripts and managing runtime environment configurations such as Env Vars, Logs, etc.You have experience supporting backend storage solutions such as SQL and NoSQL databases, e.g.: Postgres and MongoDB, and caching solutions such as Redis and Memcached.You have experience in networking configuration and security, and are familiar with common networking setup and security practices, e.g.: loading, balancing, proxies, transport layer security (TLS) and certificate management, and an understanding of standard network protocols and configurations.You have a good understanding of fundamental concepts of APIs such as request, response, headers, authentication, JSON payloads, etc.Professional SkillsYou have strong communication and articulation skills, are proficient in English and able to confidently hold a Q&A discussion with senior stakeholders.You have people skills with an emphasis on close collaboration with multiple, cross-functional teams from the client side or Thoughtworks.You have the ability to work under pressure with composure during production incidents.You have strong analysis, deduction and reasoning skills, with the ability to identify patterns in data and draw conclusions.You have strong drive and ownership to sign up and deliver work when called upon without being too concerned with role boundaries.You are willing to be part of a rotation- and need-based 24x7 available team.
#J-18808-Ljbffr


Nominal Salary: To be agreed

Source: Jobleads

Requirements

Platform Administrator

WE ARE FUJITSU We use technology to make happier lives. We are a global leader in technology and business solutions that transform organizations and the worl...


Fujitsu - New South Wales

Published 8 days ago

Lead Solution Designer

Delivery Lead - Solution Design Join a Leading Brand! Innovative & Future Tech Focus Great Hybrid Teamwork Culture 12 month contract Overview: We are seeking...


Futureyou - New South Wales

Published 8 days ago

Senior Software Engineer

Job Opportunity: Senior Software Engineer Client Overview: We are partnering with a leading financial institution seeking multiple talented Senior Software E...


Randstad - New South Wales

Published 8 days ago

Senior Developer

Join a Leading Healthcare Technology Company Full-Time Role in Bowral | Interesting & Varied Role EARN $95,000 - $100,000 pa + Superannuation About The Compa...


Employment Office - New South Wales

Published 8 days ago

Built at: 2024-11-26T22:42:05.414Z