Director, Site Reliability Engineer | Sydney, Au

Details of the offer

Our Purpose

We work to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart, and accessible. Using secure data and networks, partnerships, and passion, our innovations and solutions help individuals, financial institutions, governments, and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views, and experiences. We believe that our differences enable us to be a better team - one that makes better decisions, drives innovation, and delivers better business results.
Title and Summary

Director, Site Reliability Engineer
About the Role

We are seeking a Director, Site Reliability Engineer (SRE) to join our Business Operations team at Mastercard. As the production readiness steward for Mastercard products, you'll play a vital role in ensuring our platform's stability, scalability, and performance.
In this role, you will empower developers to build resilient, fault-tolerant products by providing support during the application build phase, focusing on operational design, automation, capacity planning, and monitoring. You'll also lead efforts in triage, root cause analysis, and proactive risk management to enhance customer experience and maximize application value.
Over the course of their career, a Biz Ops engineer will spend time on each of the following aspects of the role:
Operational Readiness Architect:
• Serve as the primary contact responsible for the overall application health, performance, and capacity.
• Support services before they go live through activities such as system design consulting, capacity planning, and launch reviews.
• Partner with the development and product team of a new application to establish the right monitoring and alerting strategy and create the framework to achieve zero downtime during deployment.
Site Reliability Engineering:
• Perform operability and resilience design, and implement and maintain highly reliable and scalable infrastructure.
• Perform root cause analysis of incidents and collaborate with development teams to resolve issues.
• Stay up to date with the latest technologies and trends in SRE and cloud computing.
• Participate in on-call rotations and be available to respond to critical incidents.
• Take complete end-to-end run ownership of the product.
• Practice sustainable incident response and blameless post-mortems while taking a holistic approach to problem solving and optimizing time to recover.
• Automate data-driven alerts to proactively escalate issues. Work with development teams to establish SLOs and improve reliability.
DevOps/Automation:
• Tackle complex development, automation, and business process problems. Engage in and improve the whole lifecycle of services—from inception and design, through deployment, operation, and refinement.
• Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in DevOps automation and best practices.
• Perform operational and resilience design and implement solutions for capacity planning and performance optimization.
• Increase automation and tooling to reduce toil and manual intervention.
ITSM Practices:
• Analyze ITSM activities of the platform and provide a feedback loop to development teams on operational gaps or resiliency concerns.

Role qualifications:

The ideal candidate will have:
• BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience.
• Ability to read, write, and understand code in a programming language or framework such as Java, Spring Framework, Python, or Go.
• Strong understanding of DevOps principles and practices, along with configuration management.
• Experience in operational and resilience design, and in building and operating large-scale, distributed systems.
• A passion for observability, automation, and continuous improvement.
• Familiarity with cloud platforms like AWS, Azure, or GCP (a plus).
• Experience with observability tools such as Splunk, Dynatrace, Prometheus, Datadog, and Grafana, and with Monitoring as Code.
• Experience with algorithms, data structures, scripting, pipeline management, and software design.
• A systematic, analytical approach to problem solving, coupled with strong communication skills and a sense of ownership and drive.
• Strong leadership and mentoring skills.
• Willingness and ability to learn, to take on challenging opportunities, and to work as a member of a matrix-based, diverse, and geographically distributed project team.

Corporate Security Responsibility

All activities involving access to Mastercard assets, information, and networks come with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:

• Abide by Mastercard's security policies and practices;
• Ensure the confidentiality and integrity of the information being accessed;
• Report any suspected information security violation or breach; and
• Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines.


Source: Jobleads
