Staff Reliability Engineer

Staff Reliability Engineer
Company:

Blinq Technologies


Details of the offer

WHAT IS BLINQ?
The first interaction two people have is the bedrock of all strong business relationships. If you can make that experience special, you can start to build a great second interaction, and so on. Blinq is the tool to help people do that. We're building a platform that allows you to share a snapshot of who you are with anyone, anywhere via digital business cards, dynamic email signatures and virtual backgrounds. Join us on our mission to help the world connect.
Role Overview: As a Staff Reliability Engineer, you will be at the forefront of maintaining the reliability and performance of our digital business card platform. You will play a key role in designing systems that ensure high availability, optimizing our infrastructure to handle increasing demand, and automating processes that keep our services resilient. Your expertise will be vital in ensuring that our users can exchange digital business cards and manage their profiles without interruption. You'll also work closely with product and development teams to integrate reliability as a core aspect of our platform's growth.
This is a hybrid role working from our Melbourne or Sydney locations.
What You Will Own: Ensure platform reliability: Lead efforts to enhance the reliability and availability of our digital business card platform, ensuring users have a seamless experience when sharing and managing their information. Monitor and optimize performance: Continuously improve platform performance, making sure that it scales efficiently and remains responsive as our user base grows. Incident detection and response: Implement robust monitoring and alerting systems to detect and resolve issues swiftly, minimizing downtime for users during critical networking moments. Collaborate with cross-functional teams: Work with product, development, and operations teams to integrate reliability engineering into the product lifecycle, ensuring that reliability is considered from design through deployment. Automation and scaling: Automate manual processes and optimize system scalability, reducing human intervention and ensuring the platform remains stable under increased user demand. Leadership and mentoring: Mentor junior engineers in reliability best practices, fostering a culture of reliability across engineering teams. Post-incident analysis: Perform root cause analysis for incidents and outages, driving initiatives to prevent future occurrences and improve system resiliency. What We Look For In You: 8+ years experience in site reliability engineering within SaaS or digital products. Experience with cloud platforms (AWS, GCP, Azure), Kubernetes, Docker, Terraform, and infrastructure-as-code. Strong expertise in automating workflows with Typescript, Node or similar programming languages to improve efficiency and system resilience. Experience with monitoring tools (e.g., Prometheus, Grafana, Datadog) to implement effective observability and alerting systems. Demonstrated ability to lead incident response processes, manage critical outages, and implement long-term improvements. Excellent communication skills and a collaborative mindset for working with cross-functional teams. We welcome individuals at all experience levels and take pride in being an equal opportunity employer committed to creating an inclusive and diverse workforce. Join us on this remarkable journey as we reshape the way people connect and network.
#J-18808-Ljbffr


Source: Jobrapido_Ppc

Requirements

Staff Reliability Engineer
Company:

Blinq Technologies


Testers

The Department of Employment and Workplace Relations (DEWR) enables access to quality skills, training, and employment to support Australians find secure wor...


From Kirra Services - Australia

Published 9 days ago

Grc Cyber Consultant

Security (Information & Communication Technology) About the Role: Our client, a leading provider of cybersecurity advisory, is seeking a talented Mid to Seni...


From Tideri Jobbörse - Australia

Published 9 days ago

Ict Security Operations Lead

At Leidos, we do work that really matters inspired by our mission to make the world safer, healthier, and more efficient through technology, engineering, and...


From Tideri Jobbörse - Australia

Published 9 days ago

Test Analyst

The department is seeking an experienced Tester, who is part of a small agile Testing team as well as being hands on with writing and executing Test Cases, w...


From Tideri Jobbörse - Australia

Published 9 days ago

Built at: 2024-10-06T08:21:50.396Z