Infrastructure Manager - AI SaaS and deployed AI Trellis Data is an award-winning AI SaaS and Deployed AI company. As one of the leading providers of secure AI products, and builders of large language models and other cutting-edge AI technology, we provide a highly innovative and vibrant culture. With offices in Australia and the US and a global customer base, we offer one of the rarest opportunities for material sales success, available in Australia.
We are seeking a results-driven Infrastructure Manager to join our forward-thinking team.
Key Responsibilities :
Design, implement, and manage our IT infrastructure to support the development, deployment, and scalability of our machine learning solutions in Australia and globally. Manage cloud environments (e.g., AWS, Azure, Google Cloud) and our own physical infrastructure to ensure cost-effectiveness, scalability, and reliability. Implement and maintain network infrastructure, including routers, switches, firewalls, and VPNs. Develop and enforce security policies and procedures to protect data, systems, and networks from security threats and vulnerabilities. Oversee system administration tasks, including server provisioning, configuration, monitoring, and maintenance. Ensure the availability, performance, and reliability of servers, storage, and other infrastructure components. Drive automation initiatives to streamline infrastructure provisioning, deployment, and management processes. Collaborate with development teams to implement CI/CD pipelines, infrastructure as code (IaC), and configuration management tools. Conduct capacity planning assessments to anticipate infrastructure needs and scale resources accordingly. Monitor system performance and identify opportunities for optimization and improvement. Develop and maintain disaster recovery plans and procedures to ensure business continuity. Requirements :
Bachelor's degree in computer science, engineering, or a related field (or equivalent experience). Proven experience in infrastructure management or related roles, preferably in the technology or machine learning industry. Strong technical expertise in cloud computing, networking, virtualization, and system administration. Demonstrated experience with cloud platforms (e.g., AWS, Azure, Google Cloud) and infrastructure as code (IaC) tools (ideally Terraform). Solid understanding of security best practices, compliance requirements, and risk management principles. Proficiency in automation and scripting languages (e.g., Python, Bash) and automated configuration management (e.g., Ansible). Excellent leadership, communication, and interpersonal skills, with the ability to effectively collaborate with cross-functional teams. Deep understanding of server and application monitoring and log management. How to Apply : This role is available in Canberra only. Submit your resume and a cover letter explaining why you want to work with an award-winning team!
#J-18808-Ljbffr