02nd December, 2024 NV1 clearance required
SFIA Level – USUP5
The Level 3 Support Engineer or Systems Administrator is responsible for proactively monitoring and maintaining the health, performance, and security of the various systems and modules.
The role involves, but is not limited to, proactive system management, incident and request resolution, capacity management, and collaboration with stakeholders to ensure smooth operational capability for the systems under their purview.
The person in this role will be detail-oriented, proactive, and skilled in troubleshooting Windows/Linux environments, with strong communication skills and the ability to coordinate and collaborate with a diverse range of stakeholders within the Department.
It is also expected that the person in this role will have a minimum of 2 years of experience prior to taking up this position.
Key Responsibilities
System Monitoring and Incident Management
Utilise monitoring tools to oversee infrastructure, applications, and system alerts.
Conduct daily system health checks and investigate/escalate issues as needed.
Manage incident and service request queues, providing Level 2/3 support, troubleshooting, and resolution of user-reported issues.
Coordinate with resolver groups and raise tickets for critical issues.
User Account and Permission Management
Assist in managing user accounts and permissions within Active Directory across all domains.
Ensure adherence to security policies for account and permission modifications.
System Performance Management
Identify and resolve capacity issues, ensuring optimal resource allocation and system performance.
Generate performance reports and provide routine system and task summaries to support management decisions.
Patch Management
Manage patching cycles for all systems, deploying updates using a central repository.
Ensure cybersecurity compliance across systems, identify endpoints that fall out of compliance and rectify.
Change Management
Initiate change requests for necessary deployments and coordinate with stakeholders to minimise disruption.
Coordinate and communicate planned and unplanned outages to all stakeholders in a timely manner to minimise impact of disruption.
Documentation and SOP Development
Create and maintain Standard Operating Procedures (SOPs) for incident, request, and change management.
Review and update system administrator documentation as needed.
Update user guides to reflect system changes and provide end-user training materials as needed.
Hardware and Software Deployment
Build and configure hardware assets for deployment as required.
Deploy software updates and manage application installations in alignment with organisational standards.
Technical Assistance and Support
Provide end-user support for hardware, software, and network issues, ensuring timely and effective resolutions.
Perform service and performance checks on critical server services, proactively addressing potential issues.
#J-18808-Ljbffr