Associate Director of Engineering - Emerging Technology and Innovation (ETIV) - AI

New Today

Overview

Associate Director of Engineering - Emerging Technology and Innovation (ETIV) - AI

In this fantastic new role, you will join a growing team to work with a wide range of experienced engineers, product managers, and production support specialists supporting our Group AI offering (e.g. speech transcription, translation, knowledge management) for scaled consumption across our Group Business and Functions. You will be a senior leader within ETIV, acting as a regional Production Support Lead to provide consultancy, practice and guidance to application teams to ensure the stability, reliability, and performance of our production systems and critical applications.

Based in Sheffield on a hybrid basis.

Responsibilities

  • Production Support: 24x7 production support to attend system alerts, recovery, and any operational tasks to ensure system reliability and availability; participate in rotating on-call duties; manage escalations and proactively prevent service outages.
  • Incident Management and Problem Management: Oversee incident response and drive timely resolution and recovery to minimise service degradation and outage time; conduct post-incident reviews for root cause analysis and improvement actions.
  • Change Management: Drive proper change management and approval processes for all change requests for the platform, ensuring proper planning and execution.
  • Automation, Monitoring and Visualization: Drive automation of operational tasks, monitoring to detect and handle issues early, and visualizations (e.g., dashboards) to understand system health in real time.
  • Capacity Management: Work with the platform team to ensure the platform is fit for future demand.
  • Best Practices and Collaboration: Define best practices for reliability and collaborate with platform and capability engineering teams to ensure these are implemented in development to ensure reliability and availability.
  • Security and Vulnerability: Ensure the platform and systems comply with relevant controls, particularly security and vulnerability patching.
  • Continuous Improvement: Identify improvement opportunities and drive their implementation to ensure site reliability and availability with proper reporting.
  • SRE: Lead the adoption and implementation of SRE principles, including SLIs, SLOs, and error budgets to improve service reliability and performance.
  • Collaboration: Work with engineering, QA, and product teams to incorporate operational requirements into the application lifecycle and advocate for reliability-focused engineering practices.
  • Leadership: Mentor and guide junior SRE and production support team members, fostering ownership, continuous learning, and excellence.

Qualifications and Skills

Technical:

  • DevOps
  • Containerization (Docker, Kubernetes)
  • Designing and operating scalable, secure, and highly available platforms (e.g., Kubernetes, GKE, EKS, or OpenShift)
  • Container orchestration and CI/CD
  • AI / GenAI platforms and workloads, including ML pipelines, model serving/inference, GPU/accelerator resource management
  • Java, Python, Go, or Bash
  • Configuration management (e.g., Terraform, Ansible) to enable infrastructure as code and automation
  • Reliability and performance improvements, including incident management, problem management, change planning and control, and capacity management
  • Translation of strategies and plans to achieve business and functional goals
  • Senior stakeholder management
  • Relationship management

Behavioural Skills:

  • Customer oriented
  • Outcome oriented
  • Problem solver
  • Team management

Cognitive Skills:

  • Divided attention
  • Quantitative
  • Critical thinking
  • Collaboration
  • Logic and reasoning

This role is based in Sheffield on a hybrid basis.

EEO and Recruitment

Being open to different points of view is important for our business and the communities we serve. HSBC is dedicated to creating diverse and inclusive workplaces. We are committed to removing barriers and ensuring careers are inclusive and accessible for everyone. We take pride in being a Disability Confident Leader and will offer an interview to people with disabilities, long term conditions or neurodivergent candidates who meet the minimum criteria for the role.

If you need accommodations or changes during the recruitment process, please contact our Recruitment Helpdesk:

Email: hsbc.recruitment@hsbc.com

Telephone: +44 207 832 8500

#J-18808-Ljbffr
Location:
Sheffield
Job Type:
FullTime
Category:
IT & Technology

We found some similar jobs based on your search