Associate Director of Engineering - Emerging Technology and Innovation (ETIV) - AI
New Today
Overview
Associate Director of Engineering - Emerging Technology and Innovation (ETIV) - AI
In this fantastic new role, you will join a growing team to work with a wide range of experienced engineers, product managers, and production support specialists supporting our Group AI offering (e.g. speech transcription, translation, knowledge management) for scaled consumption across our Group Business and Functions. You will be a senior leader within ETIV, acting as a regional Production Support Lead to provide consultancy, practice and guidance to application teams to ensure the stability, reliability, and performance of our production systems and critical applications.
Based in Sheffield on a hybrid basis.
Responsibilities
- Production Support: 24x7 production support to attend system alerts, recovery, and any operational tasks to ensure system reliability and availability; participate in rotating on-call duties; manage escalations and proactively prevent service outages.
- Incident Management and Problem Management: Oversee incident response and drive timely resolution and recovery to minimise service degradation and outage time; conduct post-incident reviews for root cause analysis and improvement actions.
- Change Management: Drive proper change management and approval processes for all change requests for the platform, ensuring proper planning and execution.
- Automation, Monitoring and Visualization: Drive automation of operational tasks, monitoring to detect and handle issues early, and visualizations (e.g., dashboards) to understand system health in real time.
- Capacity Management: Work with the platform team to ensure the platform is fit for future demand.
- Best Practices and Collaboration: Define best practices for reliability and collaborate with platform and capability engineering teams to ensure these are implemented in development to ensure reliability and availability.
- Security and Vulnerability: Ensure the platform and systems comply with relevant controls, particularly security and vulnerability patching.
- Continuous Improvement: Identify improvement opportunities and drive their implementation to ensure site reliability and availability with proper reporting.
- SRE: Lead the adoption and implementation of SRE principles, including SLIs, SLOs, and error budgets to improve service reliability and performance.
- Collaboration: Work with engineering, QA, and product teams to incorporate operational requirements into the application lifecycle and advocate for reliability-focused engineering practices.
- Leadership: Mentor and guide junior SRE and production support team members, fostering ownership, continuous learning, and excellence.
Qualifications and Skills
Technical:
- DevOps
- Containerization (Docker, Kubernetes)
- Designing and operating scalable, secure, and highly available platforms (e.g., Kubernetes, GKE, EKS, or OpenShift)
- Container orchestration and CI/CD
- AI / GenAI platforms and workloads, including ML pipelines, model serving/inference, GPU/accelerator resource management
- Java, Python, Go, or Bash
- Configuration management (e.g., Terraform, Ansible) to enable infrastructure as code and automation
- Reliability and performance improvements, including incident management, problem management, change planning and control, and capacity management
- Translation of strategies and plans to achieve business and functional goals
- Senior stakeholder management
- Relationship management
Behavioural Skills:
- Customer oriented
- Outcome oriented
- Problem solver
- Team management
Cognitive Skills:
- Divided attention
- Quantitative
- Critical thinking
- Collaboration
- Logic and reasoning
This role is based in Sheffield on a hybrid basis.
EEO and Recruitment
Being open to different points of view is important for our business and the communities we serve. HSBC is dedicated to creating diverse and inclusive workplaces. We are committed to removing barriers and ensuring careers are inclusive and accessible for everyone. We take pride in being a Disability Confident Leader and will offer an interview to people with disabilities, long term conditions or neurodivergent candidates who meet the minimum criteria for the role.
If you need accommodations or changes during the recruitment process, please contact our Recruitment Helpdesk:
Email: hsbc.recruitment@hsbc.com
Telephone: +44 207 832 8500
- Location:
- Sheffield
- Job Type:
- FullTime
- Category:
- IT & Technology
We found some similar jobs based on your search
-
New Today
Associate Director of Engineering - Emerging Technology and Innovation (ETIV) - AI
-
Sheffield
- IT & Technology
Overview Associate Director of Engineering - Emerging Technology and Innovation (ETIV) - AI In this fantastic new role, you will join a growing team to work with a wide range of experienced engineers, product managers, and production support specia...
More Details -
-
New Today
Associate Director of Engineering - Emerging Technology and Innovation (ETIV) - AI
-
Sheffield, England, United Kingdom
-
£150,000 - £200,000
- IT & Technology
Overview Associate Director of Engineering - Emerging Technology and Innovation (ETIV) - AI In this fantastic new role, you will join a growing team to work with a wide range of experienced engineers, product managers, and production support specia...
More Details -
-
New Yesterday
Associate Director of Engineering - Emerging Technology and Innovation (ETIV) - AI
-
Sheffield
- IT & Technology
Overview Associate Director of Engineering - Emerging Technology and Innovation (ETIV) - AI. In this fantastic new role, you will join a growing team to work with a wide range of experienced engineers, product managers, and production support specia...
More Details -
-
New Yesterday
Associate Director of Engineering - Emerging Technology and Innovation (ETIV) - AI
-
Sheffield, England, United Kingdom
-
£150,000 - £200,000
- IT & Technology
Associate Director of Engineering - Emerging Technology and Innovation (ETIV) - AI. Duties include 24x7 Production Support to attend system alerts, recovery, and any operational tasks to ensure system reliability and availability. You will be a Senior leader within ETIV, acting as a regional Production Support Lead.
More Details -