Please be advised that our Careers site will be unavailable from November 28 at 12am ET to November 29 12am ET for scheduled system maintenance.

Title:  Director, Resilience Engineering 1

 

 

 

Requisition ID: 248316

Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.

 

As the Director, Resilience Engineering, you contribute to the global success of the Resilience Engineering function by designing, building, and continuously improving the bank’s resilience capabilities. You lead an engineering team responsible for delivering resilience patterns, automation, resilience and chaos testing frameworks, observability capabilities, and insight driven improvements that enhance the stability and reliability of technology services.

You ensure that all activities are executed in alignment with enterprise standards, architectural requirements, regulatory expectations, and the bank’s risk culture. You champion engineering excellence, innovation, and high performance across global technology teams.


Is this role right for you? In this role, you will:

Leadership & Strategy
•    Champion a customer focused, engineering driven culture that strengthens resilience across the bank’s global technology organization.
•    Define and drive the strategic roadmap for resilience engineering capabilities, in alignment with enterprise architecture, operational resilience, information security, and technology risk.
•    Provide thought leadership on modern resilience engineering practices, including resilience-by-design, failure mode analysis, SRE principles, chaos engineering, and automated resilience validation.

Engineering & Delivery
•    Lead the development, implementation, and lifecycle management of resilience engineering capabilities, including: 
o    resilience and chaos testing automation,
o    common tooling for resilience,
o    fault injection frameworks and recovery automation,
o    observability and telemetry capabilities that support resilience insights.
•    Partner with platform and application teams globally to embed resilience into architecture, software design, and operational practices.
•    Drive enterprise adoption of resilience tooling, telemetry pipelines, resilience libraries, and engineering accelerators.

Data Analytics & Operational Insights
•    Leverage operational data (logs, metrics, traces, change data, incident patterns, capacity telemetry, SLO/SLA performance) to identify leading indicators of instability and proactively reduce incidents.
•    Translate complex operational data into meaningful insights, engineering backlogs, and actionable recommendations that improve system availability and resilience.
•    Use analytical tooling (e.g., SQL, Python, data visualization platforms, observability/AIOps systems) to derive trends, correlations, and prioritized areas of engineering focus.
•    Champion a culture of insight driven engineering, ensuring teams use data effectively to inform decisions, drive proactivity, and validate resilience controls.

Governance, Oversight & Risk Management
•    Provide engineering perspectives and evidence to support resilience related policy, standard, and framework updates.
•    Identify technology resilience gaps through technical reviews, test results, telemetry analysis, and incident post mortems. Lead engineering workstreams to address them.
•    Partner with Operational Resilience, Enterprise Architecture, Information Security & Controls, and Technology Risk to drive an integrated resilience posture.
•    Support internal and external audits by providing technical documentation, test artifacts, architectural diagrams, and resilience evidence.

Stakeholder & Executive Engagement
•    Build strong relationships with senior technology stakeholders (SVPs, VPs, platform owners, engineering leaders) and influence adoption of resilience engineering practices.
•    Clearly articulate complex engineering concepts and resilience risks to non technical audiences, enabling informed decision making.
•    Lead resilience engineering forums, communities of practice, and cross functional working groups.

People Leadership
•    Lead and develop high performing engineering teams, fostering craftsmanship, innovation, continuous improvement, and inclusion.
•    Mentor engineers and SMEs across global locations; promote a culture of learning, experimentation, and engineering excellence.
•    Contribute to a high performance environment aligned with the bank’s values and the risk culture.

  


Do you have the skills that will enable you to succeed in this role? We'd love to work with you if you have:


•    7–10+ years of experience in technology engineering roles such as site reliability engineering (SRE), platform engineering, solution architecture, system design, or production engineering.
•    Strong understanding of distributed systems, cloud architectures, networking fundamentals, and application patterns that impact availability and resilience.
•    Hands on experience with resilience engineering practices such as: 
•    failure mode and dependency analysis
•    chaos engineering
•    automated failover/failure testing
•    observability, metrics, logging, distributed tracing
•    performance and capacity engineering
•    Demonstrated experience building or integrating platform level capabilities (e.g., telemetry pipelines, resilience libraries, automation frameworks).
•    Experience in major cloud platforms (GCP, Azure) and container/orchestration ecosystems (Kubernetes) is an asset.
•    Familiarity with ITSM processes and tooling (ServiceNow is a plus).
•    Demonstrated ability to derive engineering insights from operational data to reduce incident frequency and improve system stability.
•    Experience analyzing time series telemetry, observability data, incident patterns, and reliability metrics.
•    Proficiency with analytics tooling (SQL, Python, visualization platforms, AIOps/observability systems).
•    Ability to convert complex datasets into engineering actions, design improvements, and strategic recommendations.
•    Strong leadership presence with the ability to influence at senior levels across technology and risk stakeholders.
•    Excellent communication skills—able to translate complex engineering findings into clear narratives and actionable recommendations.
•    Strong organizational and prioritization skills, able to manage multiple engineering workstreams concurrently.
•    Calm under pressure, especially during high severity incidents, with the ability to provide technical leadership and direction.
•    Demonstrated collaboration across enterprise functions, including architecture, security, operational resilience, and development teams.
•    Professional certifications (e.g., cloud architecture, SRE, ITIL, resilience/continuity) beneficial but not required.

 

 

What's in it for you?
 
•    Diversity, Equity, Inclusion & Allyship - We strive to create an inclusive culture where every employee is empowered to reach their fullest potential, respected for who they are, and are embraced through bias-free practices and inclusive values across Scotiabank. We embrace diversity and provide opportunities for all employee to learn, grow & participate through our various Employee Resource Groups (ERGs) that span across diverse gender identities, ethnicity, race, age, ability & veterans.
•    Accessibility and Workplace Accommodations - We value the unique skills and experiences each individual brings to the Bank and are committed to creating and maintaining an inclusive and accessible environment for everyone. Scotiabank continues to locate, remove, and prevent barriers so that we can build a diverse and inclusive environment while meeting accessibility requirements. 
•    Upskilling through online courses, cross-functional development opportunities, and tuition assistance. 
•    Competitive Rewards program including bonus, flexible vacation, personal, sick days, and benefits will start on day one.
•    Community Engagement - no matter where you choose to work from; we offer opportunities for community engagement & belonging with our various programs.

 

Location(s):  Canada : Ontario : Toronto 

Scotiabank is a leading bank in the Americas. Guided by our purpose: "for every future", we help our customers, their families and their communities achieve success through a broad range of advice, products and services, including personal and commercial banking, wealth management and private banking, corporate and investment banking, and capital markets.  

At Scotiabank, we value the unique skills and experiences each individual brings to the Bank, and are committed to creating and maintaining an inclusive and accessible environment for everyone. If you require accommodation (including, but not limited to, an accessible interview site, alternate format documents, ASL Interpreter, or Assistive Technology) during the recruitment and selection process, please let our Recruitment team know. If you require technical assistance, please click here. Candidates must apply directly online to be considered for this role. We thank all applicants for their interest in a career at Scotiabank; however, only those candidates who are selected for an interview will be contacted.


Job Segment: Testing, Information Security, Information Technology, IT Architecture, Cloud, Technology