Title: Senior Platform Engineer
Requisition ID: 255022
Join a purpose driven winning team, committed to results, in an inclusive and high-performing culture.
The Senior Platform Engineer will play a critical role within the Enterprise Data & AI Technology organization - one of Scotiabank’s most significant enterprise wide strategic initiatives. This organization drives data enabled decision making, AI innovation, and technology modernization across the Bank.
The Senior Platform Engineer will be responsible for the building, tuning, managing infrastructure, DevOps, Platform site reliability, monitoring, troubleshooting, enhancing, enabling new features on Data & AI platform(s) as per banks Data & AI strategy. This consists of working with cross functional teams like IAM, Network, Cloud Ops, Security, Client partners etc for integration, process automation, platform enhancement and delivery of new projects.
Is this role right for you? In this role, you will:
- Guidance and Direction : Provide clear direction to the team, set goals, and keep the team accountable for their deliverables. Align team goals with the overall direction of the Azure & Databricks Platform roadmap and enterprise standards.
- Technical Oversight : Own the technical direction across Azure and Databricks: Azure networking and security architecture (VNets, Private Endpoints, NSGs, route tables, Azure Firewall), Azure Identity & Access Management (RBAC, PIM), and Databricks platform governance (Unity Catalog, workspace configuration, cluster policies). Ensure best practices for reliability, cost, and security are consistently applied.
- Quality Assurance: Ensure a high quality of support delivery for platform users; adhere to platform SLAs/SLOs and service objectives
- Process Improvements: Continually improve platform processes and SOPs for efficiency and automation. Design and develop reusable Terraform modules for Azure native resources and Databricks (clusters, SQL warehouses, Unity Catalog objects), enabling consistent, scalable, and automated deployments via Terraform Cloud/Enterprise and CI/CD.
- Customer Relations: Build strong relationships with data engineers, analysts, and platform users. Communicate proactively with stakeholders and cross‑functional teams (Platform, Security, Cloud Ops, Networking, Data Governance) to align priorities, manage expectations, and drive adoption of platform standards.
- Advanced Monitoring and Troubleshooting: Troubleshoot and resolve performance issues across Databricks jobs, clusters, SQL warehouses, and Azure dependencies. Implement Azure Monitor and Log Analytics‑based observability with custom dashboards for cluster/job health, driver/executor metrics, and cost insights. Establish proactive alerting and early issue detection via logs/metrics for Databricks and Azure services.
- Site Reliability : Analyze, triage, and resolve platform issues promptly to achieve SLOs and platform reliability objectives. Drive error‑budget aware practices, post‑incident reviews, and resilience engineering (e.g., autoscaling, retry/backoff strategies, policy guardrails).
- Incident Management : Provide support during major incidents, including after‑hours support. Lead incident response, communications to users and stakeholders, and root‑cause analysis with clear action items and follow‑through.
- Observability Tools Development : Design, build, and deploy logging/monitoring solutions for early detection and actionable insights. Standardize ingestion to Log Analytics from Databricks (audit logs, cluster events, job runs) and key Azure resources; built dashboards and alert rules to reduce MTTR.
- Release Control Management : Maintain and enhance the Infrastructure & Platform release pipeline using Terraform, Terraform Cloud, Azure DevOps and/or GitHub Actions, with source control in GitHub/Bitbucket and artifact promotion via ACR/Artifacts. Enforce approvals, change windows, and automated checks to ensure safe, repeatable releases.
- Client Pipeline Management : Implement CI/CD for infrastructure and analytics workloads using Terraform, Docker, Azure DevOps/GitHub Actions, and Artifact/Container registries.Automated Terraform plan/apply, Databricks Bundle releases, policy validation, and security scanning to streamline delivery and ensure compliance.
- Credential Security : Set up Azure Key Vault and HashiCorp Vault for secret management; integrate with Databricks secret scopes and workload identities. Enforce least‑privilege access via Azure RBAC and rotate credentials per policy.
- Vendor and Technical Support Interaction : Partner with Microsoft and Databricks support and product teams to fine‑tune and troubleshoot components, plan upgrades, and adopt new capabilities aligned to roadmap and enterprise controls.
- Mentorship : Mentor junior engineers in best practices for building, deploying, testing, and supporting services on Azure and Databricks. Promote a culture of automation, documentation, and continuous learning.
Do you have the skills that will enable you to succeed in this role? We'd love to work with you if you have:
- 15+ years of IT experience in big organizations operating in various geographies/regulations.
- 5+ years of hands‑on experience with Microsoft Azure (networking, security, identity, storage, compute, PaaS).
- 5+ years with Databricks on Azure (workspaces, jobs/workflows, clusters/SQL warehouses, Unity Catalog governance).
- 5+ years using Infrastructure as Code (Terraform modules, Terraform Cloud/Enterprise; working knowledge of ARM/Bicep a plus).
- 5+ years with CI/CD (Azure DevOps, GitHub Actions), including automated testing, security scanning, and policy gates.
- 5+ years with development/scripting languages (Python, Go optional; plus Bash/PowerShell) for automation and platform tooling.
- 5+ years with container technologies (Docker, orchestration on AKS or containerized jobs on Databricks/Functions).
- Strong understanding of Azure networking (VNets, subnets, Private Endpoints, NSGs, UDRs, Azure Firewall), RBAC/PIM, and zero‑trust principles.
- In‑depth knowledge of databases and data platforms: Azure SQL, Cosmos DB, Databricks Lakehouse (Delta Lake, SQL Warehouses), and data integration patterns (Event Hubs, ADLS Gen2).
- Comprehensive understanding of SDLC and GitOps (branching, environments, code review, release promotion).
- Experience with config management and automation (Ansible, Bash/PowerShell) and governance via cluster policies and IaC standards.
- Bachelor’s degree in computer science, Engineering, Mathematics, Management or related field
What's in it for you?
- Diversity, Equity, Inclusion & Allyship - We strive to create an inclusive culture where every employee is empowered to reach their fullest potential, respected for who they are, and are embraced through bias-free practices and inclusive values across Scotiabank. We embrace diversity and provide opportunities for all employee to learn, grow & participate through our various Employee Resource Groups (ERGs) that span across diverse gender identities, ethnicity, race, age, ability & veterans.
- Accessibility and Workplace Accommodations - We value the unique skills and experiences each individual brings to the Bank, and are committed to creating and maintaining an inclusive and accessible environment for everyone. Scotiabank continues to locate, remove and prevent barriers so that we can build a diverse and inclusive environment while meeting accessibility requirements.
- Upskilling through online courses, cross-functional development opportunities, and tuition assistance.
- Competitive Rewards program including bonus, flexible vacation, personal, sick days and benefits will start on day one.
- Dynamic Ecosystem - Free tea & coffee, universal washrooms, and lots of space for team collaboration.
- Community Engagement - No matter where you choose to work from; we offer opportunities for community engagement & belonging with our various programs.
Location(s): Canada : Ontario : Toronto
Scotiabank is a leading bank in the Americas. Guided by our purpose: "for every future", we help our customers, their families and their communities achieve success through a broad range of advice, products and services, including personal and commercial banking, wealth management and private banking, corporate and investment banking, and capital markets.
At Scotiabank, we value the unique skills and experiences each individual brings to the Bank, and are committed to creating and maintaining an inclusive and accessible environment for everyone. If you require accommodation (including, but not limited to, an accessible interview site, alternate format documents, ASL Interpreter, or Assistive Technology) during the recruitment and selection process, please let our Recruitment team know. If you require technical assistance, please click here. Candidates must apply directly online to be considered for this role. We thank all applicants for their interest in a career at Scotiabank; however, only those candidates who are selected for an interview will be contacted.
Job Segment:
Technical Support, Quality Assurance, Computer Science, Investment Banking, QA, Technology, Quality, Finance