Site Reliability Engineer

Intrado
Full time
Software Development
Canada
Hiring from: Canada
About Us

Intrado is dedicated to saving lives and protecting communities, helping them prepare for, respond to, and recover from critical events.

Today, our cutting-edge SaaS company is at the forefront of transforming the 911 emergency response continuum with next generation data-driven software. Intrado’s solutions allow enterprises, call takers, dispatchers, and first responders to make more informed decisions, respond quickly and safely, and ultimately serve their communities better.

Responsibilities/Qualifications

In this Site Reliability Engineering (SRE) role, you’ll partner closely with development and business teams to create effective monitoring, alerting, and observability solutions that improve system performance and visibility. You’ll support production systems, troubleshoot complex issues, and help drive long-term stability through proactive incident management and automation. You’ll also design secure, cost-effective, and reliable cloud infrastructure.

Reliability Engineering & System Operations

  • Design, implement, and maintain scalable, reliable production systems.
  • Troubleshoot and resolve complex application and system issues.
  • Collaborate with development teams to build features with reliability, observability, and performance in mind.
  • Apply Site Reliability Engineering (SRE) best practices including Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs).

Monitoring & Observability

  • Develop and maintain monitoring, alerting, synthetic testing, and dashboards to ensure visibility into system health.
  • Configure agents for metrics/log collection and manage incident notification channels.
  • Analyze trends and recurring issues to drive proactive improvements.

Cloud Infrastructure Management

  • Manage and optimize AWS/Azure environments in staging and production.
  • Collaborate with architecture, development, and finance teams to design secure, cost-effective, and reliable cloud infrastructure.

Incident & Problem Management

  • Participate in 24/7 on-call rotations, quickly respond to production incidents, and identify root causes.
  • Lead post-mortems and implement long-term fixes.
  • Escalate and communicate issues as appropriate.

Automation & Tooling

  • Automate repetitive operational tasks and improve system efficiency.
  • Build and maintain deployment and configuration tools.
  • Work with CI/CD tools such as GitHub Actions.

Collaboration & Customer Focus

  • Partner with product and development teams to prioritize and resolve production-impacting issues.
  • Support internal teams with tools and insights for efficient self-service.
  • Ensure timely resolution of tickets and clear communication with stakeholders.

Architecture & Documentation

  • Review technical documentation (HLDs/FRDs) to identify potential issues early.
  • Maintain knowledge of product platforms and usage patterns.

What You Bring

  • Education: Bachelor’s in Computer Science, MIS, or related field (or equivalent experience).
  • Experience: 2+ years in application support; experience in development, databases, or systems administration preferred.
  • Cloud: Hands-on expertise in AWS and/or Azure (GCP a plus).
  • Languages: Skilled in one or more languages (Python, Go, Java, Ruby, JavaScript); scripting with Bash or Python.
  • Monitoring Tools: Experience with tools like Datadog, Splunk, and New Relic, including dashboard creation and performance monitoring.
  • Systems & Networking: Strong Linux/Unix skills; SQL, VPN, TCP/IP, FTP/SMTP troubleshooting.
  • Containers & IaC: Production-level experience with Kubernetes and Terraform.
  • SRE Practices: Knowledge of SLIs/SLOs/SLAs, CI/CD, and automation strategies.
  • Soft Skills: Excellent problem-solving, communication, and collaboration.
  • Mindset: Continuous improvement focus with a proactive approach to reliability.

Total Rewards

Want to love where you work? At Intrado, we offer a comprehensive benefits package that includes what you’d expect (medical, dental, vision, life, and disability coverage; paid time off; a Registered Retirement Savings Plan (RRSP) with employer matching contributions; and flexible spending accounts), plus several benefits that go above and beyond: tuition reimbursement, paid parental leave, access to a comprehensive library of personal and professional training resources, employee discounts, insurance coverage, and more! Apply today to join us in work worth doing!
