Site Reliability Specialist

Apply now »

Date: Jul 23, 2022

Location: Ottawa, ON, CA, K2K 3N5

Company: Innovapost

Who is Innovapost?

Great question! We are the technology arm of the Canada Post Group of companies. This includes Canada Post, Purolator, and SCI. By joining us you will be able to make a positive impact on how every Canadian deliver and receives their packages and mail. Next time you see your neighbor picking up their mail and receiving a package, you will be able to say “you’re welcome!”

Why join Innovapost?

As Technology evolves constantly, we invest in our High Performing Workforce with the Skills of Tomorrow through continuous learning and career growth within Innovapost, including access to LinkedIn Leaning, the new Degreed capability and a Leadership Developmental Program. We invest in Diversity, Inclusion and Belonging, and provide continuous employee engagement to grow our business together.      

What role you will play?

The Site Reliability Engineer (SRE) and team are responsible for the improvement of Innovapost’s production services availability, scalability, performance, and reliability. The team proactively looks for ways to improve application monitoring, address production issues and investigate and assist with customer inquiries. They will provide tools, systems, and knowledge to teams that will help them improve productivity, reduce downtime, and speed up code turnaround. Their contributions will help Development, Operations, and Support teams spend less time working on problems, and more time building new features and services.

Responsibilities?

Designs, develops, deploys and maintains complex IT solutions to make systems more performant, cost-effective and improve organizational efficiency

Develop & Deploy – Builds and supports the application environment; leads the infrastructure-as-code approach with expertise in both data and operations. Implements applications in Azure or other cloud providers including solving problems and analyzing issues. Strong desire to introduce automation to replace voperations. Improves the whole lifecycle of applications and cloud services, from inception and design, through deployment, operation, and refinement

Automate – Leads the process of streamlining the application of continuous delivery by automating the process from development to deployment, ensuring that only highest quality code is delivered. Develops automations to enhance operational capabilities in supporting cloud-based solutions

Monitor – Develops tools to monitor and improve the reliability, and performance of critical platforms.

Identifies and resolves highly complex production problems for support of applications; identifies root cause and defining actions to eliminate recurrence. Maintains cloud services by measuring and monitoring availability, latency, and overall system health.

Other Responsibilities

Be Curious & Innovative – Explores industry trends and best practices; researches and evaluates next generation technologies, applications, hardware and software, to enhance and improve infrastructure and support processes

Be a Team Player and an Effective Communicator – Collaborates with the development, infrastructure and management teams to design efficient, scalable and robust solutions. Works in a collaborative team-oriented environment, shares information, values diverse ideas, interacts with customers, partners with cross functional and remote teams. Articulates technical issues, and shares ideas, participates in peer discussions and solutions.

Experience

Minimum 7+ years of experience with enterprise-grade infrastructures, Cloud, operations and / or systems engineering

Experience developing, deploying, and operating applications and infrastructure in Azure and other cloud providers

Monitoring and logging experience using tools such as Application Insights, Log Analytics, Azure and other Cloud Monitoring tools

Experience with container and orchestration technologies - Docker, Kubernetes

Knowledge?

Understanding of networking: VNETs, NSGs, Load Balancers, DNS Configuration, certificate management, Application gateway and Network watcher

Familiarity with infrastructure as code concepts

Familiar with the principles of site reliability engineering, such as eliminating toil, embracing risk, simplicity, and release management

Familiarity with and passion for software engineering best practices such as testing, continuous integration, continuous delivery, security, performance, and disaster recovery

Strong scripting and automation skills

Knowledge in YAML and JSON

Familiarity with DevOps, CI/CD/CT – Azure DevOps

Mandatory Skills

Cloud Computing

Cloud Architecture

DevOps

Microsoft Azure

Amazon Web Services

Release Management

Disaster Recovery Planning

Configuration Management

Nice to Have

More Information

Computer Science or Engineering degree

The work we do at Innovapost impacts every Canadian. To work with us, you must be eligible to obtain a Canada Post Reliability Security Clearance.

Good luck! We are looking forward to meeting you, but unfortunately, we will not be able to meet all of you!

Competencies

Be Courageous
Be Curious
Be Inclusive
Innovate
Inspire