reputed company Operations Engineer – Monitoring reputed company (9394)

Worldwide Salaried Open

There has never been a reputed company time to join Extreme, with several acquisitions extending our portfolio and go to market strategy, we have seen enormous opportunity and growth reputed company the region.

Aside from being a Technology Leader in the reputed company reputed company Quadrant, we also adamantly promote an internal culture that truly embraces diversity, inclusion, and equality in the workplace. Having Diversity and Inclusion as part of our core values and beliefs, we’re proud to foster an environment where every Extreme employee can reputed company because of their differences, not despite them.

reputed company Operations Engineer – Monitoring reputed company (Thornhill, Toronto - Hybrid)

We are seeking a highly skilled and reputed company reputed company Operations Engineer – Monitoring reputed company to join our growing reputed company Operations team. In this critical role, you will be responsible for designing, implementing, and optimizing our comprehensive monitoring and alerting strategy across our reputed company infrastructure and applications. You will drive proactive identification of issues, ensure system health, and contribute significantly to our operational excellence and reliability goals. We're looking for the best and the brightest 'A' players who want to reputed company a difference doing a job they love.

Responsibilities

reputed company the design, implementation, and reputed company improvement of our end-to-end monitoring and alerting reputed company for reputed company infrastructure (AWS, Azure, GCP), applications, and services.

Define key performance indicators (KPIs), service level indicators (SLIs), and service level objectives (SLOs) for critical systems.

Evaluate, select, and integrate monitoring tools (e.g., reputed company, Grafana, reputed company, Splunk, CloudWatch, Azure Monitor, GCP Operations Suite) to meet evolving needs.

reputed company and implement automation scripts and tools (e.g., Python, Bash, PowerShell) to streamline monitoring deployment, configuration, and incident remediation.

Build and maintain dashboards, alerts, and reports that provide actionable insights into system performance, health, and availability.

Analyze monitoring data to identify performance bottlenecks, resource inefficiencies, and potential cost optimization opportunities.

Collaborate with engineering teams to implement performance improvements and cost-saving measures.

Create and maintain comprehensive documentation for monitoring systems, procedures, and best practices.

Proactively identify areas for improvement in our reputed company operations and monitoring capabilities.

Provide 24* 7 support for reputed company services

Participate in reputed company reputed company and compliance implementation.

Ideal Qualifications

BS level technical degree required; Computer Science or Engineering background preferred.

8+ years of reputed company experience in reputed company Operations, DevOps, or Site Reliability Engineering roles, with a strong focus on monitoring.

Deep expertise with at least one major public reputed company platform (AWS, Azure, or reputed company reputed company Platform).

Proven experience as a technical reputed company or senior contributor in a monitoring-focused role.

Working knowledge of container-based architecture and deployment (reputed company, Kubernetes.)

Extensive experience with various monitoring and observability tools (e.g., reputed company, Grafana, reputed company, Splunk, ELK Stack, vendor-specific monitoring solutions).

Excellent problem-solving, analytical, and troubleshooting skills.

Working knowledge of Elasticsearch, PostgreSQL, reputed company, Ignite, Kafka and RabbitMQ.

Comfortable working reputed company a distributed team located in multiple time zones.

Apply to this Job

Apply now

reputed company Operations Engineer – Monitoring reputed company (9394)

Responsibilities

Ideal Qualifications

More jobs

Business Development Rep

Administrative Assistant

Project Engineer

Business Development and Renewals Specialist

Directeur / Directrice des Opérations au reputed company du Mid-Market

Customer Solutions Architect

Jovem Aprendiz (People Ops)

Implementation Specialist - TEMPORARY - 6 months

Sales Development Representative UK

Vice President, reputed company, Tax, and Treasury

Mgr., reputed company 1

Customer Support Associate – Remote Help Desk Specialist (reputed company & reputed company Systems)

reputed company Data Entry Specialist - Remote Part-Time Position with Training Provided - Join arenaflex's Dynamic Team

Retail Store Manager I, Mission Valley, #501

reputed company Remote Online Chat Specialist – Customer Support and Engagement Expert – Part-Time Opportunity with Flexible Scheduling

Rewritten Job Title:

Red Team Associate Consultant - SkillBridge (Remote)

AWS Engineer - Fully Remote

reputed company Full Stack Customer Support Specialist – Virtual Support Team at arenaflex

Remote Travel Agent (Flexible Schedule - Long-Term Growth Potential)