Job Overview:
We are seeking a highly skilled and experienced Senior Observability and Monitoring engineer to join our diverse team of cloud and infrastructure automation engineers. In this role, you will be responsible for managing, maintaining, and operating observability and monitoring platforms that ensure the reliability, performance, and scalability of our systems.
Responsibilities:
- Contribute to the design and develop comprehensive observability and monitoring strategies for infrastructure and sophisticated engineering systems and applications.
- Build and manage monitoring tools and platforms such as Prometheus, Grafana, Azure Monitoring, AWS CloudWatch, Dynatrace/Datadog and similar tools that forms our AIOps stack.
- Develop and maintain dashboards, alerts, and reports to provide real-time insights into system performance and health.
- Collaborate with multi-functional teams to identify and resolve performance bottlenecks and reliability issues.
- Automate monitoring and alerting processes to improve efficiency and reduce manual intervention.
- Conduct root cause analysis of incidents and implement preventive measures to avoid recurrence.
- Mentor and guide junior engineers in standard methodologies for observability and monitoring.
- Stay up-to-date with the latest industry trends and technologies to continuously improve our monitoring capabilities.
Required Skills and Experience :
- Bachelor’s degree in Computer Science, Engineering, or a related field with demonstrated ability in observability and monitoring roles.
- Proficiency in monitoring tools and platforms such as Prometheus, Grafana, AWS CloudWatch, Azure Monitor, Datadog, Dynatrace, etc.
- Strong understanding of cloud environments (AWS, Azure, GCP) and container orchestration (Kubernetes, Docker).
- Experience with scripting and automation using languages such as Python, Bash, or similar.
- Excellent problem-solving skills and attention to detail.
- Strong communication and teamwork skills.
- Ability to work in a fast-paced, multifaceted environment.
“Nice To Have” Skills and Experience :
- Experience in working in a large multi-cluster Kubernetes and Cloud environment.
- Experience working in the semiconductor industry.
- Experience with machine learning and AI driven monitoring solutions.
- Knowledge of CI/CD pipelines and DevOps practices.
In Return:
We offer exciting and interesting work in a diverse team. Arm's growth trajectory will ensure career progression and the opportunity to have a significant impact on our success!
#LI-KR2
Accommodations at Arm
At Arm, we want our people to Do Great Things. If you need support or an accommodation to Be Your Brilliant Self during the recruitment process, please email accommodations@arm.com. To note, by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.
Hybrid Working at Arm
Arm’s approach to hybrid working is designed to create a working environment that supports both high performance and personal wellbeing. We believe in bringing people together face to face to enable us to work at pace, whilst recognizing the value of flexibility. Within that framework, we empower groups/teams to determine their own hybrid working patterns, depending on the work and the team’s needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.
Equal Opportunities at Arm
Arm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and don’t discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.