Skip to main content

Senior Observability Engineer

Job ID 2024-13210 Date posted 16/12/2024 Location Bengaluru, India Category IT
Apply

Job Overview:

We are seeking a highly skilled and experienced Senior Observability and Monitoring engineer to join our diverse team of cloud and infrastructure automation engineers. In this role, you will be responsible for managing, maintaining, and operating observability and monitoring platforms that ensure the reliability, performance, and scalability of our systems.

Responsibilities:

  • Contribute to the design and develop comprehensive observability and monitoring strategies for infrastructure and sophisticated engineering systems and applications.
  • Build and manage monitoring tools and platforms such as Prometheus, Grafana, Azure Monitoring, AWS CloudWatch, Dynatrace/Datadog and similar tools that forms our AIOps stack.
  • Develop and maintain dashboards, alerts, and reports to provide real-time insights into system performance and health.
  • Collaborate with multi-functional teams to identify and resolve performance bottlenecks and reliability issues.
  • Automate monitoring and alerting processes to improve efficiency and reduce manual intervention.
  • Conduct root cause analysis of incidents and implement preventive measures to avoid recurrence.
  • Mentor and guide junior engineers in standard methodologies for observability and monitoring.
  • Stay up-to-date with the latest industry trends and technologies to continuously improve our monitoring capabilities.

Required Skills and Experience :

  • Bachelor’s degree in Computer Science, Engineering, or a related field with demonstrated ability in observability and monitoring roles.
  • Proficiency in monitoring tools and platforms such as Prometheus, Grafana, AWS CloudWatch, Azure Monitor, Datadog, Dynatrace, etc.
  • Strong understanding of cloud environments (AWS, Azure, GCP) and container orchestration (Kubernetes, Docker).
  • Experience with scripting and automation using languages such as Python, Bash, or similar.
  • Excellent problem-solving skills and attention to detail.
  • Strong communication and teamwork skills.
  • Ability to work in a fast-paced, multifaceted environment.

“Nice To Have” Skills and Experience :

  • Experience in working in a large multi-cluster Kubernetes and Cloud environment.
  • Experience working in the semiconductor industry.
  • Experience with machine learning and AI driven monitoring solutions.
  • Knowledge of CI/CD pipelines and DevOps practices.

In Return:

We offer exciting and interesting work in a diverse team. Arm's growth trajectory will ensure career progression and the opportunity to have a significant impact on our success!

#LI-KR2

Accommodations at Arm

At Arm, we want our people to Do Great Things. If you need support or an accommodation to Be Your Brilliant Self during the recruitment process, please email accommodations@arm.com. To note, by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.

Hybrid Working at Arm

Arm’s approach to hybrid working is designed to create a working environment that supports both high performance and personal wellbeing. We believe in bringing people together face to face to enable us to work at pace, whilst recognizing the value of flexibility. Within that framework, we empower groups/teams to determine their own hybrid working patterns, depending on the work and the team’s needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.

Equal Opportunities at Arm

Arm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and don’t discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Life at Arm

  • Culture at Arm
    Culture at Arm

    Make an Impact

    We, not I. This belief is at the heart of Arm's company culture and it underscores the culture of collaboration alongside individual accountability in a supportive environment working together for the success of Arm. Across our entire ecosystem, we know that when you're able to be your most brilliant self, you can do great things.
    Read more

  • Diversity, Equity and Inclusion
    Diversity, Equity & Inclusion

    This is Collective Progress

    At Arm, we're committed to inspiring revolutionary ideas in a diverse, equitable, and inclusive environment. Be your most brilliant self, and empower others, via various avenues for active participation – Employee Resource Groups (ERGs), Employee Communities, DEI working groups, and DEI Council.
    Read more

  • Benefits at Arm
    Benefits at Arm

    Benefits Designed for You

    When our employees thrive, so does Arm. Because our teams are so remarkable, we offer remarkable benefits designed to nurture the professional and personal growth of the brilliant people building the future of computing.
    Read more

Jobs for You

  • Senior Cyber Defence Operations Analyst Leading day to day detailed operations, as well as triage, investigate and respond to security incidents Cambridge, United Kingdom IT
  • Software Engineer Propel mobile GenAI performance within Arm's ML System Analysis team, identifying bottlenecks and delivering quality insights that drive SW optimisations. Cambridge, United Kingdom Machine Learning
  • SoC Performance Analysis Engineer Lead SoC performance analysis and optimization for Arm CPUs and System IP designs. San Jose, California Hardware Engineering

No previously viewed jobs

No jobs have been saved

Get Job Alerts

Can’t find the job you’re seeking? Register to be notified as soon as new jobs become available. Enter your email, select your preferred job category and/or location, then click Add to set your preferences and Subscribe to create your job alert.

Interested InSelect a job category from the list of options. Search for a location and select one from the list of suggestions. Finally, click “Add” to create your job alert.

By submitting your information, you acknowledge that you have read our privacy policy, and consent to receive email communication from Arm.

Join our Talent Community, Unlock Opportunities

Subscribe to receive Arm communications directly to your inbox. Stay connected to be the first to hear about updates from our community and exciting roles that align with your skills.

Join Now