[Remote] Staff Software Engineer - ML Observability

Remote, USA | Full-time | Posted 2026-06-16

Note: This is a remote position open to candidates in the USA. Datadog is the leading observability and security platform for the AI era, providing businesses with unified visibility across the technology stack. As a Staff Engineer, you will lead the development of new features within Datadog’s LLM Observability product, shaping product direction and driving experimentation to solve complex problems in AI systems.


Responsibilities

  • Drive design and implementation of LLM observability features
  • Ideate, prototype, and scale new product features to provide insights and drive improvements for generative AI systems
  • Work cross-functionally with other engineering teams, product, UX, and applied science to iterate fast and find product-market fit
  • Develop and extend tools for tracing, evaluating, and debugging LLMs
  • Influence architecture decisions and mentor engineers to build resilient, high-performance systems
  • Stay close to customer pain points and use those insights to guide product and engineering priorities
  • Stay current with industry trends and advancements in machine learning and observability, driving innovation within the team

Skills

  • You have a BS/MS/PhD in Computer Science, Engineering, or a related scientific field, or equivalent experience
  • Deep understanding of distributed systems and scalable backend architectures
  • Hands-on experience building and shipping LLM-powered or GenAI applications
  • Understanding of model internals, inference pipelines, evaluation techniques, and prompt engineering
  • Ability to thrive in ambiguous, fast-changing spaces, with a product-oriented mindset
  • You're excited to shape the next generation of AI observability tools from the ground up
  • You communicate clearly, think rigorously, and take pride in clean, maintainable code
  • Experience with observability tools/platforms

Benefits

  • Competitive salary and equity package
  • May include variable compensation
  • Healthcare
  • Dental
  • Parental planning
  • Mental health benefits
  • A 401(k) plan and match
  • Paid time off
  • Fitness reimbursements
  • A discounted employee stock purchase plan
  • Competitive global benefits
  • Continuous professional development

Company Overview

  • Datadog is an observability and security platform that offers infrastructure, applications, software development, and monitoring services. It was founded in 2010, and is headquartered in New York, New York, USA, with a workforce of 1001-5000 employees. Its website is https://www.datadoghq.com.

Company H1B Sponsorship

  • Datadog has a track record of offering H1B sponsorships, with 8 in 2026, 123 in 2025, 66 in 2024, 45 in 2023, 53 in 2022, 31 in 2021, and 29 in 2020. Please note that this does not guarantee sponsorship for this specific role.
