Software Engineer (L4/5) - Telemetry Collections - Worldwide

22 days ago


Full Time

Who We Are

Netflix is the world's leading streaming entertainment service with over 182 million paid memberships in over 190 countries enjoying TV series, documentaries, and feature films across a wide variety of genres and languages.We deploy hundreds of microservices, across multiple regions, tens of thousands of instances, and have millions of client devices with our software installed. That's a lot of infrastructure and software that collectively work together to give customers an end product they love to use. Ensuring availability and reliability across that scale is a task that's taken on by our amazing Netflix engineers through full operational ownership of their software.Our team, Telemetry Engineering, builds software to provide real-time operational insight to our 1000’s of engineers and teams across Netflix. This requires collecting, streaming, and persisting operational data, HUGE amounts of it, and making that data accessible through flexible APIs and visualizations.

The Opportunity

Our team is looking for a Telemetry Collections Engineer familiar with Java and Node.js, our two primary programming languages, to help design the user experience design and implementation for the telemetry collection at Netflix. The goal is to deliver a consistent and dependable development and operations experience for users of the metrics, tracing, and logs platforms. You will collaborate with both users and developers of the core telemetry libraries and collectors. This role will be part engineer, part DevOps engineer, and part support engineer, with a focus on enabling the success of other engineers at Netflix when they use our tools and SDKs to build and operate applications.You should be able to understand and empathize with the customer development and operating experience, so that you can leverage these inputs in your designs. You should be able to determine and establish consistent data reporting formats across the tools, in order to support the creation of managed experiences.You should be able to read existing library and collector code bases, so that you can submit pull requests to drive your recommendations forward. You should be able to write and maintain highly performant collectors that can run across the entire Netflix fleet. You should be comfortable prototyping new open-source telemetry libraries, in order to determine whether or not they will be useful to the Netflix ecosystem, and conducting performance testing to validate whether or not they can sustain the sampling rates that will be required. You should be ready to provide consulting and support to both data producers and consumers as a part of maintaining data quality across telemetry platforms.One of your outputs will be documentation and training materials for the library and collector ecosystem, intended to ease on-boarding for new engineers and to provide useful references for experienced users of these tools.You will be in charge of investigating and determining the best strategy for instrumenting third-party services both inside and outside of the protected network, and ensuring that Netflix engineers can enjoy a full observability experience across all of the services they run, wherever they are located.If you enjoy working in a unique culture of Freedom Responsibility and building systems at scale that are critical to delivering the Netflix streaming experience, then come join our team!

What You Bring to the Table

  • Expertise building consistent and reliable client libraries, with the ability to manage change responsibly across version releases.
  • Knowledge of the Java and Node.js programming languages and ecosystem. Other languages, such as Go or Python, are a plus.
  • Experience with instrumenting the collection of metrics, tracing and logs in applications and the ability to form opinions on what a streamlined developer experience should look like.
  • Strong Cloud/DevOps skills to help our team successfully roll out changes to our Telemetry systems for users.
  • eBPF knowledge and experience building eBPF programs is a strong plus
  • A positive attitude and the ability to empathize with the customer experience, while finding reasonable solutions that drive the state-of-the-practice of telemetry collection forward to a better place.

Sharing Is Caring

In this group, you'll have a chance to create software that is state-of-the-art and foundational. Because of Netflix's desire to share technology and concepts, you'll be in the rare position of both working on this and sharing this knowledge with your peers outside Netflix. We believe this is unique to Netflix, and if it sounds amazing to you, we should talk.Our culture is unique, and we live by our values, allowing you to do your best work and grow. To learn more about Productivity Engineering, listen to this podcast. We are an equal opportunity employer and celebrate diversity, recognizing that diversity of thought and background builds stronger teams. We approach diversity and inclusion seriously and thoughtfully. We do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.At Netflix, we carefully consider a wide range of compensation factors to determine your personal top of market. We rely on market indicators to determine compensation and consider your specific job family, background, skills, and experience to get it right. These considerations can cause your compensation to vary and will also be dependent on your location. The overall market range for roles in this area of Netflix is typically $300,000 - $900,000This market range is based on total compensation (vs. only base salary), which is in line with our compensation philosophy. Netflix is a unique culture and environment. Learn more here.

Originally posted on Himalayas