Staff Engineer - Distributed Systems

Full Time
Los Angeles, CA, USA
5 months ago
Staff Engineer - Distributed Systems

Location- (100% remote within the US or Canada)

The proliferation of machine log data has the potential to give organizations unprecedented real-time visibility into their infrastructure and operations. With this opportunity comes tremendous technical challenges around ingesting, managing, and understanding high-volume streams of heterogeneous data. The Data Collection team owns the ingestion pipeline -- starting with a lightweight agent to collect, compress, encrypt, and ship the data back to the Sumo Logic cloud.

As a Staff Backend Engineer on the Data Collection team, you will be responsible for designing and implementing advanced mechanisms to collect massive amounts of machine-generated data from heterogeneous systems in real-time. You will build asynchronous systems with high levels of concurrency, multithreading, and parallel programming. The Data Collection team is responsible for managing the data collection infrastructure and collection agents. Individual agents collect at rates of tens of thousands of events per second.

You are a strong software engineer who is passionate about large-scale systems. You care about producing clean, elegant, maintainable, robust, well-tested code; you do this as a member of a team, helping the group come up with a better solution than you would as individuals. Ideally, you have experience with performance, scalability, and reliability issues of 24x7 commercial services.

Responsibilities:

  • Design and implement extremely high-volume, fault-tolerant, scalable backend systems that process and manage petabytes of customer data.
  • Analyze and improve the efficiency, scalability, and reliability of our backend systems.
  • Write robust code; demonstrate its robustness through automated tests.
  • Work as a member of a team, helping the team respond quickly and effectively to business needs.
  • Help manage exabytes of data using the latest and greatest technologies such as Kafka, Mesos, Spark and Docker!

Requirements:

  • BTech., B.S., M.S., or Ph.D. in Computer Sciences or related discipline
  • 8+ years of industry experience with a proven track record of ownership and delivery.
  • Object-oriented experience, for example in Java, Scala, Ruby, or C++.
  • Understand performance characteristics of commonly used data structures (maps, lists, trees, etc).
  • Desire to learn Scala, a JVM language (scala-lang.org).
  • Prior experience building and shipping a production platform/service
  • Assembling and owning a large-scale distributed service platform.

Desirable:

  • Experience in multi-threaded programming and distributed systems is highly desirable.
  • Experience in big data and/or 24x7 commercial service is highly desirable.
  • Domain expertise in the development of large-scale distributed systems and infrastructure
  • You should be happy working with Unix (Linux, OS X).
  • Agile software development experience (test-driven development, iterative and incremental development) is a plus.

About Us:

Sumo Logic, Inc., empowers the people who power modern, digital business.  Sumo Logic enables customers to deliver reliable and secure cloud-native applications through its SaaS analytics platform. The Sumo Logic Continuous Intelligence Platform™ helps practitioners and developers ensure application reliability, secure, and protect against modern security threats, and gain insights into their cloud infrastructures. Customers worldwide rely on Sumo Logic to get powerful real-time analytics and insights across observability and security solutions for their cloud-native applications. For more information, visit www.sumologic.com.

The expected annual base salary range for this position is $190,000-$220,000 + 15% bonus. In addition to base pay, certain roles are eligible to participate in our bonus or commission plans, as well as our benefits offerings, and equity awards. Compensation varies based on a variety of factors which include (but aren’t limited to) role level, skills and competencies, qualifications, knowledge, location, and experience.

Other details

  • Health, Dental, Vision- Insurance
  • 401k and Life Insurance options
  • Unlimited PTO with 15+ days of recognized holidays
  • Quarterly Wellness days
  • 100% remote with the option to be in the office if you want (Bay Area, Austin, Denver, NYC)
  • 3 months of paid parental leave

#LI-DNI