Relativity

Returning Candidate?

Site Reliability Engineer

Site Reliability Engineer

Job Location 
US-IL-Chicago
Req. # 
2017-KV-ENG-0008
Type 
Full-Time
Department 
Engineering
Sub-Department 
Engineering Operations

MORE INFORMATION ABOUT THIS JOB

Overview

At Relativity, we make great software that helps users organize data, discover the truth, and act on it. Our product is used by more than 13,000 organizations around the world – in the cloud, on-premises, or both – to manage large volumes of data.

 

Here you can own your career in a community of values-driven people who help our customers around the world solve complex data challenges. If this sounds like the place for you, check out the details of this position below.

 

 

The Site Reliability Engineer is responsible for activities related with monitoring, operating and improving resiliency of a large distributed enterprise cloud solution. This role is also responsible with making changes directly or providing feedback and suggestions to other Engineering teams on how to make the overall system more performant and more reliable.

 

 

 

Responsibilities

The Site Reliability Engineer is responsible for delivering results for the Product Development department by:

  • Maintaining a highly-distributed system in a public cloud with distributed database, compute and storage systems
  • Contributing to a Lean (Kanban), or hybrid team to solve the operational challenges
  • Deploy changes into testing and production environments
  • Provide feedback at Change Advisory Board (CAB) meetings regarding upcoming changes and changes that have been implemented
  • Provide feedback to Engineering teams regarding areas of the software that require more monitoring\alerting capabilities as well as can be engineered to be more resilient
  • Track changes to the system in the Change Management Database
  • Following practices and procedures that adhere with industry best practices for operating a large-scale infrastructure and software system
  • Collaborate with software development teams to understand new features being delivered to the cloud solution and gain an understanding of how to monitor\operate
  • Continuously improve monitoring and alerting capabilities of the system as well as make changes to make the application and infrastructure more resilient
  • Support Problem and Incident managers by providing information regarding trends of reoccurring issues within the application and cloud infrastructure

 

In addition to the above responsibilities, the Site Reliability Engineer is expected to display professionalism in the following ways:

  • Maintain an attitude of commitment through outward display of willingness
  • Practice positive interactions - lean on encouragement in place of judgment
  • Impress responsibility on others by displaying ownership in tasks
  • Act in the interest of the overall team and our customers
  • Understand the needs of our customers

Qualifications

  • Experience working in an Operations Center
  • Experience supporting public cloud based infrastructure
  • At least one year of experience with Windows Server, Linux, IIS and SQL Server experience; designing and deploying systems from the ground up, with knowledge and experience deploying and provisioning storage and networking
  • Experience with storage knowledge required
  • Cloud Services – Knowledge around MS Azure and other cloud offerings is a plus
  • OS / Software – Microsoft Windows Server, Linux, Internet Information Services, MS SQL Server, and typical back-office product knowledge
  • Automation – Powershell, Chef, Python experience to help with automating repeatable tasks
  • VMware – vCenter, ESXi, vCloud Automation Center
  • Storage – General knowledge of iSCSI vs Fiber Channel, NAS, SAN, DAS, local
  • Networking – General networking knowledge, VLANs, routing, VMware based switching, and firewall concepts
  • Ability to maintain a calm demeanor when things are going wrong to troubleshoot issues effectively
  • A big picture mentality around solutions architecture and a Keep It Simple philosophy
  • AWS, Azure, or VMware certification a plus
  • Excellent communication and inter-personal skills, including the ability to communicate difficult technical concepts in a straight-forward, simple, manner

Minimum Qualifications:

  • Bachelor’s Degree or equivalent in Computer Science or related disciplines
  • 2 + years of experience with scripting and automation languages (Powershell, Chef, Ruby, Python, etc.)
  • 2 + years of supporting customer facing web delivered software
  • 1 + years of cloud experience
  • Experience with SQL Server and No SQL Systems (Elastic, Mongo, Cassandra, etc.)

About Us

Our software has more than 150,000 active users in more than 40 countries from organizations including the U.S. Department of Justice, more than 70 Fortune 100 companies, and more than 195 of the Am Law 200. We have grown significantly over the last several years and continue striving to build software that helps solve our customers’ toughest e-discovery and unstructured data challenges.

 

If you’re ready to grow with us, we’d love to hear from you.

 

#LI-KV1

ABOUT KCURA

Share on your newsfeed

Connect With Us!

Not ready to apply? Connect with us for general consideration.