Sr. Reliability Engineer – 8087

  • Remote
  • Applications have closed

Raeder Landree, Inc.

Finders of Keepers

The Senior Reliability Engineer will drive the technical team efforts and work with other Reliability Engineers (RE), Product Managers, Software Engineers, and Architects to produce mission-critical infrastructure, tools, performance improvements, actionable and meaningful performance measurements, and communication to stakeholders. The Senior RE is expected to work with management, peers, and customers to define and implement the technical vision, improve monitoring tools, error detections, defects elimination while improving Mean Time to Detection/Resolution, and overall service availability and customer satisfaction. The RE role provides an opportunity to blend system design and software engineering skills with passion for troubleshooting and defects elimination to address an ever-changing applications and environments with scalability and reliability challenges. Responsibilities: Improving and developing reliability platform, building out custom tools, infrastructure, and services. Automation of manual tasks to reduce toil. Perform engineering and technical tasks as assigned by applying general engineering principles. Perform independent research in support of technical tasks. Contribute positively to open-source projects and join existing communities. Navigate this broader ecosystem and structure projects with upstream/ downstream opportunities in mind. Participate in an on- call rotation, have strong written communication skills, and be able to develop working relationships with coworkers. Provide technical expertise and consultation through direct involvement to identify and resolve problems. Work frequently with Product teams on shared goals and cross-team projects. Bring experience, pragmatism, empathy, and composure to interactions with teams outside of the RE organization. Work frequently with Product teams on shared goals and cross-team projects. Balance planned and reactive work using basic project planning techniques and technical roadmaps. Experience negotiating SLIs, SLOs, and SLAs with product owners. Supervise service reliability, metrics, sustainability, technical debt, and operational toil for live services running at scale. Work across multiple project teams simultaneously to support rapid development efforts. Identify and integrate with third-party solutions where it makes the most sense. Use data to understand the availability, reliability, and sustainability of our software. Bachelor’s Degree in Computer Science, Software Engineering, Information Systems 3-5 years of experience Valuable Technologies Like: Cloud computing, Web Services, Kubernetes, (Repository Management git/svn/), Ansible, Terraform, Virtualization, Docker Containers, Kafka, RabbitMQ, Redis, Netbox, Akamai/Apigee Valuable Methodologies Like: Agile, SCRUM, Reliability Engineering, 12 factor apps, microservice architecture, public cloud architecture Valuable Languages Like: React, Node, Kotlin, C#, Java, JavaScript, Linux shell, Powershell, SQL, HTML, CSS Valuable Databases/OS Systems Like: Non-relational databases (NoSQL, Elasticsearch, CosmosDB), MySQL, Postgres, SQLServer, Oracle, DB2, Windows, Linux Valuable Observability Tools Like: Grafana, Prometheus, Victoria Metrics, Elasticsearch, Azure Monitor, APM Tools (NewRelic, DataDog, AppDynamics, etc) Service Management Tools Like: Jira, Pivotal Tracker, Xmatters Intellectual curiosity, problem solving, and openness is key to its success. Mindset for solving production systems issues and understanding root cause.

 

Please attach resume or CV and indicate preferred contact information.