Company and Role Overview

Stavvy is working to transform the lending process for borrowers, their lenders, and the vendors used throughout the process. Whether we are working to enable real estate title companies to facilitate remote closings in a safe way or better connecting lenders with the businesses they use during the lending process, people at Stavvy are disruptors at heart. Our team is constantly iterating, solving problems, and working together to truly connect all stakeholders throughout the lending process in meaningful ways.

We are looking for a Senior Site Reliability Engineer to help us shape our infrastructure and build the foundation our team relies on for the rapid delivery of our product. We'll depend on you to instill best practices for building scalable distributed systems, with an emphasis on observability and fault tolerance. Our stack is managed by Terraform and consists of technologies such as Python, PostgreSQL, Redis, and JavaScript, but we're also open to using the best tool for the job. If you thrive in an early startup environment, or are simply looking for more ownership and the ability to have a large impact, we would love to meet you.

What You’ll Do

  • Collaborate with product managers and devs to automate our delivery process
  • Manage our AWS and GCP infrastructure, with an emphasis on configuration as code
  • Improve monitoring and alerting strategies, and work with devs to improve performance and reliability
  • Help build our on-call policies and run books
  • Participate in design reviews
  • Take ownership of projects and demonstrate a high level of accountability

What You’ll Need

  • 5+ years of experience building and maintaining cloud infrastructure for distributed production systems
  • 5+ years of experience as a backend engineer developing enterprise web applications
  • Proficiency in Bash, Python, or other scripting languages
  • Demonstrated experience with configuration and orchestration tools such as Terraform, CircleCI, and Docker
  • Expert in AWS, bonus points if you have experience with GCP
  • Familiarity with administering Linux systems and networks
  • Experience operating and tuning data stores such as Redis and PostgreSQL
  • Experience managing streaming infrastructure such as Redis
  • Familiarity with managing and configuring networking, load balancers, and proxies such as NGINX
  • Bonus points for experience with ELK