Site Reliability Engineering Manager (SRE) at OpenLoop

Lima (Hybrid): this job is performed partly from home and partly at the office in Lima.
Expert | Full time | SysAdmin / DevOps / QA

Requires applying in English

OpenLoop is looking for a Site Reliability Engineering Manager (SRE) to join our team in Lima, Peru. This role will be a member of the Engineering Team.

About OpenLoop

OpenLoop was co-founded by CEO, Dr. Jon Lensing, and COO, Christian Williams, with the vision to bring healing anywhere. Our telehealth support solutions are thoughtfully designed to streamline and simplify go-to-market care delivery for companies offering meaningful virtual support to patients across an expansive array of specialties, in all 50 states.

Our Company Culture

We have a relatively flat organizational structure here at OpenLoop. Everyone is encouraged to bring ideas to the table and make things happen. This fits in well with our core values of Autonomy, Competence and Belonging, as we want everyone to feel empowered and supported to do their best work.

Applications are only received at getonbrd.com.

About the Role

Team Leadership & Management

  • Build, lead, mentor, and grow a team of Site Reliability Engineers
  • Conduct regular 1:1s, performance reviews, and career development planning
  • Foster a culture of learning, collaboration, continuous improvement, urgency, and overcommunication
  • Recruit, interview, and onboard new SRE team members
  • Collaborate with engineering leadership on team planning and resource allocation

Technical Strategy & Operations

  • Assess the current state of the company's applications, systems, and platforms to build an SRE roadmap and a team headcount plan
  • Define and implement SRE strategy aligned with business objectives
  • Establish and maintain SLIs, SLOs, and error budgets across all services
  • Drive incident response processes and post-mortem culture
  • Lead capacity planning and infrastructure scaling initiatives
  • Oversee monitoring, alerting, and observability implementations
  • Champion automation and infrastructure-as-code practices

Cross-Functional Collaboration

  • Partner with engineering teams to improve system reliability and deployment practices
  • Work with security teams to implement secure, compliant infrastructure
  • Collaborate with the product team to balance feature velocity with reliability
  • Engage with executive leadership on infrastructure strategy and planning
  • Coordinate with vendors and external partners on critical infrastructure components

Operational Excellence

  • Ensure 24/7 system availability and rapid incident response
  • Implement and maintain disaster recovery and business continuity plans
  • Drive cost optimization initiatives while maintaining reliability standards
  • Establish and track key reliability metrics and KPIs
  • Lead efforts to reduce toil and increase automation

Requirements

Experience

  • 8+ years of experience in infrastructure, DevOps, or Site Reliability Engineering
  • 3+ years of people management experience, preferably in technical roles
  • Proven track record of managing large-scale, distributed systems
  • Experience with incident management and post-mortem processes
  • Strong background in AWS and container orchestration

Technical Skills

  • Strong proficiency in at least one programming language (TypeScript, Python, Go, etc.)
  • Deep understanding of Linux/Unix systems and networking
  • Experience with Infrastructure as Code (AWS CDK)
  • Proficiency with monitoring and observability tools (Prometheus, Grafana, ELK, etc.)
  • Knowledge of CI/CD pipelines and deployment automation (GitHub Actions, Jenkins, etc.)
  • Understanding of database systems and performance optimization

Leadership & Communication

  • Advanced English fluency (C1)
  • Excellent verbal and written communication skills
  • Experience leading technical discussions and presenting to stakeholders
  • Ability to translate technical concepts to non-technical audiences
  • Strong problem-solving and decision-making capabilities
  • Experience with agile methodologies and project management

Desirable Skills

Additional Experience

  • Experience in high-growth startup environments
  • Background in regulated industries (healthcare, finance, etc.)
  • Experience with event-driven architecture, microservices and service mesh
  • Knowledge of security best practices and compliance frameworks
  • AWS Certified Solutions Architect or similar cloud certifications

Technical Depth

  • Experience with chaos engineering and fault injection
  • Knowledge of performance testing and load testing frameworks
  • Understanding of distributed tracing and application performance monitoring
  • Experience with configuration management tools (Ansible, Chef, Puppet)

Our Benefits

  • Contract under a Peruvian company ID ("Planilla"). You will receive all the legal benefits in Peruvian soles (CTS, "Gratificaciones", etc.).
  • Monday - Friday workdays, full time (9 am - 6 pm).
  • Unlimited Vacation Days - Yes! We want you to be able to relax and come back as happy and productive as ever.
  • EPS healthcare covered 100% with RIMAC, because you, too, deserve access to great healthcare.
  • Oncology insurance covered 100% with RIMAC.
  • AFP retirement plan to help you save for the future.
  • We’ll assign a computer in the office so you can have the best tools to do your job.
  • You will have all the benefits of the coworking space in Miraflores, Lima (free beverages, internal talks, bicycle parking, best view of the city).

GETONBRD Job ID: 55228

Wellness program: OpenLoop offers or subsidizes mental and/or physical health activities.
Life insurance: OpenLoop pays or copays life insurance for employees.
Paid sick days: Sick leave is compensated (limits might apply).
Partially remote: You can work from home some days a week.
Bicycle parking: You can park your bicycle for free inside the premises.
Health coverage: OpenLoop pays or copays health insurance for employees.
Dental insurance: OpenLoop pays or copays dental insurance for employees.
Free car parking: You can park your car for free at the premises.
Computer provided: OpenLoop provides a computer for your work.
Informal dress code: No dress code is enforced.
Vacation over legal: OpenLoop gives you paid vacation over the legal minimum.
Beverages and snacks: OpenLoop offers beverages and snacks for free consumption.

Remote work policy

Hybrid

This job is performed partly from home and partly at the office in Lima (Peru).

About OpenLoop

Servicing all 42,000 zip codes nationwide, OpenLoop accelerates the delivery of patient care by matching our trusted community of certified clinicians and insurance partners with innovators in the digital health space.
