Software: Operations & Reliability Lead

Guaynabo, Puerto Rico
Full Time
Mid Level
Role Overview
We’re looking for an experienced Operations & Reliability Lead to strengthen our monitoring, security, automation, and cloud operations. This role drives reliability, resilience, and a security‑first posture across all systems and environments.
What You’ll Do
  • Build and maintain application and infrastructure monitoring, dashboards, and automated alerts.
  • Implement cloud and on‑premises resource provisioning and enforce standardized configuration baselines.
  • Manage backup, recovery, and resilience workflows with regular testing cycles.
  • Conduct AI‑assisted performance testing, security audits, and penetration testing.
  • Coordinate with NOC and SOC to support continuous monitoring and threat detection.
  • Lead incident response, root‑cause analysis, and operational readiness activities.
  • Implement cost optimization and resource governance across cloud environments.
  • Automate operational tasks and integrate AI‑Ops capabilities.
What You Bring
  • Strong experience with monitoring tools (New Relic, Datadog, Prometheus, Azure Monitor, etc.).
  • Hands‑on expertise with cloud platforms, IaC, CI/CD, and configuration management.
  • Solid understanding of security frameworks, threat detection, and compliance.
  • Experience with backup/DR strategies and resilience best practices.
  • Strong troubleshooting, documentation, and cross‑team collaboration skills.
Valuable Extras
  • Cloud or security certifications (Azure/AWS Architect, Security+, CISSP, ITIL, SRE).
  • Experience with AI‑Ops platforms or ML‑based operational tooling.
  • Background in regulated industries.
Education & Experience
  • Bachelor's degree in Computer Science or a related field.
  • At least 2 years of experience working with systems.