Blackpoint Cyber Logo

Blackpoint Cyber

Director of SRE

Posted 5 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in Canada
Expert/Leader
Remote
Hiring Remotely in Canada
Expert/Leader
As Director of SRE, lead infrastructure and reliability efforts, focusing on cloud scalability, cost optimization, and team management to ensure efficient and secure services at Blackpoint Cyber.
The summary above was generated by AI

Blackpoint Cyber is the leading provider of world-class cybersecurity threat hunting, detection and remediation technology. Founded by former National Security Agency (NSA) cyber operations experts who applied their learnings to bring national security-grade technology solutions to commercial customers around the world, Blackpoint Cyber is in hyper-growth mode,  fueled by a recent $190m series C round. 

Why Blackpoint?

Ready to give some hackers hell? At Blackpoint Cyber, we fight unfair fights, eliminating threats before they strike. Built by former US Department of Defense and Intelligence security experts, our mission is to provide absolute and unified Managed Detection and Response (MDR) services to organizations worldwide.

Company Culture

We value high-quality execution, ownership, and integrity—principles that are never compromised. Our team is collaborative, energetic, and thrives in a high-performance culture, continuously growing by tackling the toughest challenges in cybersecurity.

What You'll Do

As a Director of SRE, you will lead the infrastructure, reliability, and cost optimization efforts for Blackpoint Cyber’s mission-critical services. You will be responsible for ensuring the scalability, availability, and efficiency of our cloud infrastructure, while also optimizing COGS (Cost of Goods Sold) to maintain financial efficiency.

This role requires strong leadership, hands-on infrastructure expertise, and a deep understanding of cost-effective scaling strategies. You will work closely with engineering, security, and product teams to ensure our systems are resilient, secure, and cost-efficient.

Key Responsibilities

Infrastructure & Reliability

  • Lead the design, implementation, and management of scalable, reliable, and highly available cloud-based infrastructure (AWS/Azure/GCP).

  • Establish SRE best practices, including monitoring, incident response, capacity planning, and performance tuning.

  • Improve observability, monitoring, and alerting, ensuring quick detection and resolution of reliability issues.

  • Drive automation-first approaches, reducing manual intervention through Infrastructure-as-Code (IaC) and CI/CD pipelines.

  • Lead a team of SREs, applying Blackpoint Cyber's management values of Coach, Model, Care, in defining business-critical outcomes, creating action plans, and supporting the team in achieving them.

  • Continue hands-on contributions in an SRE role

  • Design, implement, and support key infrastructure, including automated attack infrastructure deployment, isolated identity and productivity environments, and secure data storage.

  • Establish and apply security hygiene and monitoring policies to meet Blackpoint Cyber security requirements.

COGS Optimization & Cost Efficiency

  • Monitor and optimize cloud spending, ensuring cost-effective resource utilization without compromising reliability.

  • Define and implement cost-saving strategies (e.g., right-sizing instances, leveraging spot instances, optimizing storage, etc.).

  • Work closely with finance and procurement teams to forecast infrastructure costs and align expenses with business objectives.

Leadership & Collaboration

  • Manage and mentor a team of SREs, DevOps engineers, and cloud infrastructure specialists.

  • Partner with engineering teams to design reliable and scalable architectures, embedding reliability into development workflows.

  • Collaborate with security teams to ensure compliance, security hardening, and disaster recovery readiness.

  • Drive post-incident reviews, ensuring continuous improvement in system resilience.

What You Bring

Must-Have Qualifications

  • 10+ years of experience in SRE, DevOps, or Cloud Infrastructure roles.

  • 5+ years of experience in people management, leading SRE team.

  • Strong experience with AWS, Azure, or GCP, with expertise in cost management and scaling strategies.

  • Proficiency in Infrastructure-as-Code (IaC) (e.g., Terraform, CloudFormation, Pulumi).

  • Hands-on experience with CI/CD pipelines, Kubernetes, and container orchestration.

  • Expertise in monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, Splunk).

  • Proven ability to optimize cloud costs (COGS) while maintaining reliability and performance.

  • Strong leadership, collaboration, and problem-solving skills.

  • Experience with SLA/SLO/SLIs will be valuable.

  • "Engineering efficiency" through self-serve tooling.

Nice-to-Have

  • Experience working in a cybersecurity or high-security environment.

  • Understanding of compliance frameworks (SOC2, ISO 27001, FedRAMP, etc.).

  • Knowledge of serverless architectures and edge computing.

  • Experience working with FinOps teams to manage cloud costs effectively.

Blackpoint Cyber welcomes and encourages applications from qualified individuals of all races,  colors, religions, sex, sexual orientation, gender identity or expression, national origin, age, marital  status, or any other legally protected status. We are committed to equality of opportunity in all  aspects of employment.  For eligible employees in the US, Blackpoint offers competitive Health, Vision, Dental, and Life Insurance plans, a robust 401k plan, Discretionary Time Off, and other minor perks.

Top Skills

AWS
Azure
Ci/Cd
CloudFormation
Datadog
GCP
Grafana
Kubernetes
Prometheus
Pulumi
Splunk
Terraform

Similar Jobs

2 Hours Ago
Easy Apply
Remote or Hybrid
Canada
Easy Apply
Senior level
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
As a Staff Machine Learning Engineer, you'll lead AI initiatives using large-scale data, optimize ML models for edge devices, and collaborate with cross-functional teams.
Top Skills: C++PythonRayRustSpark
3 Hours Ago
Remote
2 Locations
Senior level
Senior level
Artificial Intelligence • Productivity • Software • Automation
Zapier seeks an Engineering Manager to lead the AI Capabilities team, focusing on building a robust AI platform with reusable APIs and high-quality features. Responsibilities include leading engineers, making architectural decisions, and enhancing development processes to improve user experiences and align with company goals.
Top Skills: DjangoNode.jsPythonReactTypescript
3 Hours Ago
Remote
2 Locations
Senior level
Senior level
Artificial Intelligence • Productivity • Software • Automation
Lead a team of engineers to design and implement AI features, improve user experience, and drive automation solutions with a focus on user needs and experimentation.
Top Skills: DjangoNode.jsPythonReactTypescript

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account