We operate at significant scale, managing: 40+ billion API hits and 120+ billion events per month , 500+ microservices in production 200+ TB of application data , 6+ PB of total storage . As we scale, our platform must evolve to support AI-enabled products, developer velocity, and efficient infrastructure usage.
We are seeking a Director – Platform Services to lead HighLevel’s evolution toward a modern, AI-enabled, developer-first platform.
This role is responsible for:
Designing and operating scalable cloud infrastructureDriving platform services that enable product teams
Leading adoption of AI capabilities across platform services
Improving developer productivity and internal tooling
Defining and executing infrastructure automation and cost strategies
This is a strategic and execution-focused leadership role, critical to HighLevel’s ability to scale efficiently, innovate faster, and operate with strong reliability and cost discipline.
Responsibilities:
Lead a team building a centralized platform (Internal Developer Portal) to standardize how engineering teams: Spin up environments , Deploy services , Manage infrastructure , Scale applications globally
Drive adoption of Platform-as-a-Product mindset, treating platform capabilities as products for internal developers.
Define platform vision and roadmap aligned with business and engineering needs.
Improve developer productivity across Dev teams: Reducing cognitive load , Standardizing workflows , Enabling self-service capabilities
Build and scale: Golden paths and service templates , Developer tooling and CLI/API interfaces , Environment provisioning systems
Build platform services and APIs that simplify complex infrastructure operations.
Promote an API-first approach to platform capabilities.
Enable developers to interact with infrastructure through standardized APIs and services.
Support scalable, consistent service deployment patterns.
Drive adoption of Infrastructure-as-Code (IaC) across the organization.
Lead efforts to automate provisioning, configuration, and operations.
Architect and oversee implementation of highly resilient infrastructure systems, including: Multi-region failover strategies , Disaster recovery and business continuity , Multi-cloud abstraction (where required)
Ensure infrastructure supports seamless application migration during outages.
Design for 99.99%+ availability across critical systems.
Define and evolve CI/CD systems to support: High-velocity development driven by AI-generated code , Progressive delivery (canary, blue-green) Automated rollback based on telemetry , Risk-aware validation and automated quality checks ,
Integrate: Policy-as-code , Security as a code , AI - Aware Validation
Ensure CI/CD becomes the primary control system for safety and reliability.
Act as the Scrum/Delivery leader for the platform organization.
Balance: Immediate operational and migration needs , long term architectural imporvements
Partner with SRE and Infra teams to ensure operational readiness.
Ensure predictable delivery of platform initiatives
Partner with Cloud Infrastructure and FinOps teams to : Optimize infrastructure utilization , Improve cost visibility and predictability , Ensure platform abstractions are cost-efficient
Balance performance, reliability, and cost.
Build and lead high-performing teams across : Platform Engineering : Cloud Intra , Dev Ex /CI/CD Pipeline , Infra Automation
Develop managers, leads, and senior ICs.
Establish clear ownership, accountability, and growth paths.
Foster a culture of innovation, reliability, and operational excellence.
Work closely with: Product Engineering teams to enable faster delivery ,
- Data and AI teams for platform integration , Security and compliance teams for governanc
Act as a key technical leader in executive discussions and decision-making.
Requirements:
Bachelor’s degree or equivalent experience in Engineering or related field.
12+ years of experience in platform, infrastructure, or cloud engineering.
Proven experience leading platform engineering and infrastructure teams at scale.
Strong hands-on experience with: Cloud platforms (GCP preferred) , Kubernetes (GKE) and distributed systems , CI/CD and DevOps practices
Experience driving developer productivity initiatives and platform adoption.
Experience defining and executing automation and infrastructure strategies.
Strong leadership, communication, and stakeholder management skills.
Experience enabling AI/ML platforms or AI product integrations
Familiarity with LLM platforms, model serving, or AI infrastructure
Experience with Internal Developer Platforms (IDP)
Experience in high-growth SaaS environments
Exposure to compliance and audit-driven environments (SOC2, IPO readiness)
What Success Looks like (First 6–12 Months)
Platform services enable faster and more reliable product development.
AI capabilities are safely and effectively adopted across teams.
Developer productivity improves measurably.
Infrastructure is highly automated and requires minimal manual intervention.
Costs (including AI workloads) are optimized and predictable.
Platform and infrastructure systems are scalable, secure, and reliable.
Youtube Channel : https://www.youtube.com/channel/UCXFiV4qDX5ipE-DQcsm1j4g

