About the Company:
At Coupang we are building the future of ecommerce. Born out of an obsession to make shopping, eating, and living easier than ever, we’re collectively disrupting the multi-billion-dollar e-commerce industry from the ground up. We exist to wow our customers. We know we’re doing the right thing when we hear our customers say, “How did we ever live without Coupang?” We are one of the fastest-growing e-commerce companies that established an unparalleled reputation for being a dominant and reliable force in South Korean commerce.
We are proud to have the best of both worlds — a startup culture with the resources of a large global public company. This fuels us to continue our growth and launch new services at the speed we have been since our inception. We are all entrepreneurial surrounded by opportunities to drive new initiatives and innovations. At our core, we are bold and ambitious people that like to get our hands dirty and make a hands-on impact. At Coupang, you will see yourself, your colleagues, your team, and the company grow every day.
Our mission to build the future of commerce is real. We push the boundaries of what’s possible to solve problems and break traditional tradeoffs. Join Coupang now to create an epic experience in this always-on, high-tech, and hyper-connected world.
About the Role:
Site Reliability Engineers (SREs) at Coupang is a mission-critical role which combines software and system engineering to build, run and scale our complex, large-scale ecommerce systems. As part of the Site Reliability Engineering team, you will be responsible for ensuring all our customer facing services are healthy, monitored, automated, and designed to scale. As SRE organization we take pride in handling “operations as an engineering” problem with automation first approach. You will use your background to build best in class infrastructure automation for areas such as Observability, Incident management, Disaster Recovery, Load testing, Capacity engineering and many more. In this role you will work very closely with our product development teams from an early stage of design to all the way helping resolve any production incidents, maintaining SLI/SLA bar for production services and influencing them with SRE principles and best practices. If you take pride in complete ownership, have a passion for solving complex technical challenges for large scale distributed systems and demeanor to work and communicate effectively across team boundaries, this is the role for you!
Key Responsibilities:
- Serve as a primary point responsible for the reliability, health, and performance of all Coupang customer-facing services.
- Gain deep knowledge of Coupang application workflow and dependencies.
- Define and track key performance indicators (KPIs) and service-level objectives (SLOs) related to system availability, performance, and reliability.
- Build world class incident management process and automation, including fast incident remediation, incident operational reviews and retrospectives.
- Develop and implement best practices for creating and maintaining effective monitoring, alerting, and telemetry systems.
- Build automation to execute regular Disaster Recovery testing and load testing to stay ahead of expected growth of Coupang services.
- Work closely with product development teams to ensure the products are designed with scale and operability in mind.
- Build right guardrails and automation for deploying production changes holding the reliability bar.
- Participate in a 24x7 rotation for production issue escalations, functions well in a fast-paced environment.
- Communicate effectively with people at all levels of the organization.
Essential Qualifications:
- 5+ years of industry experience building and operating large scale distributed systems.
- Deep UNIX/Linux systems knowledge and administration background.
- Demonstrated programming skills in one or more of: Python, Java, Golang, Ruby.
- Strong problem-solving and analytical skills spanning systems, network (TCP/IP) and code, with a focus on data-driven decision-making.
- Experience with cloud-based infrastructure, including AWS, Azure, or Google Cloud Platform.
- Strong understanding of DevOps and SRE practices, including continuous integration, continuous delivery, and infrastructure as code (IaC).
- Experience with containerization and orchestration technologies, such as Docker and Kubernetes.
- Excellent communication and collaboration skills, with the ability to work with teams across distinct functions and technical domains.
- Knowledge of observability ecosystem including metrics, logging, tracing and tools, such as Prometheus, Grafana, Elastic Stack, Datadog, or New Relic.
Preferred Qualifications:
- Bachelor's degree in computer science, Engineering, or a related technical field.
- Prior experience working with large scale web-based Java architectures and JVM configuration.
- Professional certifications in cloud platforms, monitoring tools, or related technologies.
- Previous experience working on a large-scale ecommerce platform.
전형 절차 및 안내 사항
- 전형 절차
- 서류전형 - 1차 라이브 코딩면접 - 심층 화상면접 (라이브 코딩, 아키텍쳐/디자인 인터뷰 포함) – 최종 합격
- 전형절차는 직무별로 다르게 운영될 수 있으며, 일정 및 상황에 따라 변동될 수 있습니다.
- 전형 일정 및 결과는 지원서에 등록하신 이메일로 개별 안내 드립니다.
- 참고 사항
- 본 공고는 모집 완료 시 조기 마감될 수 있습니다.
- 지원서 내용 중 허위사실이 있는 경우에는 합격이 취소될 수 있습니다.
- 취업 보호 대상자(보훈대상자, 장애인 등)는 관련 법률에 따라 채용우대를 받을 수 있습니다.
- 직급과 담당 업무 범위는 후보자의 전반적인 경력과 경험 등 제반사정을 고려하여 변경될 수 있습니다. 이러한 변경이 필요할 경우, 최종 합격 통지 전 적절한 시기에 후보자와 커뮤니케이션 될 예정입니다.
- 채용 및 업무 수행과 관련하여 요구되는 법령상 자격이 갖추어지지 않은 경우 채용이 제한될 수 있습니다.
개인정보 처리방침
- 쿠팡 그룹은 입사지원자 개인정보 처리방침(아래 링크)에 따라 귀하의 개인정보를 수집하여 처리합니다. https://www.coupang.jobs/kr/privacy-policy
서류 반환 정책
- 본 고지는 『채용절차의공정화에관한법률』 제11조제6항에 따른 것 입니다.
- 당사 채용에 응시한 구직자 중 최종 합격이 되지 못한 구직자는 『채용절차의 공정화에 관한 법률』에 따라 제출한 채용서류의 반환을 청구할 수 있음을 알려 드립니다. 다만, 홈페이지 또는 전자우편으로 제출된 경우나 구직자가 당사의 요구 없이 자발적으로 제출한 경우에는 그러하지 아니하며, 천재지변이나 그 밖에 당사에게 책임 없는 사유로 채용서류가 멸실된 경우에는 반환한 것으로 봅니다.
- 위2항 본문에 따라 채용 서류 반환 청구를 하는 구직자는 채용 서류 반환 청구서 [채용절차의 공정화에 관한 법률 시행규칙 별지 제 3 호 서식]를 작성하여 이메일 ([email protected]) 로 제출하면, 제출이 확인된 날로부터 14 일 이내에 지정한 주소지로 등기우편을 통하여 발송해 드립니다. 이 경우 등기우편요금은 수신자 부담으로 하게 되오니 유념하시기 바랍니다.
- 당사는 위2항 본문에 따른 구직자의 반환 청구에 대비하여 채용 여부가 확정된 날로부터 180 일간 구직자가 제출한 채용서류 원본을 보관하게 되며, 그때까지 채용서류의 반환을 청구하지 아니할 경우에는 『개인정보 보호법』에 따라 지체 없이 채용서류 일체를 파기할 예정입니다.
- 단, 위 1항 내지 4항의 내용은 대한민국의 노동 관계 법령이 적용되는 경우에만 적용됩니다. 그 이외의 경우에는 적용되지 않습니다.