Poshmark

Staff Site Reliability Engineer I

Posted 3 Days Ago
In-Office
Chennai, Tamil Nadu, IND
Senior level
The Staff Site Reliability Engineer ensures the health, performance, and capacity of internet-facing services, contributing to automation and deployment in a fast-paced environment.
About Poshmark


Poshmark is the leading fashion marketplace where style comes alive through discovery, self-expression, and human connection. Powered by a vibrant community of 165 million members, Poshmark brings real people and taste to shopping through a social experience shaped by shared discovery. Buying and selling fashion feels simple, joyful, and personal, while every item tells its own story. Poshmark empowers sellers to grow meaningful businesses, keeps fashion in circulation longer, and gives shoppers access to unique and trusted finds, from everyday pieces to one-of-a-kind vintage and luxury.

 

We’re looking for an experienced Staff Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through identifying and resolving production issues. The ideal candidate will be passionate about an operations role that involves deep knowledge of both the application and the product, and will also believe that automation is a key component of operating large-scale systems.

6-Month Accomplishments

  • Familiarize yourself with the Poshmark tech stack and functional requirements.

  • Get comfortable with the automation tools and frameworks used within the CloudOps organization and with the associated deployment processes.

  • Gain in-depth knowledge of the relevant product functionality and the infrastructure it requires.

  • Start contributing by working on small- to medium-scale projects.

  • Understand and follow the on-call rotation as a secondary to become familiar with the on-call process.

12+ Month Accomplishments

  • Execute projects related to comms functionality independently, with little guidance from the lead.

  • Create meaningful alerts and dashboards for the various subsystems involved in the targeted infrastructure.

  • Identify gaps in infrastructure and suggest or implement improvements.

  • Get involved in the on-call rotation.

Responsibilities

  • Serve as a primary point of contact responsible for the overall health, performance, and capacity of one or more of our Internet-facing services.

  • Gain deep knowledge of our complex applications.

  • Assist in the roll-out and deployment of new product features and installations to facilitate our rapid iteration and constant growth.

  • Develop tools to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale UNIX environment.

  • Work closely with development teams to ensure that platforms are designed with "operability" in mind.

  • Function well in a fast-paced, rapidly changing environment.

  • Participate in a 24x7 on-call rotation.

Desired Skills

  • 6+ years of experience in a Systems Engineering/Site Reliability Operations role is required, ideally at a startup or fast-growing company.

  • 6+ years in a UNIX-based large-scale web operations role.

  • 6+ years of experience providing 24/7 support for large-scale production environments.

  • Battle-proven, real-life experience running a large-scale production operation.

  • Experience working on cloud-based infrastructure (e.g., AWS, GCP, Azure).

  • Hands-on experience with continuous integration tools such as Jenkins, configuration management with Ansible, and systems monitoring and alerting with tools such as Nagios, New Relic, and Graphite.

  • Experience with scripting/coding.

  • Ability to use a wide variety of open source technologies and tools.

Technologies we use:

  • Ruby, JavaScript, Node.js, Tomcat, Nginx, HAProxy

  • MongoDB, RabbitMQ, Redis, Elasticsearch

  • Amazon Web Services (EC2, RDS, CloudFront, S3, etc.)

  • Terraform, Packer, Jenkins, Datadog, Kubernetes, Docker, Ansible, and other DevOps tools

Please note that Poshmark will not be able to sponsor work-related visas for this position.
