CrowdStrike Logo

CrowdStrike

Sr. Engineer - ML Platform (Remote, IND)

Sorry, this job was removed at 12:33 p.m. (IST) on Tuesday, Dec 10, 2024
Be an Early Applicant
Remote
Remote

#WeAreCrowdStrike and our mission is to stop breaches. As a global leader in cybersecurity, our team changed the game. Since our inception, our market leading cloud-native platform has offered unparalleled protection against the most sophisticated cyberattacks. We're looking for people with limitless passion, a relentless focus on innovation and a fanatical commitment to the customer to join us in shaping the future of cybersecurity. Consistently recognized as a top workplace, CrowdStrike is committed to cultivating an inclusive, remote-first culture that offers people the autonomy and flexibility to balance the needs of work and life while taking their career to the next level. Interested in working for a company that sets the standard and leads with integrity? Join us on a mission that matters - one team, one fight.
About the Role:
The charter of the Data + ML Platform team is to harness all the data that is ingested and cataloged within the Data LakeHouse for exploration, insights, model development, ML Engineering and Insights Activation. This team is situated within the larger Data Platform group, which serves as one of the core pillars of our company. We process data at a truly immense scale. Our processing is composed of various facets including threat events collected via telemetry data, associated metadata, along with IT asset information, contextual information about threat exposure based on additional processing, etc. These facets comprise the overall data platform, which is currently over 200 PB and maintained in a hyper scale Data Lakehouse, built and owned by the Data Platform team. The ingestion mechanisms include both batch and near real-time streams that form the core Threat Analytics Platform used for insights, threat hunting, incident investigations and more.
As an engineer in this team, you will play an integral role as we build out our ML Experimentation Platform from the ground up. You will collaborate closely with Data Platform Software Engineers, Data Scientists & Threat Analysts to design, implement, and maintain scalable ML pipelines that will be used for Data Preparation, Cataloging, Feature Engineering, Model Training, and Model Serving that influence critical business decisions. You'll be a key contributor in a production-focused culture that bridges the gap between model development and operational success. Future plans include generative AI investments for use cases such as modeling attack paths for IT assets.
What You'll Do:

  • Help design, build, and facilitate adoption of a modern Data+ML platform
  • Modularize complex ML code into standardized and repeatable components
  • Establish and facilitate adoption of repeatable patterns for model development, deployment, and monitoring
  • Build a platform that scales to thousands of users and offers self-service capability to build ML experimentation pipelines
  • Leverage workflow orchestration tools to deploy efficient and scalable execution of complex data and ML pipelines
  • Review code changes from data scientists and champion software development best practices
  • Leverage cloud services like Kubernetes, blob storage, and queues in our cloud first environment


What You'll Need:

  • B.S. in Computer Science, Data Science, Statistics, Applied Mathematics, or a related field and 10+ years related experience; or M.S. with 8+ years of experience; or Ph.D with 6+ years of experience.
  • 3+ years experience developing and deploying machine learning solutions to production. Familiarity with typical machine learning algorithms from an engineering perspective (how they are built and used, not necessarily the theory); familiarity with supervised / unsupervised approaches: how, why, and when and labeled data is created and used
  • 3+ years experience with ML Platform tools like Jupyter Notebooks, NVidia Workbench, MLFlow, Ray, Vertex AI etc.
  • Experience building data platform product(s) or features with (one of) Apache Spark, Flink or comparable tools in GCP. Experience with Iceberg is highly desirable.
  • Proficiency in distributed computing and orchestration technologies (Kubernetes, Airflow, etc.)
  • Production experience with infrastructure-as-code tools such as Terraform, FluxCD
  • Expert level experience with Python; Java/Scala exposure is recommended. Ability to write Python interfaces to provide standardized and simplified interfaces for data scientists to utilize internal Crowdstrike tools
  • Expert level experience with CI/CD frameworks such as GitHub Actions
  • Expert level experience with containerization frameworks
  • Strong analytical and problem solving skills, capable of working in a dynamic environment
  • Exceptional interpersonal and communication skills. Work with stakeholders across multiple teams and synthesize their needs into software interfaces and processes.


Critical Skills Needed for Role:

  • Distributed Systems Knowledge
  • Data Platform Experience
  • Machine Learning concepts


Experience with the Following is Desirable:

  • Go
  • Iceberg
  • Pinot or other time-series/OLAP-style database
  • Jenkins
  • Parquet
  • Protocol Buffers/GRPC


#LI-DP1
#LI-VJ1
#LI-Remote
Benefits of Working at CrowdStrike:

  • Remote-first culture
  • Market leader in compensation and equity awards with option to participate in ESPP in eligible countries
  • Competitive vacation and flexible working arrangements
  • Physical and mental wellness programs
  • Paid parental leave, including adoption
  • A variety of professional development and mentorship opportunities
  • Access to CrowdStrike University, LinkedIn Learning and Jhanna
  • Offices with stocked kitchens when you need to fuel innovation and collaboration
  • Birthday time-off in your local country
  • Work with people who are passionate in our mission and Great Place to Work certified across the globe


CrowdStrike is proud to be an equal opportunity and affirmative action employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. Our approach to cultivating a diverse, equitable, and inclusive culture is rooted in listening, learning and collective action. By embracing the diversity of our people, we achieve our best work and fuel innovation - generating the best possible outcomes for our customers and the communities they serve.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran. If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at [email protected] for further assistance.

Similar Jobs at CrowdStrike

Be an Early Applicant
19 Hours Ago
15 Locations
Remote
10,000 Employees
Senior level
10,000 Employees
Senior level
Cloud • Information Technology • Sales • Security • Cybersecurity
You will design and build a modern Data+ML platform, modularizing ML code, establishing development patterns, and implementing ML pipelines for data preparation and model serving. Collaborate with cross-functional teams to enhance ML capabilities in a production environment.
7 Days Ago
15 Locations
Remote
10,000 Employees
Senior level
10,000 Employees
Senior level
Cloud • Information Technology • Sales • Security • Cybersecurity
As a Senior Engineer on the ML Platform team at CrowdStrike, you will build and maintain scalable ML pipelines, contribute to the development of the ML Experimentation Platform, and collaborate with Data Scientists and Engineers. You’ll facilitate the adoption of modern data and ML solutions, modularize complex ML code, and ensure adherence to software development best practices.
Be an Early Applicant
19 Hours Ago
16 Locations
Remote
10,000 Employees
Senior level
10,000 Employees
Senior level
Cloud • Information Technology • Sales • Security • Cybersecurity
As a Software Engineer at CrowdStrike, you will develop the Falcon Host sensor, leading the design and implementation of features across various OS subsystems while debugging reports and improving reliability. You will work collaboratively within a distributed team and own projects from design to delivery using Agile methodologies.

What you need to know about the Chennai Tech Scene

To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account