The Principal Data Scientist will analyze business problems, develop advanced ML models, and implement solutions at scale while leveraging NLP and deep learning techniques.
As an expectation a fitting candidate must have/be:
Good to have: Document analysis using Image processing/computer vision and geometric deep learning
Technology Stack:
Python as a primary programming language.
Conceptual understanding of classic ML/DL Algorithms like Regression, Support Vectors, Decision tree, Clustering, Random Forest, CART, Ensemble, Neural Networks, CNN, RNN, LSTM etc.
Morningstar is an equal opportunity employer.
Morningstar's hybrid work environment gives you the opportunity to work remotely and collaborate in-person each week. We've found that we're at our best when we're purposely together on a regular basis, at least three days each week. A range of other benefits are also available to enhance flexibility as needs change. No matter where you are, you'll have tools and resources to engage meaningfully with your global colleagues.
I10_MstarIndiaPvtLtd Morningstar India Private Ltd. (Delhi) Legal Entity
- Ability to analyze business problem and cut through the data challenges.
- Ability to churn the raw corpus and develop a data/ML model to provide business analytics (not just EDA), machine learning based document processing and information retrieval
- Quick to develop the POCs and transform it to high scale production ready code.
- Experience in extracting data through complex unstructured documents using NLP based technologies.
Good to have: Document analysis using Image processing/computer vision and geometric deep learning
Technology Stack:
Python as a primary programming language.
Conceptual understanding of classic ML/DL Algorithms like Regression, Support Vectors, Decision tree, Clustering, Random Forest, CART, Ensemble, Neural Networks, CNN, RNN, LSTM etc.
- Programming:
- Must Have: Must be hands-on with data structures using List, tuple, dictionary, collections, iterators, Pandas, NumPy and Object-oriented programming
- Good to have: Design patterns/System design, cython
- ML libraries:
- Must Have: Scikit-learn, XGBoost, imblearn, SciPy, Gensim
- Good to have: matplotlib/plotly, Lime/sharp
- Data extraction and handling:
- Must Have: DASK/Modin, beautifulsoup/scrappy, Multiprocessing
- Good to have: Data Augmentation, Pyspark, Accelerate
- NLP/Text analytics:
- Must Have: Bag of words, text ranking algorithm, Word2vec, language model, entity recognition, CRF/HMM, topic modelling, Sequence to Sequence
- Good to have: Machine comprehension, translation, elastic search
- Deep learning:
- Must Have: TensorFlow/PyTorch, Neural nets, Sequential models, CNN, LSTM/GRU/RNN, Attention, Transformers, Residual Networks
- Good to have: Knowledge of optimization, Distributed training/computing, Language models
- Software peripherals:
- Must Have: REST services, SQL/NoSQL, UNIX, Code versioning
- Good to have: Docker containers, data versioning
- Research:
- Must Have: Well verse with latest trends in ML and DL area. Zeal to research and implement cutting areas in AI segment to solve complex problems
- Good to have: Contributed to research papers/patents and it is published on internet in ML and DL
Morningstar is an equal opportunity employer.
Morningstar's hybrid work environment gives you the opportunity to work remotely and collaborate in-person each week. We've found that we're at our best when we're purposely together on a regular basis, at least three days each week. A range of other benefits are also available to enhance flexibility as needs change. No matter where you are, you'll have tools and resources to engage meaningfully with your global colleagues.
I10_MstarIndiaPvtLtd Morningstar India Private Ltd. (Delhi) Legal Entity
Top Skills
Beautifulsoup
Dask
Gensim
Imblearn
Modin
NoSQL
Python
PyTorch
Rest Services
Scikit-Learn
Scipy
Scrappy
SQL
TensorFlow
Xgboost
Similar Jobs at Morningstar
Enterprise Web • Fintech • Financial Services
Design and develop scalable AI/ML solutions, employing advanced techniques in machine learning and NLP. Lead feature engineering and data visualization efforts, collaborate using cloud solutions, and implement effective development practices. Requires 3-5 years in data science.
Top Skills:
AWSAzureGCPIbm WatsonKerasMs SqlNltkNumpyPandasPostgresPythonPyTorchScikit-LearnScipyTensorFlow
Enterprise Web • Fintech • Financial Services
The Analyst will support cash flow analysis in structured finance, develop tools with Python and VBA, and analyze credit risk in mortgage-backed securities.
Top Skills:
C#C++ExcelIntexMatlabPythonRSQLVBA
Enterprise Web • Fintech • Financial Services
The Senior Primary Research Associate drives outreach initiatives to collect proprietary data from industry professionals, ensuring data quality and supporting team objectives.
Top Skills:
Data OperationsFinancial Research ToolsOutreach Campaigns
What you need to know about the Chennai Tech Scene
To locals, it's no secret that South India is leading the charge in big data infrastructure. While the environmental impact of data centers has long been a concern, emerging hubs like Chennai are favored by companies seeking ready access to renewable energy resources, which provide more sustainable and cost-effective solutions. As a result, Chennai, along with neighboring Bengaluru and Hyderabad, is poised for significant growth, with a projected 65 percent increase in data center capacity over the next decade.