Caterpillar

Principal AI Data Scientist / Engineer

Posted Yesterday
Hybrid
Irving, TX
Senior level
Lead the AI-enablement vision for an enterprise-scale accounting data system, utilizing AI/LLM expertise, software engineering, and cloud-native architecture for high-performance data management.
The summary above was generated by AI
Career Area:
Technology, Digital and Data
Job Description:
Your Work Shapes the World at Caterpillar Inc.
When you join Caterpillar, you're joining a global team who cares not just about the work we do - but also about each other. We are the makers, problem solvers, and future world builders who are creating stronger, more sustainable communities. We don't just talk about progress and innovation here - we make it happen, with our customers, where we work and live. Together, we are building a better world, so we can all enjoy living in it.
Caterpillar is seeking an elite Principal AI Data Scientist / Engineer, Accounting Systems to join our team and act as a champion of Artificial Intelligence (AI) and Advanced Analytics within Caterpillar's Global Finance Services Division. Reporting directly to the Director of Advanced Analytics within Global Finance, you will lead the AI-enablement vision for our next-generation, enterprise-scale accounting data harmonizer system.
This is a critical senior AI engineering position requiring a rare blend of AI/LLM deployment expertise, robust software engineering experience with high-performance languages, and knowledge of modern cloud-native data architecture. Top candidates will also have experience with accounting systems and principles, and be able to collaborate across groups to understand, identify, and resolve all types of issues. You will lead the technical vision of an AI-enabled, resilient, and highly accurate system that serves as the backbone of enterprise-wide corporate accounting.
What You Will Do:
  • IT Architecture Experience: Leverage previous experience in end-to-end architecture for a multi-sourced data platform, evaluating scalability, performance, and resilience.
  • AI-Driven Entity Resolution: Develop and implement sophisticated strategies for Entity Resolution (ER) by utilizing Large Language Models (LLMs) and Graph Databases (e.g., Neo4J, AWS Neptune, CosmosDB) to accurately map, reconcile, and standardize accounting data across diverse sources.
  • Advanced RAG Implementation: Architect and deploy production-grade Retrieval-Augmented Generation (RAG) pipelines for complex data interpretation and standardization. This includes managing the underlying Vector Databases and optimizing prompt/context engineering for high accuracy.
  • Performance Optimization: Understand performance SLAs. Leverage specialized databases such as OLAP solutions (e.g., DuckDB, ClickHouse) for rapid analytics and in-memory stores/caching (e.g., Redis) for low-latency access.
  • Cloud Infrastructure and Deployment: Engage with IT experts on cloud deployment strategy (AWS/Azure), containerization (Docker) and orchestration (Kubernetes) to ensure robust, scalable, and observable deployments.
  • Cross-Functional Strategy: Collaborate directly with Accounting, ERP knowledge owners, IT, MDM, and Data Quality teams to translate complex accounting requirements into scalable, automated technical solutions.

Skills Descriptors:
Self-Starting, High Accountability, and Execution-Focused Mindset:
  • Must demonstrate strong initiative, interpersonal skills, and the ability to communicate effectively

Core Engineering and Architecture:
  • Programming Proficiency: Mastery of Python (for AI/ML) and strong proficiency in at least one compiled, high-performance language (e.g., Go, Java, C#/.NET) for building scalable backend services
  • Cloud Expertise: Extensive experience architecting solutions on AWS or Azure.
  • Containerization & Orchestration: Knowledge of Docker and Kubernetes (K8s) in a production environment
  • Streaming/Messaging: Proven experience designing systems utilizing Kafka or similar technologies (e.g., Kinesis, RabbitMQ)

Advanced AI/LLM Deployment:
  • Demonstrated experience deploying LLMs in a production environment for data-centric tasks (not just chatbots)
  • Specific expertise in building RAG pipelines, managing Vector Databases (e.g., Pinecone, Weaviate, PGVector), and advanced prompt/context engineering
  • Experience with deep learning frameworks (PyTorch, TensorFlow) and the HuggingFace ecosystem
  • Experience with agentic frameworks (e.g., LangGraph, AutoGen)

Modern Data Stack and Databases:
  • Entity Resolution: Proven track record of solving complex entity resolution challenges at scale
  • Graph Databases: Hands-on experience with Neo4J, AWS Neptune, or CosmosDB, specifically applied to ER or MDM
  • Data Warehousing: Deep expertise in Snowflake architecture and optimization.
  • Fast Analytics: Experience utilizing OLAP databases (e.g., DuckDB) and in-memory stores (e.g., Redis) for performance optimization

Domain Knowledge:
  • Strong understanding of corporate accounting principles, consolidation processes, ERP system data structures (e.g., SAP, Oracle), and the nuances of accounting data

Top Candidates Will Also Have:
  • Certifications in AWS or Azure architecture
  • Experience with Infrastructure as Code (IaC) tools (e.g., Terraform)
  • A track record of technical excellence in a Fortune 500 environment

Additional Information:
  • This position can be located in Irving, TX or Peoria, IL
  • Travel requirements will be less than 10%
  • This role currently has no direct reports
  • Domestic relocation is available to those who qualify
  • Sponsorship is NOT available

What You Will Get:
  • Our goal at Caterpillar is for you to have a rewarding career. Our teams are critical to the success of our customers who build a better world.
  • Here you earn more than just a salary because we value your performance. We offer a total rewards package that provides benefits on day one (medical, dental, vision, RX, and 401K) along with the potential of an annual bonus. Additional benefits include paid vacation days and paid holidays.
  • All qualified individuals, including minorities, females, veterans and individuals with disabilities, are encouraged to apply.

About Caterpillar:
Caterpillar Inc. is the world's leading manufacturer of construction and mining equipment, off-highway diesel and natural gas engines, industrial gas turbines and diesel-electric locomotives. For nearly 100 years, we've been helping customers build a better, more sustainable world, and we are committed to and contributing to a reduced-carbon future. Our innovative products and services, backed by our global dealer network, provide exceptional value that helps customers succeed.
Final details:
Please check the email associated with your application frequently, including the junk/spam folder, as this is the primary correspondence method. If you wish to know the status of your application, please use the candidate log-in on our career website, as it will reflect any updates to your status.
Summary Pay Range:
$144,960.00 - $235,440.00
Compensation and benefits offered may vary depending on multiple individualized factors, job level, market location, job-related knowledge, skills, individual performance and experience. Please note that salary is only one component of total compensation at Caterpillar.
Benefits:
Subject to plan eligibility, terms, and guidelines. This is a summary list of benefits.
  • Medical, dental, and vision benefits*
  • Paid time off plan (Vacation, Holidays, Volunteer, etc.)*
  • 401(k) savings plans*
  • Health Savings Account (HSA)*
  • Flexible Spending Accounts (FSAs)*
  • Healthy Lifestyle Programs*
  • Employee Assistance Program*
  • Voluntary Benefits and Employee Discounts*
  • Career Development*
  • Incentive bonus*
  • Disability benefits
  • Life Insurance
  • Parental leave
  • Adoption benefits
  • Tuition Reimbursement

* These benefits also apply to part-time employees
This position requires working onsite five days a week.
Relocation is available for this position.
Visa sponsorship is not available for this position. This employer is not currently hiring foreign national applicants who require or will require sponsorship tied to a specific employer, such as H, L, TN, F, J, E, or O visas. As a global company, Caterpillar offers many job opportunities outside of the U.S., which can be found through our employment website at www.caterpillar.com/careers.
Posting Dates:
September 3, 2025 - September 17, 2025
Any offer of employment is conditioned upon the successful completion of a drug screen.
Caterpillar is an Equal Opportunity Employer, Including Veterans and Individuals with Disabilities. Qualified applicants of any age are encouraged to apply.

Top Skills

AWS
AWS Neptune
Azure
C#/.NET
ClickHouse
CosmosDB
Docker
DuckDB
Go
HuggingFace
Java
Kafka
Kubernetes
Neo4J
PGVector
Pinecone
Python
PyTorch
Redis
Snowflake
TensorFlow
Weaviate
