Sr AI Data Engineer
Listed on 2026-03-11
-
IT/Tech
AI Engineer, Machine Learning/ ML Engineer
As a Sr Data Engineer here at Honeywell, you will play a crucial role in designing and implementing advanced data solutions for AI solutions that drive business insights, enhance decision-making processes and empower AI solutions. Your expertise will help in critical data science development activities across all AI modalities (classic, Gen and agentic) and data types (structured and unstructured).
You will report directly to our AI Director and you’ll work out of our Phoenix, AZ or Charlotte, NC location on a Hybrid work schedule.
In this role, you will impact the organization by leveraging your technical skills to develop innovative data solutions that support strategic initiatives and improve operational efficiency.
KEY RESPONSIBILITIES- Support end‑to‑end data needs for all AI modalities
, including classic ML, GenAI/LLMs, and agentic AI systems. - Build robust, scalable data pipelines for structured, semi‑structured, and unstructured data
, including text, documents, images, audio, video, and logs. - Develop feature engineering pipelines for classic ML, including feature extraction, transformation, and feature store management.
- Build and optimize GenAI and LLM data pipelines
, including embedding generation, vectorization, chunking, metadata extraction, and document enrichment for RAG and context retrieval. - Develop data ingestion and orchestration workflows that support agentic AI
, including memory stores, event-driven pipelines, tool-use data flows, and real‑time retrieval services. - Design and implement advanced data solutions using AWS (S3, Glue, Lambda, EMR, Kinesis), Databricks (Spark, Delta Lake, Vector Search), and Dataiku to enable intelligent systems at scale.
- Implement data governance, quality, lineage, monitoring, and observability to support high-performance, trustworthy AI.
- Partner with data scientists, ML engineers, and AI product teams to deliver datasets for model development, fine‑tuning, evaluation, and production inference.
- Optimize pipelines for latency, cost, reliability, and throughput, ensuring AI systems—from batch ML to real‑time agents—have the data they need.
- Bachelor’s degree from an accredited institution in a technical discipline such as science, technology, engineering, mathematics.
- 5 or more years of experience in data engineering, distributed data systems, or ML data pipelines.
- Strong experience working with Apache Spark
, preferably in Databricks. - Proficiency in Python and SQL; experience with distributed computing and big data frameworks.
- Hands‑on experience with cloud‑based ETL/ELT pipelines
, preferably AWS (S3, Glue, Lambda, EMR, Step Functions, Redshift, Athena). - Experience building data solutions that support multiple AI workloads
, including: - ML training and inference data flows
- Unstructured data ingestion and transformation
- Embedding/vector pipelines for LLMs
- Experience working with data modeling, data integration, ETL/ELT frameworks, and reliable production‑grade pipelines.
- Bachelor’s degree in a technical field (CS, Engineering, Math, or related).
- Experience supporting AI at scale across classic ML, GenAI/LLM, and agentic AI systems
. - Experience with vector databases and semantic search (Databricks Vector Search, Pinecone, FAISS, Milvus, Open Search).
- Familiarity with LLM and GenAI data preparation
, including: - Text processing
- Tokenization
- Chunking strategies
- Prompt/context formatting
- Experience with unstructured data technologies (OCR, NLP pipelines, computer vision data processing).
- Hands‑on experience with Dataiku for automation, workflow orchestration, and AI project management.
- Knowledge of MLOps tooling: MLflow, Delta Lake, experiment tracking, CI/CD for ML.
- Understanding of agentic AI system patterns
, such as memory architectures, tool APIs, event‑driven workflows, and reasoning chain data requirements. - Strong analytical mindset, attention to detail, and commitment to high data quality.
- Ability to thrive in a fast‑paced, evolving AI environment and collaborate across cross‑functional teams.
In addition to a competitive salary, leading‑edge work, and developing solutions side‑by‑side with dedicated experts in their fields,…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).