More jobs:
AI Architect - Remote
Remote / Online - Candidates ideally in
New York, New York County, New York, 10261, USA
Listed on 2026-03-01
New York, New York County, New York, 10261, USA
Listing for:
Saransh Inc
Full Time, Remote/Work from Home
position Listed on 2026-03-01
Job specializations:
-
Software Development
AI Engineer
Job Description & How to Apply Below
Overview
Title:
AI Architect
Location:
NYC, NY
AI Architect to lead the design and implementation of enterprise-scale AI solutions for financial services automation. Drive architectural decisions for LLM-based systems, agentic workflows, and intelligent document processing platforms serving private equity and fund management operations.
Required Qualifications- 15+ years of experience in AI/ML architecture with 8+ years in enterprise AI solutions.
- Deep expertise in LLM architectures, prompt engineering, and agentic frameworks (Lang Graph, Lang Mem).
- Hands-on experience with Azure OpenAI GPT-4/5, embedding models, and Azure cloud services.
- Strong background in Python, distributed systems, and enterprise architecture.
- Experience with Claude Code for agentic coding and AI-powered development.
- Proven track record in financial services or regulatory compliance environments.
- Expert knowledge of RAG architectures, advanced RAG patterns, and vector database optimization.
- Experience with Small Language Models (SLM), Agent-to-Agent (A2A) communication, and Model Context Protocol (MCP).
- Proven ability to architect and scale AI solutions for enterprise workloads (1M+ documents, sub-second response times).
- Design end-to-end AI solutions for private equity fund operations and financial automation.
- Architect scalable agentic AI frameworks using Lang Graph, Lang Mem, and custom agent orchestration.
- Lead technical strategy for Azure OpenAI GPT-5 integration and advanced embedding-based retrieval systems.
- Design and implement advanced RAG architectures including hybrid search, query routing, and contextual retrieval.
- Establish multi-agent systems with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP).
- Architect Small Language Model (SLM) integration for specialized tasks and cost optimization.
- Design enterprise-scale solutions supporting millions of documents with sub-second query response times.
- Establish AI governance, model safety protocols, and regulatory compliance frameworks.
- Lead architectural reviews for distributed AI systems, microservices, and cloud-native deployments.
- Hands-on development using Claude Code for rapid prototyping and agentic workflows.
- Drive architectural reviews for Llama Parse/Azure Document Intelligence integration.
- Design fault-tolerant, high-availability AI systems with automatic failover and load balancing.
- Establish comprehensive monitoring, observability, and performance optimization strategies.
- Mentor technical teams and establish AI engineering best practices using modern tool chains.
Oversee model performance evaluation using Lang Graph evals and Deep Eval frameworks.
Seniorities- Mid-Senior level
- Full-time
- Design, Art/Creative, and Information Technology
- IT Services and IT Consulting
New York, NY – salary range not specified in original posting.
#J-18808-LjbffrTo View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×