×
Register Here to Apply for Jobs or Post Jobs. X

Algotale-Senior AI​/ML Solution Architect - Generative AI & Agentic Systems

Job in Town of Poland, Jamestown, Chautauqua County, New York, 14701, USA
Listing for: Nexthire
Full Time position
Listed on 2026-01-17
Job specializations:
  • IT/Tech
    AI Engineer, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below
Location: Town of Poland

Senior AI/ML Solution Architect - Generative AI & Agentic Systems

Algotale is a premier IT staffing and software solutions provider, delivering top-tier talent and custom-built technology to drive business success. With a strong network of skilled professionals across software development, cloud solutions, and project management, we help companies scale efficiently and execute projects seamlessly. Our flexible engagement models cater to both short-term and long-term needs, ensuring precision-matched expertise for every requirement.

From IT staffing to full-cycle software development, Algotale empowers businesses with innovative, high-impact solutions.

Position Overview

We are looking for a Senior AI/ML Solution Architect with deep expertise in Generative AI and agentic systems to lead the design and implementation of enterprise-scale AI solutions. This role requires a unique blend of hands‑on technical expertise in both Large Language Models (LLMs) and Small Language Models (SLMs), combined with the architectural vision to deploy these solutions across diverse computing environments.

The ideal candidate will architect scalable agentic solutions, implement advanced fine‑tuning strategies, and design comprehensive integration systems that connect AI capabilities with enterprise applications. You will be at the forefront of our AI transformation initiatives, working with cutting‑edge technologies while maintaining a practical approach to deployment and optimization.

Experience Requirements
  • Overall

    Experience:

    8+ years in technology and software development
  • Generative AI

    Experience:

    2+ years of hands‑on experience with LLMs and generative AI systems
  • Solution Architecture

    Experience:

    4+ years architecting enterprise‑scale solutions
Key Responsibilities Architecture & Design
  • Design and architect scalable agentic solutions using advanced LLM capabilities
  • Implement Model Context Protocol (MCP) integrations to connect applications with diverse external services and APIs
  • Develop multi‑agent orchestration systems for complex workflow automation
  • Design context and memory management systems for persistent agent interactions
Technical Implementation
  • Build and optimize Retrieval‑Augmented Generation (RAG) systems for efficient knowledge retrieval
  • Implement agent frameworks (Lang Chain, Lang Graph, Semantic Kernel, Agno) for various deployment environments
  • Design and deploy model inference pipelines optimized for different computing environments (cloud, edge, on‑premises)
  • Develop comprehensive fine‑tuning strategies for both Large Language Models (LLMs) and Small Language Models (SLMs)
  • Architect SLM deployment strategies for resource‑constrained environments
  • Implement model compression and quantization techniques for efficient inference
Integration & Connectivity
  • Architect REST/gRPC/Graph

    QL APIs and SDK integrations for seamless service connectivity
  • Implement event‑driven architectures using webhooks and message buses
  • Design secure authentication and authorization systems (SSO/OIDC)
  • Build connectors for popular platforms (Slack, Jira, Salesforce, CRM/ERP systems)
Data & Model Management
  • Design comprehensive data preprocessing pipelines including cleaning, deduplication, and PII reduction
  • Implement embedding creation and re‑embedding strategies for optimal retrieval
  • Develop chunking and windowing strategies for mobile‑optimized content processing
  • Establish model selection criteria and evaluation frameworks
Required Technical Skills Core AI/ML Expertise
  • Foundation Models:
    Deep experience with GPT‑4, Claude, LLaMA, and other state‑of‑the‑art LLMs
  • Small Language Models (SLMs):
    Expertise in deploying and optimizing SLMs (Phi‑3, Gemma, Tiny Llama) for mobile environments
  • Agent Frameworks:
    Proficiency in Lang Chain, Lang Graph, Microsoft Semantic Kernel, Agno, and custom agent development
  • RAG Systems:
    Advanced knowledge of retrieval‑augmented generation, vector databases, and semantic search
Fine‑tuning & Adaptation
  • Advanced fine‑tuning techniques:
    LoRA/QLoRA, DoRA, AdaLoRA for parameter‑efficient training
  • Model compression:
    Pruning, quantization (INT8/INT4), knowledge distillation
  • Prompt‑tuning, adapters, prefix tuning, and P‑tuning v2 methodologies
  • RLHF/…
Position Requirements
10+ Years work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary