×
Register Here to Apply for Jobs or Post Jobs. X

Applied AI Architect

Job in Austin, Travis County, Texas, 78716, USA
Listing for: Talent Groups
Part Time position
Listed on 2026-03-04
Job specializations:
  • IT/Tech
    AI Engineer, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below

Hybrid Details

Onsite 3 days/week (Tuesday-Thursday)

Job Description

​Austin, Texas client is seeking an Applied AI Architect with deep experience bridging LLM/SLM model research and enterprise productization. You will lead the technical direction from model architecture selection, fine‑tuning, and optimization to deployment and observability, shaping the next generation of agentic AI for cybersecurity. This role demands both foundational knowledge and production practicality designing and validating novel approaches, then translating them into reliable, scalable solutions deployed in the client's platform.

What

You Will Be Doing
  • Drive research-to-production of LLM/SLM systems from design and fine‑tuning to evaluation, deployment, and continual adaptation in enterprise agent workflows.
  • Lead technical choices determining when to apply context engineering, prompt tuning, continued pretraining, supervised fine‑tuning, reasoning fine‑tuning, LoRA, or RL.
  • Architect high‑performance inference and serving using vLLM, NVIDIA NIM, Triton, CUDA, or other optimized frameworks.
  • Integrate reinforcement learning frameworks (veRL, SkyRL, PyTorch, Ray RLlib) to enhance reasoning, adaptability, and agent feedback loops.
  • Develop and operationalize AI Ops pipelines, build benchmarks and metrics for model evaluation, observability, drift detection, and lifecycle automation.
  • Advance agent interoperability using A2A (Agent‑to‑Agent) or MCP (Model Context Protocol) for large‑scale coordination.
  • Collaborate with cybersecurity researchers to embed threat reasoning, anomaly detection, and defensive logic directly into model behavior.
  • Publish, document, and codify reusable AI blueprints for hybrid (cloud + on‑prem) deployments and future research acceleration.
Requirements
  • Proven end‑to‑end experience bringing LLM/SLM research into production from fine‑tuning and inference optimization to evaluation and AI Ops integration. Excellent knowledge of at least one of the following:
    • Deep understanding of data‑model‑infrastructure trade‑offs and optimization under real business constraints.
    • Hands‑on experience fine‑tuning LLMs using frameworks such as LLaMA Factory, NeMo, and PEFT (e.g., LoRA)
    • Strong knowledge of GPU‑accelerated inference (ex: vLLM, NIM, Triton, CUDA, NCCL, PyTorch/XLA).
    • Familiarity with AI Ops tool chains (ex: Weights & Biases, MLflow, Ray Serve).
  • Proficiency in Python, and experience building containerized AI microservices (ex: Docker, Kubernetes, Ray).
  • 8+ years of software engineering or research engineering experience, including the most recent 3 years focused on applied AI/ML and deploying LLM/SLM systems in production at enterprise scale.
  • Proven experience as a Senior technical lead or architect, driving end to end design, roadmap decisions, and productization of AI systems.
  • Deep expertise in cloud‑native architecture across AWS, Azure, or GCP.
  • Experience in mentoring senior engineers, reviewing technical designs, and establishing engineering best practices.
Ways to Stand Out
  • Demonstrated success in building scalable infrastructure and launching LLM/SLM‑based features and agent systems within enterprise platforms.
  • Expertise in quantization, distillation, or GPU profiling to lower inference cost.
  • Clear conceptual understanding of when to fine‑tune vs prompt‑engineer vs use RLHF and evidence of having applied each effectively.
  • Familiarity with agentic frameworks (Lang Chain, AWS Strands, Auto Gen, etc).
  • Deep understanding of A2A/MCP protocols for interoperable multi‑agent systems.
Additional Requirements
  • Research‑driven yet delivery‑focused capable of balancing innovation with practical deployment.
  • Data‑ and results‑oriented every hypothesis must be measurable.
  • Ownership mentality from exploration and experiment to evaluation, optimization, and monitoring.
  • Passionate about turning AI research into defensible, intelligent, and proactive cybersecurity systems.

This position does not offer sponsorship for work permit applications or renewals, either now or in the future. Candidates must be authorized to work in the U.S. without the need for employment‑based visa sponsorship, both currently and moving forward.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary