AI Solutions Architect-Sunnyvale,California,Job Sunnyvale area,California USA,IT/Tech

Position: AI Solutions Architect-Sunnyvale, California, USA

Role Summary：
We are hiring a Solutions Architect focused exclusively on AI model commercialization for the U.S. market. you will be the primary technical leader and customer advocate for Alibaba Cloud's cutting-edge AI Model-as-a-Service (Model Studio) and Tongyi Models (Qwen & Wan) offerings.

Core Responsibilities

AI Model Commercialization & Pre-Sales
Own technical pre-sales for Qwen (LLM) and Wan (multimodal) across the U.S. region.
Drive model selection, competitive replacement, and scale-up decisions for enterprise and AI-native customers.
Lead model-level POCs (latency, quality, cost, throughput), not application or agent builds.
Support customers migrating or benchmarking against other models.
Competitive Positioning & Technical Enablement
Clearly articulate why Qwen / Wan wins vs competitors in real production scenarios.
Run side-by-side model comparisons (quality, stability, cost, scale).
Equip sales and BD teams with clear technical narratives, not generic AI storytelling.
Act as the technical authority in executive and architect-level customer discussions.
Customer Scale & Adoption
Support customers from first POC to production ramp and volume growth.
Optimize for token / video generation scale, not one-off demos.
Identify expansion paths within a country-wide customer portfolio (you own the U.S. end-to-end).
Market Feedback & Product Loop
Bring real competitive feedback from U.S. customers back to product teams.
Influence roadmap around model performance, pricing, deployment regions, and APIs.
Build reusable POC patterns and competitive playbooks for global reuse.

Job Requirements

6–10+ years of experience in Solutions Architect, AI Pre-Sales, or AI Platform roles.
Proven track record in commercializing foundation models—beyond simple usage.
Direct exposure to LLMs or multimodal models or video model providers.

Strong Conceptual Understanding Of

LLM inference characteristics (latency, context length, throughput, cost).
Fine-tuning approaches (LoRA, SFT – conceptual, not research-heavy).
Multimodal/video generation evaluation dimensions.
Ability to own a region and manage multiple customers, balancing scale with limited time for deep custom builds.

Preferred Qualifications

Experience with Qwen, Wan, or other AI models expanding globally.
Background in AI infrastructure, model serving, or AI platform go-to-market (GTM) strategies.
Prior work supporting AI-native startups at scale.
A collaborative team player with professional work ethic, confidence, and cross-functional communication skills.
Exceptional problem-solving ability, adept at navigating ambiguity and driving results in fast-paced, innovative environments.
Language proficiency:
Excellent verbal and written communication in English;
Mandarin fluency is a strong plus, enabling effective collaboration with global stakeholders and cross‑regional teams.

Who Will Succeed in This Role

You think in models, not apps.
You prioritize scale, adoption, and usage over polished demos.

The base pay range for this position at commencement of employment is expected to be between $132,000/year and $216,000/year. However, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience.

If hired, employee will be in an “at-will position” and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.

#J-18808-Ljbffr


Increase/decrease your Search Radius (miles)



Job Posting Language