AI Solutions Architect-Sunnyvale, California,
Listed on 2026-03-01
-
IT/Tech
AI Engineer, Data Science Manager, Data Analyst, Data Scientist
Role Summary:
We are hiring a Solutions Architect focused exclusively on AI model commercialization for the U.S. market. you will be the primary technical leader and customer advocate for Alibaba Cloud's cutting-edge AI Model-as-a-Service (Model Studio) and Tongyi Models (Qwen & Wan) offerings.
- AI Model Commercialization & Pre-Sales
- Own technical pre-sales for Qwen (LLM) and Wan (multimodal) across the U.S. region.
- Drive model selection, competitive replacement, and scale-up decisions for enterprise and AI-native customers.
- Lead model-level POCs (latency, quality, cost, throughput), not application or agent builds.
- Support customers migrating or benchmarking against other models.
- Competitive Positioning & Technical Enablement
- Clearly articulate why Qwen / Wan wins vs competitors in real production scenarios.
- Run side-by-side model comparisons (quality, stability, cost, scale).
- Equip sales and BD teams with clear technical narratives, not generic AI storytelling.
- Act as the technical authority in executive and architect-level customer discussions.
- Customer Scale & Adoption
- Support customers from first POC to production ramp and volume growth.
- Optimize for token / video generation scale, not one-off demos.
- Identify expansion paths within a country-wide customer portfolio (you own the U.S. end-to-end).
- Market Feedback & Product Loop
- Bring real competitive feedback from U.S. customers back to product teams.
- Influence roadmap around model performance, pricing, deployment regions, and APIs.
- Build reusable POC patterns and competitive playbooks for global reuse.
- 6–10+ years of experience in Solutions Architect, AI Pre-Sales, or AI Platform roles.
- Proven track record in commercializing foundation models—beyond simple usage.
- Direct exposure to LLMs or multimodal models or video model providers.
- LLM inference characteristics (latency, context length, throughput, cost).
- Fine-tuning approaches (LoRA, SFT – conceptual, not research-heavy).
- Multimodal/video generation evaluation dimensions.
- Ability to own a region and manage multiple customers, balancing scale with limited time for deep custom builds.
- Experience with Qwen, Wan, or other AI models expanding globally.
- Background in AI infrastructure, model serving, or AI platform go-to-market (GTM) strategies.
- Prior work supporting AI-native startups at scale.
- A collaborative team player with professional work ethic, confidence, and cross-functional communication skills.
- Exceptional problem-solving ability, adept at navigating ambiguity and driving results in fast-paced, innovative environments.
- Language proficiency:
Excellent verbal and written communication in English;
Mandarin fluency is a strong plus, enabling effective collaboration with global stakeholders and cross‑regional teams.
- You think in models, not apps.
- You prioritize scale, adoption, and usage over polished demos.
The base pay range for this position at commencement of employment is expected to be between $132,000/year and $216,000/year. However, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience.
If hired, employee will be in an “at-will position” and the Company reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, Company or individual department/team performance, and market factors.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).