Machine Learning Engineer, LLM Fine-Tuning
Job in
San Jose, Santa Clara County, California, 95199, USA
Listed on 2026-01-12
Listing for:
First Soft Solutions LLC
Full Time position
Job specializations:
- IT/Tech
- AI Engineer, Machine Learning/ML Engineer
Job Description & How to Apply Below
Machine Learning Engineer, LLM Fine‑Tuning
We are actively hiring for a Machine Learning Engineer focused on LLM fine‑tuning for Verilog/RTL applications.
Location: San Jose, CA (Onsite)
Skills: LLM fine‑tuning, Verilog/RTL, AWS, Bedrock, SageMaker
Responsibilities
- Own the technical roadmap for Verilog/RTL‑focused LLM capabilities, from model selection and adaptation to evaluation, deployment, and continuous improvement.
- Lead a hands‑on team of applied scientists/engineers: set direction, unblock technically, review designs/code, and raise the bar on experimentation velocity and reliability.
- Fine‑tune and customize models using state‑of‑the‑art techniques (LoRA/QLoRA, PEFT, instruction tuning, preference optimization/RLAIF) with robust HDL‑specific evals:
  - Compile‑/lint‑/simulate‑based pass rates, pass@k for code generation, constrained decoding to enforce syntax, and “does‑it‑synthesize” checks.
- Design privacy‑first ML pipelines on AWS:
  - Training/customization and hosting using Amazon Bedrock and SageMaker (or EKS + KServe/Triton/DJL) for bespoke training needs.
  - Artifacts in S3 with KMS CMKs; isolated VPC subnets & PrivateLink (including Bedrock VPC endpoints), IAM least‑privilege, CloudTrail auditing, and Secrets Manager for credentials.
  - Enforce encryption in transit/at rest, data minimization, and no public egress for customer/RTL corpora.
- Stand up dependable model serving: Bedrock model invocation where it fits, and/or low‑latency self‑hosted inference (vLLM/TensorRT‑LLM), autoscaling, and canary/blue‑green rollouts.
- Build an evaluation culture: automatic regression suites that run HDL compilers/simulators, measure behavioral fidelity, and detect hallucinations/constraint violations; model cards and experiment tracking (MLflow/Weights & Biases).
- Partner deeply with hardware design, CAD/EDA, Security, and Legal to source/prepare datasets (anonymization, redaction, licensing), define acceptance gates, and meet compliance requirements.
- Drive productization: integrate LLMs with internal developer tools (IDEs/plug‑ins, code review bots, CI), retrieval (RAG) over internal HDL repos/specs, and safe tool‑use/function‑calling.
- Mentor & uplevel: coach ICs on LLM best practices, reproducible training, critical paper reading, and building secure‑by‑default systems.
Qualifications
- 10+ years total engineering experience with 5+ years in ML/AI or large‑scale distributed systems; 3+ years working directly with transformers/LLMs.
- Proven track record shipping LLM‑powered features in production and leading ambiguous, cross‑functional initiatives at Staff level.
- Deep hands‑on skill with PyTorch, Hugging Face Transformers/PEFT/TRL, distributed training (DeepSpeed/FSDP), quantization‑aware fine‑tuning (LoRA/QLoRA), and constrained/grammar‑guided decoding.
- AWS expertise to design and defend secure enterprise deployments: Bedrock, SageMaker, S3, EC2/EKS/ECR, VPC/Subnets/Security Groups, IAM, KMS, PrivateLink, CloudWatch/CloudTrail, Step Functions, Batch, Secrets Manager.
- Strong software engineering fundamentals: testing, CI/CD, observability, performance tuning; Python a must (bonus for Go/Java/C++).
- Demonstrated ability to set technical vision and influence across teams; excellent written and verbal communication for execs and engineers.
Mid‑Senior level
Employment Type: Full‑time
Job Function: Engineering and Information Technology
Industries: IT Services and IT Consulting