Student Researcher Seed Vision – AI Platform PhD
Listed on 2026-01-16
-
IT/Tech
Data Scientist, Data Engineer
Student Researcher [Seed Vision – AI Platform] – 2026 Start (PhD)
Location:
San Jose
Team:
Technology
Employment Type:
Intern
Job Code: A178475B
OverviewThe Seed Vision AI Platform team builds infrastructure and tooling to support large-scale training, evaluation, and deployment of vision foundation models. Our mission is to accelerate research and production through скорость, high-quality, and well-curated visual data pipelines, covering raw data processing, filtering, annotation, and training-ready formatting across video, image, and multimodal modalities. PhD internships at Byte Dance provide students with the opportunity to actively contribute to our products and research, and to the organization's future plans and emerging technologies.
Our dynamic internship experience blends hands‑on learning, enriching community‑building and development events, and collaboration with industry experts. Applications will be reviewed on a rolling basis – we encourage you to apply early. Please state your availability clearly in your resume (Start date, End date).
- Design and optimize data processing pipelines for large-scale image, video, and multimodal datasets used in model pretraining and fine‑tuning.
- Conduct research on data deduplication, filtering, and quality evaluation to maximize training signal efficiency.
- Collaborate with model teams to close the loop between data characteristics and downstream performance.
- Explore data‑centric machine learning methods, including synthetic data generation, dataset pruning, and active data selection.
- Build high‑throughput systems for dataset tracking, versioning, and feedback‑based iteration.
- Currently pursuing a PhD in Computer Vision, Machine Learning, Systems, or a related field.
- Research experience in data‑centric ML, vision data pipelines, or training dataset optimization.
- Familiarity with deep learning frameworks (e.g., PyTorch, Tensor Flow) and data processing stacks (e.g., Spark, Ray, DALI).
- Strong engineering skills in Python and/or distributed data systems.
- Experience working with large‑scale visual datasets (e.g., LAION, Web Vid, Image Net, Ego4D).
- Background in data evaluation, synthetic data curation, or auto‑labeling systems.
- Familiarity with vision foundation model pretraining workflows (e.g., CLIP, DINO, EVA, Intern Image).
- Understanding of data–model alignment loops and evaluation‑driven dataset iteration.
The hourly rate range for this position in the selected city is $60- $60.
Benefits may vary depending on the nature of employment and the country work location. Interns have day one access to health insurance, life proprio, wellbeing benefits and more. Interns also receive 10 paid holidays per year and paid sick time (56 hours if hired in the first half of the year, 40 if hired in the second half). Interns who are not working 100% remote may also be eligible for housing allowance.
The Company reserves the అది to modify or change these benefits programs at any time, with or without notice.
EEO Statement – Los Angeles County (unincorporated) CandidatesQualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment:
- Interacting and occasionally having unsupervised contact with internal/external clients and/or colleagues;
- Appropriately handling and managing confidential information including proprietary and trade secret information and access to information technology systems; and
- Exercising sound judgment.
Byte Dance is committed to providing reasonable accommodations in our recruitment processes for candidates with disabilities, pregnancy, sincerely held religious beliefs or other reasons protected by applicable laws. If you need assistance or a reasonable accommodation, please reach out to us at
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).