Red Teaming Domain Expert - AI Training
San Francisco, San Francisco County, California, 94199, USA
Listed on 2026-02-28
IT/Tech
AI Engineer, Artificial Intelligence
Red Teaming Domain Expert - AI Training (Contract)
Company: Handshake
Location: San Francisco, CA (Remote)
Pay: $40 - $60/hr
Type: Contractor
Posted: 15 hours ago
About Handshake AI
Handshake is building the career network for the AI economy. Our three‑sided marketplace connects 18 million students and alumni, 1,500+ academic institutions across the U.S. and Europe, and 1 million employers to power how the next generation explores careers, builds skills, and gets hired. Handshake AI is a human data labeling business that leverages the scale of the largest early‑career network. We work directly with the world’s leading AI research labs to build a new generation of human data products.
From PhDs in physics to undergrads fluent in LLMs, Handshake AI is the trusted partner for domain‑specific data and evaluation at scale.
This is a unique opportunity to join a fast‑growing team shaping the future of AI through better data, better tools, and better systems—for experts, by experts. Now’s a great time to join Handshake.
About The Role
As a Red Teamer, you will stress‑test AI models by intentionally trying to break them. Instead of checking whether an answer is correct, you’ll design creative, adversarial prompts that expose vulnerabilities—unsafe content, bias, broken guardrails, or unexpected behaviors. Your work directly supports AI safety and model robustness for leading research labs.
This role requires creativity, curiosity, and an ability to think like an adversary while operating with strong ethical judgment. No technical background is required. What matters most is how you think, how you write, and how you problem‑solve. This is a remote contract position with variable time commitments, typically 10–20 hours per week.
Day‑to‑day Responsibilities
- Crafting creative prompts and scenarios to intentionally stress‑test AI guardrails.
- Discovering ways around safety filters, restrictions, and defenses.
- Exploring edge cases to provoke disallowed, harmful, or incorrect outputs.
- Documenting experiments clearly, including what you tried and why.
- Reviewing and refining adversarial prompts generated by Fellows.
- Collaborating with engineers, tutors, and researchers to share findings and strengthen defenses.
- Working with potentially disturbing content, including violence, explicit topics, and hate speech.
- Staying current on jailbreaks, attack methods, and evolving model behaviors.
Qualifications
- Strong hands‑on experience using multiple LLMs.
- Intuition for crafting prompts; familiarity with jailbreak or evasion techniques is a plus.
- Creative, adversarial problem‑solving skills.
- Clear and thoughtful written communication.
- Ability to tolerate emotionally heavy or graphic content.
- Curiosity, persistence, and comfort with frequent failure in experimentation.
- Strong ethical judgment and ability to separate adversarial thinking from personal values.
- Self‑directed, collaborative, and comfortable in feedback‑heavy environments.
- You go deep into unusual interests (fandoms, niche internet cultures, gaming exploits, Wikipedia rabbit holes, etc.).
- You come from a creative background (writing, visual art, etc.).
- You are obsessed with AI and can’t stop talking about it.
Nice to Have
- Prior red‑teaming, moderation, or adversarial testing experience.
- Background in writing, gaming, improv, or niche internet subcultures.
- Experience documenting complex processes or research.
- Familiarity with safety, trust & safety, or digital security concepts.
- Engagement: Contract, remote, variable time commitment.
- Schedule: Flexibility required, with some evening or weekend availability.
- Location: Fully remote (no visa sponsorship available).
- Technical Requirements: Personal device running Windows 10 or macOS Big Sur 11.0+ and reliable smartphone access.