×
Register Here to Apply for Jobs or Post Jobs. X

Evaluation Scenario Writer - AI Agent Testing Specialist

Job in 261201, Ahmedabad, Uttar Pradesh, India
Listing for: Mindrift
Part Time position
Listed on 2026-03-05
Job specializations:
  • IT/Tech
    AI Engineer, Data Analyst
Job Description & How to Apply Below
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates.

At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI.

What We Do

The Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe.

About

The Role

We're looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You'll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against. You'll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You'll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.

Although every project is unique, you might typically:

Designing structured test scenarios based on real-world tasks
Defining the golden path and acceptable agent behavior
Annotating task steps, expected outputs, and edge cases
Working with devs to test your scenarios and improve clarity
Reviewing agent outputs and adapting tests accordingly

How To Get Started

Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you'll help shape the future of AI while ensuring technology benefits everyone.

Requirements

You have a Bachelor's or Master's degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems or other related fields.
You have 3+ years of experience
Your level of English is advanced (C1) or above
You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines
Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge

Benefits

Why this freelance opportunity might be a great fit for you

Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments.
Work on advanced AI projects and gain valuable experience that enhances your portfolio.
Influence how future AI models understand and communicate in your field of expertise
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary