Evaluation Scenario Writer - AI Agent Testing Specialist
MindriftJob Description
Mindrift is looking for an Evaluation Scenario Writer to join our team as an AI Agent Testing Specialist. In this role, you'll design realistic and structured evaluation scenarios for LLM-based agents, contributing to the ethical shaping of AI. If you're passionate about AI and possess a strong analytical mindset, this is an excellent opportunity to leverage your skills.
Crafting Effective AI Agent Testing Scenarios
As an Evaluation Scenario Writer, your primary responsibility will be creating test cases that simulate human-performed tasks. You'll define gold-standard behavior, ensuring each scenario is clearly defined, well-scored, and easy to execute and reuse. You will need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions. Learn more about AI Testing.
Key Responsibilities:
- Designing structured test scenarios based on real-world tasks for AI Agent Testing.
- Defining the golden path and acceptable agent behavior.
- Annotating task steps, expected outputs, and edge cases.
- Working with devs to test your scenarios and improve clarity.
- Reviewing agent outputs and adapting tests accordingly.
Ensuring Quality in AI Agent Testing
Your expertise as an Evaluation Scenario Writer will ensure the quality and reliability of AI agents. You'll be responsible for defining the golden path, which includes acceptable agent behavior, and annotating task steps to clarify expected outputs and edge cases. Your efforts will contribute significantly to refining model responses and improving overall AI performance.
Qualifications for the Evaluation Scenario Writer Role
- Bachelor's and/or Master’s Degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems or other related fields.
- Background in QA, software testing, data analysis, or NLP annotation.
- Good understanding of test design principles (e.g., reproducibility, coverage, edge cases).
- Strong written communication skills in English.
- Comfortable with structured formats like JSON/YAML for scenario description.
- Can define expected agent behaviors (gold paths) and scoring logic.
- Basic experience with Python and JS.
- Curious and open to working with AI-generated content, agent logs, and prompt-based behavior.
- You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.
Mindrift provides a flexible, remote, freelance project that fits around your primary professional or academic commitments. This position as an Evaluation Scenario Writer, lets you take part in an advanced AI project and gain valuable experience to enhance your portfolio. Influence how future AI models understand and communicate in your field of expertise. More on LLMs.
Check out some example test scenarios.Get notified of similar jobs
We'll send you an email when jobs similar to "Evaluation Scenario Writer - AI Agent Testing Specialist" are posted.
Related Jobs You Might Like
View all jobs →Industry Business Manager
SGS
Company Description We are SGS – the world’s leading testing, inspection and certification company. We are recognized as the global benchmark for sustainability, quality and integrity. Our 98,000 employees operate a network of 2,650 offices and laboratories, working together to enable a better, safer and more interconnected world. Job Description Reporting directly to the Managing Director, the successful candidate will ensure compliance with company quality systems, procedures, and processes while driving revenue generation and long-term profitability. The role requires a strategic approach to business development and operational management, with a focus on growth, client satisfaction, and continuous improvement. Develop and execute the Industrial business strategy, ensuring alignment with regional and corporate objectives while driving sustainable growth, profitability, and market leadership. Identify and capitalize on emerging market opportunities through strategic market intelligence, competitive analysis, customer insights, and industry trend assessment to expand market share and service offerings. Define and deliver short- and long-term business plans, including portfolio expansion, new service development, diversification initiatives, and strategic partnerships to accelerate business growth. Own the financial and operational performance of the business, monitoring key performance indicators and providing strategic insights and recommendations to senior leadership. Build and strengthen executive-level customer relationships, positioning SGS as a trusted partner while driving customer satisfaction, retention, and long-term revenue growth. Optimize operational efficiency, resource allocation, and service delivery models to enhance productivity, profitability, quality, and customer experience. Lead, develop, and inspire high-performing teams, fostering a culture of accountability, collaboration, innovation, and continuous improvement across employees and subcontractors. Champion a strong culture of safety, ethics, quality, and compliance, ensuring full adherence to SGS policies, regulatory requirements, and industry best practices. Represent the Industrial business internally and externally, strengthening SGS market presence, industry influence, and strategic stakeholder relationships. Qualifications 15+ years of progressive leadership experience managing large-scale operations, with a proven track record of driving business growth, operational excellence, and organizational performance. Demonstrated success leading business transformation, change management, and organizational development initiatives in complex and evolving environments. Strong commercial acumen with a growth-oriented mindset and a proven ability to identify, develop, and capitalize on market opportunities. Proven P&L leadership with full accountability for revenue growth, profitability, cost optimization, and sustainable business performance. Exceptional stakeholder management, negotiation, and influencing skills, with the ability to build trusted relationships with customers, partners, regulators, and senior executives. Strong customer-centric approach with a track record of developing strategic client relationships and driving long-term business partnerships. Inspirational and accountable leader with executive presence, capable of building high-performing teams and fostering a culture of engagement, collaboration, and continuous improvement. Strategic thinker with the ability to translate business strategy into actionable plans and deliver measurable results through effective execution. Experience operating in fast-paced, growth-oriented, and matrix organizations, balancing strategic priorities with operational demands. Experience within the Middle East region is highly preferred, with a strong understanding of regional market dynamics and business practices. Fluent in English, both written and spoken; Arabic language skills would be considered a strong asset.
Manager of Population Health Management
Al Moosa
Strategy Formulation & Budget ManagementDevelop departmental strategic objectives, KPIs, and individual employee goals in alignment with leadership direction.Ensure achievement of departmental targets through effective planning, risk management, and data-driven decision-making.Develop, manage, and monitor the annual departmental budget, ensuring alignment with financial plans and minimizing variances.Core Activities Population Health Program OwnershipLead the design and management of PHM programs, including screening, chronic disease management, and employee health initiatives.Define program scope, eligibility criteria, care pathways, care gaps, and outcome measures in collaboration with clinical specialties.Conduct disease segmentation and risk stratification to identify and prioritize high-risk populations.Clinical & Cross-Functional CollaborationPartner with clinical specialties to integrate PHM programs into clinical workflows and ensure clinical alignment.Act as the PHM representative in clinical committees and leadership forums.Collaborate with ADA, Case Management, IT, and Quality teams to develop dashboards, KPIs, workflows, and care-gap resolution processes.Value-Based Care & Performance ManagementLead PHM contribution to value-based care initiatives, including pilot design and risk-stratification frameworks.Define PHM KPIs and ensure alignment with measurement logic developed with analytics teams.Interpret program performance data and contribute to executive and board-level reporting.Patient-Reported Outcomes & Care DeliveryLead PROMs initiatives across PHM programs, ensuring integration with quality, patient experience, and IT teams.Support patient-centered care delivery, ensuring dignity, compassion, and shared decision-making.Promote continuous improvement through patient feedback and outcome tracking.Qualifications & ExperienceBachelor’s degree in Nursing, Pharmacy, Allied Health, Physiotherapy, Public Health, or related clinical field (required).Master’s degree in Public Health, Health Administration, Epidemiology, Population Health, or related field (preferred). MD is an advantage.6–8 years of relevant experience, including 2–4 years in a managerial role.
Population Health Management Officer
Al Moosa
Core ActivitiesGenerate and maintain patient registries, care-gap lists, and high-risk population reports to support Population Health Management (PHM) programs.Coordinate patient outreach, recalls, scheduling, and follow-up activities for screening, chronic disease, and preventive health programs.Monitor program performance, maintain accurate records, and support KPI reporting, dashboard updates, and presentation preparation.Collaborate with clinic staff, Case Management, and Analytics teams to ensure effective program implementation and continuous improvement.Administer patient surveys (PROMs), track participation, and compile results to support program evaluation and decision-making.Coordinate employer-focused healthcare services, including screenings, checkups, occupational health, and related program activities.Support audits, quality initiatives, value-based care programs, and other departmental projects as assigned.Ensure compliance with information governance requirements while maintaining patient confidentiality and data security.Provide patient-centered support and coordinate care with compassion, professionalism, and respect.Qualifications & CertificationsDiploma in Nursing, Allied Health, Public Health, Health Information Management, or a related healthcare field.0–4 years of experience in healthcare operations, care coordination, case management, quality, population health, or a related area.Good command of both Arabic and English.