Mindrift

Evaluation Scenario Writer - AI Agent Testing Specialist

Mindrift
Location
Job Type
Contract
Salary
USD 20-40/hour (Estimated)
Posted
2/23/2026
Career Level
Mid-Senior Level
Qualification
Degree in Computer Science, Software Engineering or related fields
Remote5+ years in software development16 views

Job Description

What this opportunity involves

  • Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements and information sources
  • Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases, not just superficial checks
  • Craft “fair but hard” challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required)
  • Analyze AI failures to understand what the model struggles with vs. what it masters
  • Iterate based on feedback from expert QA reviewers who score your work on 7 quality criteria

What we look for

  • Degree in Computer Science, Software Engineering or related fields
  • 5+ years in software development, primarily Python (pytest, async/await, subprocess, file operations)
  • Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems
  • Experience writing tests (functional, integration – not just running them)
  • Docker containers (running evaluations locally in containers)
  • CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results)
  • English proficiency - B2

How it works

  • Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid

Effort estimate

Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted.

Payment

  • Paid contributions, with rates up to $40/hour*
  • Fixed project rate or individual rates, depending on the project
  • Some projects include incentive payments

*Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.

Get notified of similar jobs

We'll send you an email when jobs similar to "Evaluation Scenario Writer - AI Agent Testing Specialist" are posted.

Keyword: Evaluation Scenario Writer - AI Agent Testing SpecialistLocation: Kuwait

No spam ever. Unsubscribe with one click anytime. By subscribing, you agree to our privacy policy.

Related Jobs You Might Like

View all jobs →
Mindrift

US Corporate Attorney - Freelance AI Trainer

Mindrift

KuwaitRemote
Contract
Up to $44 per hour

What this opportunity involves Generate prompts that challenge AI; Evaluate AI-generated solutions for correctness, assumptions, and logic; Improve AI reasoning to align with first principles and accepted standards; Apply structured scoring criteria to assess multi-step problem solving. What we look for Degree in law (Bachelor, J.D., LLM, FLLM) within the US context 2+ years of legal practice experience within US jurisdiction Strong written English (C1/C2) Stable internet connection How it works Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid Project time expectations For this project, tasks are estimated to require around 10–20 hours per week during active phases, based on project requirements. This is an estimate, not a guaranteed workload, and applies only while the project is active. Compensation On this project, contributors can earn up to $44 per hour equivalent, depending on their level and pace of contribution. Compensation varies across projects depending on scope, complexity, and required expertise. Please note that other projects on the platform may offer different earning levels based on their requirements.

View Details →

Head of User Access Control

AL AHLI BANK OF KUWAIT

Kuwait
Full-time
15k-25k KWD (Estimated)

Job Purpose A User Access Manager is responsible for overseeing and managing user access to the bank systems, applications, and data. The role ensures that access permissions are granted in compliance with internal security policies, industry standards, and CBK CSF beside of Manage, Monitor and Implement the approved change management processes and source code version control across all IT deliverables that are in-line with standards and best practice. Generic Accountabilities Work fully within risk policies and procedures and compliance regulations and ensure all divisional activities comply with corporate governance & regulatory/legal frameworks Develop and implement relevant policies and procedures and conduct regular reviews to remain relevant and effective. Manage people in line with people policies and best practices. Work alongside Risk Management in ensuring that the function works fully within the set frameworks, proactively monitor and report on risk exposure in order to enhance control effectiveness. Work fully within ABK’s Compliance regulations and standards. Specific Accountabilities: IT User Access Management User Access Administration: Create, modify, and delete user accounts and permissions across various systems and applications. Ensure proper access control is granted based on user roles and responsibilities. Conduct regular reviews and audits of user access to ensure compliance with company policies. Access Control Policy Development: Develop and enforce access control policies and procedures in line with industry best practices like PCI and ISO Ensure access rights are properly assigned and that segregation of duties is maintained. Access Request Management: Review and approve or reject user access requests based on predefined security guidelines. Coordinate with relevant departments to address user access requests promptly. Systems/Applications Patching Keep user access management systems updated with the latest security patches and enhancements. Continuously mitigate the reported vulnerabilities on the users access controls tools Reporting: Generate and maintain reports on user access activities, including access logs, permissions, and audits. Provide recommendations for improving access control and security measures. System Patching and audit Keep user access management systems updated with the latest security patches and enhancements. Continuously mitigate the reported vulnerabilities on the users access controls tools Close all the reported audit notes for the users access systems on time. Job Success Factors Education: Bachelor's Degree or Equivalent Certification/Experience Experience: 7 years of experience Banking background and good understanding of banking functions Software source code version control, applications standards and quality assurance techniques. Project management techniques and methodologies Process improvement techniques Skills Knowledge of identity and access management (IAM) tools and systems. Knowledge of Privileged Access Management “PAM” systems. Understanding of security protocols and practices. Familiarity with compliance standards, such as PCI, ISO…. Strong communication skills for interacting with users, management, and IT teams. ITSM solution knowledge and hand on experience dealing with different modules like assets, CMDB, Contracts and change management different types in general. Work Contact Internal: All IT Groups and ABK Departments External: Vendors

View Details →

Senior Officer IT Change Management

AL AHLI BANK OF KUWAIT

KuwaitRemote
Full-time
10k-15k KWD (Estimated)

Job Purpose Manage, Monitor and Implement the approved change management processes and source code version control across all IT deliverables that are in-line with international standards and best practice Specific Accountabilities Handle all change requests that ITD receives from Business and IT as well as ensuring that each CR has been adequately followed to meet the Change Management policy conforming to best practices and meeting the needs of the organization. Facilitate the cap meetings and ensuring that minutes of meeting is distributed on all attendees. Coordinate and follow up with business stakeholders on behalf of IT regarding any Changes request to promote awareness. Monitor the IT change management process and activities to ensure compliance with approved change management process, corporate governance & regulatory/legal frameworks. Handle the day-to-day administration of the “Service Desk Plus” SDP suite of tools, and ensuring quality of data entered Provide periodical and ad-hoc status reports to ITD management and business requesters. Develop process flow diagrams to support standard operating procedures. Assist in the ITD dashboard with periodical data updates of balance score and card Reporting Ensure that all IT Risk requirements are provided, Reviews and discussed before the final ORM Report is published Ensure that all Internal\External Audit requirements are provided, Reviews and discussed before the final Audit Report is published Support the Document management function for All ITD and maintain IT Governance documents as per the approved template & best practices Job Success Factors Bachelor's Degree or Equivalent Certification/Experience in Information Sciences and Technology At least 2 years of experience IT experience in a financial institution Skills Banking background and good understanding of banking functions Release Management version control, applications standards and quality assurance techniques. Project management techniques and methodologies Process improvement techniques IT Change and release management Source code version control ITIL and/or COBIT certification Work Contact Internal: All IT Groups and ABK Departments External: External Audits

View Details →
HomeJobsSign In