AI Agent Evaluation Analyst (Remote, Mexico)

Job role insights

  • Date posted

    30.10.2025

  • Closing date

    24.11.2025

  • Offered salary

    Min: $1,500/month

  • Career level

    Mid-level

  • Qualification

    Bachelor’s degree

  • Experience

    2-3 years

  • Quantity

    5 people

  • Gender

    Male or female

Description

Mindrift is seeking analytical and curious contributors to join an innovative AI evaluation project. This flexible, project-based opportunity allows you to work remotely while contributing to cutting-edge AI research and development.

What You’ll Do:

  • Review AI evaluation tasks and scenarios for logical consistency, completeness, and realism.
  • Identify inconsistencies, missing assumptions, or unclear decision points in complex systems.
  • Define clear expected behaviors (gold standards) for AI agents.
  • Annotate cause-effect relationships, reasoning paths, and plausible alternatives.
  • Collaborate with QA, writers, and developers to refine evaluation tasks and cover edge cases.
  • Think through complex systems holistically to ensure AI agents are tested thoroughly.

What You’ll Bring:

  • Excellent analytical thinking and logical problem-solving skills.
  • Strong attention to detail to spot ambiguities, contradictions, and missing assumptions.
  • Familiarity with structured data formats such as JSON or YAML.
  • Clear written communication in English to document findings.
  • Ability to evaluate complex systems and anticipate potential failure points.

Preferred Experience & Background:

  • Experience with policy evaluation, logic puzzles, consulting, research, or academia.
  • Exposure to LLMs, prompt engineering, or AI-generated content.
  • Knowledge of QA processes, test-case design, and edge-case thinking.
  • Participation in olympiads or case-solving competitions is a plus.

Education & Experience:

  • Bachelor’s or master’s degree in computer science, data science, mathematics, logic, AI, or related fields is preferred but not required.
  • 2+ years of professional, research, or project experience in analytics, consulting, or structured problem-solving roles.

Why Join:

  • Flexible, remote, part-time project that fits around your professional or academic schedule.
  • Competitive pay up to $19/hour based on skills and experience.
  • Contribute to advanced AI projects and shape the way AI understands complex tasks.
  • Gain valuable portfolio experience while working in a collaborative, forward-thinking environment.

How to Apply

If you’re interested in this position, please register on our portal and submit your application through the link below:

👉 Register & Apply at TeezJobs.com

1. What does an AI Agent Evaluation Analyst do?
An AI Agent Evaluation Analyst reviews AI agent tasks, evaluates their logic and consistency, identifies gaps or ambiguities, and helps define expected behaviors to ensure AI systems perform correctly.

2. Do I need coding experience for this role?
No coding experience is required, though familiarity with structured data formats such as JSON or YAML is expected. You should also have strong analytical and critical thinking skills, be detail-oriented, and be able to evaluate complex systems and scenarios.

3. Is this a full-time job?
No, this is a flexible, part-time, project-based opportunity. You can work asynchronously on your own schedule, making it ideal for students, researchers, or professionals seeking extra projects.
