Job role insights

Date posted

30.10.2025

Closing date

30.10.2025

Offered salary

Min: $1,500/month

Career level

Middle

Qualification

Bachelor

Experience

2-3 years

Quantity

5 person

Gender

Male Or Female

Show more Hide less

Description

Mindrift is seeking analytical and curious contributors to join an innovative AI evaluation project. This flexible, project-based opportunity allows you to work remotely while contributing to cutting-edge AI research and development.

What You’ll Do:

Review AI evaluation tasks and scenarios for logical consistency, completeness, and realism.
Identify inconsistencies, missing assumptions, or unclear decision points in complex systems.
Define clear expected behaviors (gold standards) for AI agents.
Annotate cause-effect relationships, reasoning paths, and plausible alternatives.
Collaborate with QA, writers, and developers to refine evaluation tasks and cover edge cases.
Think through complex systems holistically to ensure AI agents are tested thoroughly.

What You’ll Bring:

Excellent analytical thinking and logical problem-solving skills.
Strong attention to detail to spot ambiguities, contradictions, and missing assumptions.
Familiarity with structured data formats such as JSON or YAML.
Clear written communication in English to document findings.
Ability to evaluate complex systems and anticipate potential failure points.

Preferred Experience & Background:

Experience with policy evaluation, logic puzzles, consulting, research, or academia.
Exposure to LLMs, prompt engineering, or AI-generated content.
Knowledge of QA processes, test-case design, and edge-case thinking.
Participation in olympiads or case-solving competitions is a plus.

Education & Experience:

Bachelor’s or master’s degree in computer science, data science, mathematics, logic, AI, or related fields is preferred but not required.
2+ years of professional, research, or project experience in analytics, consulting, or structured problem-solving roles.

Why Join:

Flexible, remote, part-time project that fits around your professional or academic schedule.
Competitive pay up to $19/hour based on skills and experience.
Contribute to advanced AI projects and shape the way AI understands complex tasks.
Gain valuable portfolio experience while working in a collaborative, forward-thinking environment.

How to Apply

If you’re interested in this position, please register on our portal and submit your application through the link below:

👉 Register & Apply at TeezJobs.com

View More Jobs:

Administrative Specialist

Senior Manager, Medicaid Network Management

1. What does an AI Agent Evaluation Analyst do?
An AI Agent Evaluation Analyst reviews AI agent tasks, evaluates their logic and consistency, identifies gaps or ambiguities, and helps define expected behaviors to ensure AI systems perform correctly.

2. Do I need coding experience for this role?
No coding experience is required. However, you should have strong analytical and critical thinking skills, be detail-oriented, and be able to evaluate complex systems and scenarios.

3. Is this a full-time job?
No, this is a flexible, part-time, project-based opportunity. You can work asynchronously on your own schedule, making it ideal for students, researchers, or professionals seeking extra projects.

Show more Hide less

Skills

Communication Management Data Analytics

Maps

Interested in this job?

0 days left to apply

AI Agent Evaluation Analyst Remote, Mexico

Job role insights

Description

How to Apply

Skills

Maps

Interested in this job?

Subscribe to our newsletter

About Us

Company

Services

Legal Information and Policies

AI Agent Evaluation Analyst Remote, Mexico

Job role insights

Description

How to Apply

Skills

Maps

Interested in this job?

Apply for this job

Subscribe to our newsletter

About Us

Company

Services

Legal Information and Policies

Send message