Candidates Experience With Us + Latest Updates
Personalized Support for Your Success
Upcoming Trainings & Events
AI Evaluation Engineer (Software Engineering / Code)
IT Jobs. Gramian Consultancy Jobs
Key Responsibilities
- Design and build multi-agent benchmark tasks based on real-world code changes (bug fixes, migrations, refactors)
- Work with the Harbor evaluation framework to run and validate tasks in containerized environments
- Write clear, precise task instructions (file paths, function signatures, expected behavior, constraints)
- Develop Python-based verification scripts to validate correctness of code changes
- Define task decomposition strategies across multiple specialized agents
- Analyze and navigate large open-source codebases to extract realistic task scenarios
- Run, debug, and refine tasks in Docker environments to ensure reproducibility
- Improve task quality, clarity, and difficulty based on evaluation results
Qualifications & Experience
- 5+ years of experience in software development (Python and JavaScript)
- Strong experience working with large codebases (e.g., Django, Flask, FastAPI, Node.js or similar)
- Familiarity with Git workflows (pull requests, diffs, commits, cherry-picking)
- Experience writing tests or validation scripts (pytest, unittest, or similar)
- Ability to write clear, precise technical specifications
- Familiarity with AI coding benchmarks or evaluation frameworks (e.g., SWE-bench or similar)
- Hands-on experience with Docker (Dockerfiles, image builds, debugging)
How to Apply
🚨 Before You Apply for This Job. Need Help With Your CV?
This job will attract 1000+ applicants.
Many qualified professionals miss out on getting shortlisted and interviews — not because they lack experience, but because their CV doesn’t clearly show how they fit this specific job.
🎯 Want to get an interview fast? Customize your CV specifically for this job.
Using the same CV for every application will not get you interviews.
Email your CV today to our Client Service Manager, Rose, using cvwriting@corporatestaffing.co.ke
Subject: CV Review & Upgrade.
Rose and our recruiters will review your CV and show you exactly how to improve it for the job you are targeting.
Using an A.I-generated CV but not getting interviews? Get it reviewed here by our recruiters today.

