Full Stack Engineer
We're working with a highly innovative company that is at the forefront of improving AI agents through human feedback. This organization collaborates with leading AI research groups to train Large Language Models (LLMs) to function as proactive, multi-step agents, designing and optimizing complex, real-world architectural workflows.

The Role
Develop objective and verifiable criteria (rubrics) to evaluate system performance against strict functional requirements.
Review system logs and trajectories to refactor code, improve execution paths, and achieve optimal reliability.
Test systems for vulnerabilities, including data exposure, unauthorized access, and edge-case failures.
Contribute expertise to training advanced generative AI models.
This is a freelance role offering flexible hours and remote work.

What You'll Need
2+ years of experience in backend engineering, AI automation, or complex systems integration.
Proven ability to build and maintain production-grade software with modular separation.
Strong command of at least two major programming languages (e.g., Python, JavaScript, Go, or Java).
Experience working with SQL databases.
Practical experience building for live, non-mocked environments and handling multi-turn system interactions.
Outstanding attention to detail and the ability to provide clear, high-density technical feedback.

What's On Offer
Competitive hourly rates for core project work.
Additional incentives and opportunities to boost earnings through special missions.
Flexible remote work arrangement.
The chance to shape the future of autonomous agents and work with cutting-edge AI technologies.

Apply via Haystack today!