Senior QA Tester – GenAI/LLM Testing

Job Title: Senior QA Tester – GenAI/LLM Testing
Location: Houston, TX (Onsite/Hybrid)
Experience: 7+ Years

Job Description:
We are seeking an experienced Senior QA Tester with deep expertise in testing Large Language Model (LLM)-based applications, GenAI, and ML-driven products. The ideal candidate has strong hands-on experience with LangSmith, Promptfoo, and other modern LLM testing frameworks, along with a proven track record in automation, functional, and performance testing.

Key Responsibilities:
   •   Design, develop, and execute test strategies for LLM-based products, Generative AI applications, and ML workflows.
   •   Perform prompt testing, regression testing, functional testing, and evaluation of LLM-powered features using tools like LangSmith and Promptfoo.
   •   Evaluate model outputs for hallucinations, bias, accuracy, and reliability.
   •   Develop automated test scripts for GenAI/ML pipelines to ensure scalability and consistency.
   •   Work closely with developers, ML engineers, and product teams to identify defects, edge cases, and quality gaps in AI-driven features.
   •   Collaborate on CI/CD integration for automated testing of AI/ML models and APIs.
   •   Document and maintain test cases, bug reports, and test execution results.
   •   Provide technical expertise and best practices for AI/ML model validation and benchmarking.

Required Skills & Experience:
   •   7+ years of hands-on experience in QA testing, including at least 2 years in GenAI/LLM/ML product testing.
   •   Proficiency in LLM testing frameworks such as LangSmith and Promptfoo.
   •   Strong knowledge of Generative AI models, NLP, and ML pipelines.
   •   Solid expertise in test automation (Python, Pytest, Java, or similar languages and frameworks).
   •   Experience testing APIs, microservices, and cloud-based AI solutions (AWS, Azure, GCP).
   •   Understanding of prompt engineering, evaluation metrics, and model fine-tuning validation.
   •   Strong debugging, problem-solving, and analytical skills.
   •   Excellent communication skills and the ability to work in a fast-paced, agile environment.

Nice-to-Have:
   •   Experience with Apple ecosystem tools (Xcode, XCTest, iOS/macOS testing).
   •   Familiarity with MLOps practices and model monitoring frameworks.
   •   Exposure to RAG (Retrieval Augmented Generation) or vector database testing.