Financial AI Evaluator (Remote, Hourly Contractor)
Position SummaryIn this remote, hourly contractor role, you will evaluate AI-generated financial content and develop cases that test analytical reasoning accuracy. Your work directly improves how leading AI models handle financial information, making them more accurate, reliable, and clearly explained. Tasks may include:Evaluating AI-generated responses across banking, insurance, accounting, and risk management for factual accuracy, analytical soundness, and correct application of financial principlesIdentifying errors in financial methodology, flawed assumptions, misapplied regulations, and unsupported conclusions, fact-checking against reliable sources, and explaining corrections clearly in writingAssessing AI reasoning across multi-document financial scenarios, including cross-referencing data across balance sheets, reports, and operational recordsDeveloping prompts and test cases that test AI performance on practical financial operations and data analysis, including Excel-based evaluation tasksRating and comparing AI responses based on correctness, internal consistency, and adherence to the promptProfile RequirementsBachelor's degree or higher in Finance, Accounting, Economics, Business, or a closely related fieldMinimum 1 year of full-time professional experience in banking, insurance, accounting, or risk managementAbility to explain complex financial concepts clearly and accurately in writingFull professional English proficiencyReliable and self-directed, with consistent output quality in a remote, asynchronous workflowPreferred ExperiencePrior experience with AI data training, annotation, or model evaluationFamiliarity with multimodal AI tools or LLM evaluation workflowsAbout CNTXT AICNTXT AI builds artificial intelligence products and data solutions with a focus on making AI accurate, safe, and globally relevant for impact. Our work spans data services, custom AI solutions, and proprietary AI products, with deep expertise in Arabic-native and secure, sovereign solutions.