Gen AI Architect
Straive is a global leader in enterprise-grade Data Analytics and AI solutions, committed to empowering businesses across various industries with cutting-edge technology and expert insights. Backed by EQT, a top private equity firm, we are uniquely positioned to drive innovation through significant investments and an entrepreneurial spirit.Our core focus is on delivering advanced Data Analytics & AI Solutions. By combining sophisticated technology with subject matter expertise, we deliver material impact on our clients' topline and streamline their operations. We specialize in providing tailored solutions across financial services, CPG, legal, pharma, life sciences, retail and logistics, helping them build robust data analytics and AI capabilities.With a client base spanning 30 countries, Straive's strategically located teams operate from eight countries and is headquartered in Singapore. This global presence enables us to offer localized expertise with a worldwide perspective.Join Straive to be part of a dynamic team at the forefront of data analytics and AI innovation. Here, you'll have the opportunity to contribute to transformative projects, supported by significant investments and an entrepreneurial drive fueled by our partnership with EQT.Website: https://www.straive.com/ LinkedInJob Title: Gen AI ArchitectLocation: New Jersey- (Hybrid)Type: FulltimeExperience10+ years in ML/Engineering.At least 5 years in architecture roles.Proven experience delivering production-grade solutions on Azure.Hands-on ownership of end-to-end application lifecycle—from design to deployment.ResponsibilitiesLead architecture and development of enterprise web applications with integrated Generative AI.Define scalable, secure architectural patterns and implementation standards.Work closely with product, AI/ML, DevOps, security, and network teams to align business and technical goals.Drive full-stack development best practices across backend, frontend, and infrastructure.Architect and integrate LLMs, embeddings, RAG pipelines, and vector databases into production systems.Ensure production readiness—security, networking, monitoring, performance, and complianceMandatory Technical SkillsCloud Architecture (Azure)Deep experience designing cloud-native, microservices and event-driven architectures.Expertise with Azure App Services, AKS, Functions, API Management, Event Grid, Service Bus, Storage, and Key Vault.Strong understanding of subscription design, resource hierarchy, and environment isolation.Performance & ScalabilityExpertise in caching (Redis/CDN), async processing (RabbitMQ/Kafka), load balancing, auto-scaling, and performance tuning.GenAI IntegrationHands-on experience with Azure OpenAI, RAG patterns, embeddings, prompt engineering, vector search (Azure AI Search, PostgreSQL extensions).Experience orchestrating LLM pipelines in real-world production environments.Backend EngineeringStrong REST API design (versioning, throttling, API gateways).Expert in PostgreSQL/MongoDB, data modeling, query optimization.Experience with OAuth2, SSO, and secure coding aligned to GDPR/SOC2.Frontend EngineeringDeep experience with React/Next.js, component libraries, and enterprise UI patterns.DevOps & IaCStrong proficiency withAzure DevOps or GitHub Actions CI/CD.Docker, Kubernetes (AKS), Helm.Terraform or Bicep for infrastructure automation.Experience managing production roll-outs, blue-green deployments, and canary releases.Real-Time & Scalable UIKnowledge of WebSockets/SSE, state management, and high-performance UI rendering.Testing & ObservabilityExperience with automated unit/E2E testing.Strong knowledge of logging, tracing, monitoring (App Insights, Log Analytics), and alerting.