{"schemaVersion":"jobsearcher.job.v1","id":"fb230844a432b81ae0cdeb06","url":"https://jobsearcher.com/jobs/fb230844a432b81ae0cdeb06","canonicalUrl":"https://jobsearcher.com/jobs/fb230844a432b81ae0cdeb06","title":"Python Developer (AI & LLMs)","description":"Key ResponsibilitiesAI & LLM Systems Build end-to-end RAG (Retrieval-Augmented Generation) pipelines for context-aware AI responses. Implement and fine-tune vLLM for efficient inference of large language models (LLMs). Collaborate with ML engineers to deploy transformer models (e.g., BERT, GPT variants) and vector databases.Data & Database Architecture Architect and optimize graph database systems (Neo4j) to model project knowledge networks and relationships. Develop Python-based microservices for data ingestion, processing, and API integrations (FastAPI, Flask).Performance & Operations Monitor system performance, conduct A/B tests, and ensure low-latency responses in production. Ensure scalability and efficiency of AI systems.Requirements Proficiency in Python and AI/ML libraries (PyTorch, TensorFlow, Hugging Face Transformers). Hands-on experience with graph databases, especially Neo4j (Cypher queries, graph algorithms). Demonstrated work on RAG pipelines (retrieval, reranking, generation) using frameworks like LangChain or LlamaIndex. Experience with vLLM or similar LLM optimization tools (quantization, distributed inference). Knowledge of vector databases (e.g., FAISS, Pinecone) and embedding techniques. Familiarity with cloud platforms (AWS/GCP/Azure) and containerization (Docker, Kubernetes).Preferred Qualifications Strong experience with FastAPI or Flask for building high-performance APIs. Familiarity with MLOps principles and tools (e.g., MLflow, Kubeflow). Contributions to open-source AI/ML projects. Experience in performance tuning and A/B testing for AI systems in a production environment.Soft Skills Strong analytical and problem-solving skills. Excellent communication and team collaboration abilities. Self-motivated with the ability to work independently and as part of a team.What We OfferCompetitive salary and performance-based bonuses.Flexible working hours with remote work options.Opportunities for professional development and skill enhancement.Collaborative and inclusive work environment.Paid sick timePaid time offProvident FundPerformance bonusYearly bonus","company":"Space Ai","rawCompany":"space ai","city":"Delray Beach","state":"FL","isRemote":false,"isActive":false,"createdAt":"2026-06-26T12:01:06.149Z","occupations":[{"code":"15-1252.00","title":"Software Developers","slug":"software-developers"},{"code":"15-1251.00","title":"Computer Programmers","slug":"computer-programmers"},{"code":"15-1254.00","title":"Web Developers","slug":"web-developers"}],"industries":[{"code":"541511","title":"Custom Computer Programming Services","slug":"custom-computer-programming-services"},{"code":"541512","title":"Computer Systems Design Services","slug":"computer-systems-design-services"},{"code":"513210","title":"Software Publishers","slug":"software-publishers"}],"jobPosting":{"@context":"https://schema.org","@type":"JobPosting","title":"Python Developer (AI & LLMs)","description":"Key ResponsibilitiesAI & LLM Systems Build end-to-end RAG (Retrieval-Augmented Generation) pipelines for context-aware AI responses. Implement and fine-tune vLLM for efficient inference of large language models (LLMs). Collaborate with ML engineers to deploy transformer models (e.g., BERT, GPT variants) and vector databases.Data & Database Architecture Architect and optimize graph database systems (Neo4j) to model project knowledge networks and relationships. Develop Python-based microservices for data ingestion, processing, and API integrations (FastAPI, Flask).Performance & Operations Monitor system performance, conduct A/B tests, and ensure low-latency responses in production. Ensure scalability and efficiency of AI systems.Requirements Proficiency in Python and AI/ML libraries (PyTorch, TensorFlow, Hugging Face Transformers). Hands-on experience with graph databases, especially Neo4j (Cypher queries, graph algorithms). Demonstrated work on RAG pipelines (retrieval, reranking, generation) using frameworks like LangChain or LlamaIndex. Experience with vLLM or similar LLM optimization tools (quantization, distributed inference). Knowledge of vector databases (e.g., FAISS, Pinecone) and embedding techniques. Familiarity with cloud platforms (AWS/GCP/Azure) and containerization (Docker, Kubernetes).Preferred Qualifications Strong experience with FastAPI or Flask for building high-performance APIs. Familiarity with MLOps principles and tools (e.g., MLflow, Kubeflow). Contributions to open-source AI/ML projects. Experience in performance tuning and A/B testing for AI systems in a production environment.Soft Skills Strong analytical and problem-solving skills. Excellent communication and team collaboration abilities. Self-motivated with the ability to work independently and as part of a team.What We OfferCompetitive salary and performance-based bonuses.Flexible working hours with remote work options.Opportunities for professional development and skill enhancement.Collaborative and inclusive work environment.Paid sick timePaid time offProvident FundPerformance bonusYearly bonus","datePosted":"2026-06-26T12:01:06.149Z","dateModified":"2026-06-26T12:01:06.149Z","hiringOrganization":{"@type":"Organization","name":"Space Ai","sameAs":"https://jobsearcher.com"},"jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Delray Beach","addressRegion":"FL","addressCountry":"US"}},"identifier":{"@type":"PropertyValue","name":"JobSearcher","value":"fb230844a432b81ae0cdeb06"},"url":"https://jobsearcher.com/jobs/fb230844a432b81ae0cdeb06"}}