Bilingual Storage Cloud Platform Engineer (Mandarin & English)
We are hiring for a Global Top Social Media platform! Job type: Full-time, PermLocation: temporarily remote in the Bay Area OR hybrid in NYCSalary Range: USD $120,000 – $220,000 base annuallyIndustry Experience: 2–8 years of experience in storage and cloud platform development and operations. Candidates with a background in AIOps or LLMOps are highly preferred.ResponsibilitiesLead Architecture Design & Core Development:Drive the architecture evolution and core module development of the storage cloud platform. Explore AI-native architectures for intelligent storage solutions and promote the transformation of the platform from tool-based to intelligent systems. Build an Intelligent Operations System:Independently lead the planning and implementation of self-service management platforms for storage and databases. Leverage large language models (LLMs) to build intelligent operations agents, enabling natural language-based interaction and lowering the barrier to database usage.Drive Intelligent Operations Transformation:Rebuild the automated operations platform using LLM Agent technologies to create a self-diagnosing, self-healing, and self-optimizing system. Enable the transition from manual operations to autonomous operations.Innovate AI + Database Use Cases:Lead the implementation of advanced AI use cases in database systems, including intelligent SQL optimization, anomaly detection and root cause analysis, capacity forecasting and auto-scaling, and intelligent alert noise reduction.Enhance Product Experience:Continuously track cutting-edge AI agent products (e.g., OpenClaw, Claude Code, Devin) and integrate advanced AI interaction paradigms into the storage platform to deliver industry-leading intelligent operations experiences.QualificationsStrong Engineering Foundation:Proficient in at least one backend language such as Go, Java, or Python. Familiar with frontend frameworks like Vue or React. Hands-on experience building agent applications using LLM APIs (e.g., OpenAI, Claude, Tongyi Qianwen).Deep Storage Domain Expertise:Extensive hands-on experience with distributed storage systems. Familiar with at least two of the following: distributed KV/cache systems, MySQL, distributed databases, graph databases, table storage, and object storage, including both architecture and operational practices.AI Application Development Skills:Strong experience with LLM application development, including RAG, prompt engineering, function calling, and multi-agent frameworks (e.g., LangChain, LlamaIndex, AutoGen). Proven experience delivering end-to-end AI projects.Product Mindset & AI Awareness:Deep understanding and hands-on experience with AI tools such as OpenClaw, Claude Code, Cursor, and GitHub Copilot. Ability to translate AI capabilities into product features and deliver them in production environments.Soft Skills & Ownership:Excellent problem decomposition and technical problem-solving skills. Strong cross-functional collaboration and communication abilities. Customer-oriented mindset with strong ownership. Forward-looking perspective on applying AI technologies in infrastructure.Fluent in both English and Mandarin.