Staff Replication Development Engineer


Job Locations: US-CA-San Francisco - Remote | US-NC-Raleigh
Job ID: 2026-5723
Name Linked: Remote: San Francisco, CA
Country: United States
City: San Francisco - Remote
Worker Type: Regular Full-Time Employee
Posting Location : State/Province: CA

Overview

This is an incredible opportunity to be part of a company that has been at the forefront of AI and high-performance data storage innovation for over two decades. DataDirect Networks (DDN) is a global market leader renowned for powering many of the world's most demanding AI data centers, in industries ranging from life sciences and healthcare to financial services, autonomous cars, Government, academia, research and manufacturing.

"DDN's A3I solutions are transforming the landscape of AI infrastructure." - IDC

"The real differentiator is DDN. I never hesitate to recommend DDN. DDN is the de facto name for AI Storage in high performance environments" - Marc Hamilton, VP, Solutions Architecture & Engineering | NVIDIA

DDN is the global leader in AI and multi-cloud data management at scale. Our cutting-edge data intelligence platform is designed to accelerate AI workloads, enabling organizations to extract maximum value from their data. With a proven track record of performance, reliability, and scalability, DDN empowers businesses to tackle the most challenging AI and data-intensive workloads with confidence.

Our success is driven by our unwavering commitment to innovation, customer-centricity, and a team of passionate professionals who bring their expertise and dedication to every project. This is a chance to make a significant impact at a company that is shaping the future of AI and data management.

Our commitment to innovation, customer success, and market leadership makes this an exciting and rewarding role for a driven professional looking to make a lasting impact in the world of AI and data storage.

Job Description

DDN is seeking a Staff Replication Development Engineer to lead the design and development of the replication engine for the Infinia AI Data Platform. This role focuses on building enterprise-grade asynchronous replication capabilities that enable reliable and secure disaster recovery for large-scale data systems.

You will work on developing high-performance replication pipelines, efficient data synchronization mechanisms, and secure data transfer systems. This role requires deep expertise in distributed systems and strong technical leadership to deliver a scalable and resilient replication foundation.

Key Responsibilities

- Design and develop multi-threaded asynchronous replication systems with parallel streaming capabilities
- Build object-level delta replication with checkpointing and resume functionality
- Develop replication engines supporting bucket/share-level replication controls
- Implement secure data transfer mechanisms using TLS 1.3 with mutual authentication
- Ensure end-to-end data integrity through checksum validation and verification pipelines
- Design and implement manual failover workflows for disaster recovery scenarios
- Build and maintain REST APIs for replication configuration, control, and automation
- Develop metadata tracking and change detection systems to enable efficient replication
- Implement RPO visibility, alerting, and operational insights for replication status
- Contribute to monitoring dashboards focused on replication health and performance
- Ensure systems are designed for high availability, fault tolerance, and scalability
- Partner with QA teams to drive performance, resiliency, and scale validation
- Collaborate with backend, security, and platform teams to deliver end-to-end replication workflows
- Participate in debugging, production issue resolution, and continuous improvement of replication reliability
- Provide technical leadership, architectural guidance, and mentorship to the engineering team

Required Qualifications

- 8+ years of experience in distributed systems, storage systems, or backend software engineering
- Strong programming skills in one or more languages: C++, Go, Java, or Rust
- Experience designing and building data replication systems, data pipelines, or distributed data services
- Deep understanding of distributed systems concepts (consistency, availability, scalability, fault tolerance)
- Strong expertise in multi-threading, concurrency, and parallel processing
- Knowledge of networking protocols and secure communication (TCP/IP, HTTP/HTTPS, TLS)
- Experience implementing data integrity mechanisms (checksums, validation, consistency checks)
- Experience designing and building REST APIs and service-based architectures
- Familiarity with checkpointing, failure recovery, and retry mechanisms in distributed systems
- Basic understanding of observability concepts (metrics, logging, alerting)
- Strong debugging, problem-solving, and system design skills

Preferred Qualifications

- Experience with asynchronous replication, disaster recovery (DR), or backup systems
- Familiarity with object storage or large-scale data storage systems
- Knowledge of delta encoding, change data capture, or incremental data synchronization techniques
- Experience building high-throughput, low-latency data movement systems
- Exposure to security practices including mutual TLS, encryption, and authentication
- Experience working on enterprise-scale data platforms or storage products
- Familiarity with performance optimization and large-scale system tuning

Salary Range for this role: $185,000 - $230,000

DDN

Join our dynamic and driven team, where engineering excellence is at the heart of everything we do. We seek individuals who love to challenge themselves and are fueled by curiosity. Here, you'll have the opportunity to work across various areas of the company, thanks to our flat organizational structure that encourages hands-on involvement and direct contributions to our mission. Leadership is earned by those who take initiative and consistently deliver outstanding results, both in their work ethic and deliverables, making strong prioritization skills essential. Additionally, we value strong communication skills in all our engineers and researchers, as they are crucial for the success of our teams and the company as a whole.

Interview Process: After submitting your application, one of our recruiters will review your resume. If your application passes this stage, you will be invited to a 30-minute interview during which a member of our team will ask some basic questions. If you clear the interview, you will enter the main process, which can consist of up to four interviews in total:

- Coding assessment: Often in a language of your choice.
- Systems design: Translate high-level requirements into a scalable, fault-tolerant service (depending on role).
- Real-time problem-solving: Demonstrate practical skills in a live problem-solving session.
- Meet and greet with the wider team.

Our goal is to finish the main process in 2-3 weeks at most.

DataDirect Networks (DDN) is an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity, gender expression, transgender, sex stereotyping, sexual orientation, national origin, disability, protected Veteran Status, or any other characteristic protected by applicable federal, state, or local law.