Senior Infrastructure Engineer
Company DescriptionCandidly was founded in 2016 to flip the script on what it means to plan, borrow, repay, and save for college. Today, we’re the category leader with the market’s most comprehensive AI-driven student debt and savings optimization platform. We partner with hundreds of top employers, financial institutions, and retirement record keepers, positioning Candidly to serve more than 35 million Americans. We’re already achieving incredible results — to date, we’ve helped our users get on track to eliminate more than $1.8B in student debt and pay off their loans 175,000 years quicker — and we’re seeking movers, shakers, innovators, and problem solvers to help take our mission even further. Candidly is a high-growth, Series B startup, funded by leading investors including Altos Ventures, Aflac, Salesforce Ventures, UBS, Equal Opportunity Ventures, Impact Engine, Rethink Impact, Unum, and Cercano Management. Our fully remote, international team of 70 (and counting) includes alumni from Google, UBS, Twitter, Plaid, Prudential, LendingTree, Morgan Stanley, Deutsche Bank, and more. Job DescriptionWe're looking for a Senior Infrastructure Engineer to lead the design, implementation, and ongoing evolution of our cloud infrastructure. This is a role for someone who brings both technical depth and genuine curiosity — someone who sees an opportunity to improve something and goes after it, who takes problems end-to-end, and who makes the engineers around them better just by being present.You'll partner closely with engineering leadership and cross-functional teams to ensure our infrastructure strategy stays ahead of our product ambitions. You'll also be a key voice in shaping how we build — our standards, our reliability posture, and our operational culture.What You'll DoLead the design and implementation of scalable, secure, and resilient cloud infrastructure across AWS and Azure, supporting both Candidly's AI and SaaS productsDrive the architectural vision and strategy, ensuring alignment with long-term business goals and surfacing risks before they become problemsOwn and enforce best practices for infrastructure as code (IaC), CI/CD, and automated deploymentsTake the lead on automating and accelerating SDLC processes — identifying bottlenecks in how we build and ship, and designing solutions that make the whole pipeline faster and smoother, whether that involves AI-assisted tooling or traditional automationServe as a subject matter expert on cloud architecture, containerization, and observabilityLead incident response and post-mortems with a focus on systemic improvement, not just immediate fixesProactively identify and close gaps before they compoundWho You AreYou take ownership naturally — you see what needs to happen and you make it happen, and when you commit to something, it gets doneYou think and communicate beyond your immediate scope. You invest in the people around you, take informed positions on technical direction, and make the team stronger — not just the codebaseYou bring genuine energy and drive. You're energized by ambiguity, you raise the bar for yourself and others, and you don't settle for "good enough" when better is possibleRequired Skills and Experience 5–7+ years of hands-on experience in cloud infrastructure, DevOps, or Site Reliability Engineering (SRE)Expert-level AWS knowledge: EC2, ELB, ASG, RDS, S3, SQS, Lambda, IAM, VPC, CloudFormation, CDK, and Route 53Solid understanding of Linux administration and networking concepts (VPNs, VPC peering, NAT, DNS, firewalls)Deep experience with Infrastructure as Code (IaC) for container orchestration at scale using CloudFormation, AWS CDK, Docker, and KubernetesExcellent written and verbal English communication — you can translate tradeoffs for engineers and stakeholders alikeComfortable working remotely and independentlyModerate-level Azure experience, with comfort operating across multi-cloud and multi-region environmentsExpertise in CI/CD pipelines (AWS CodePipeline, GitLab CI, or similar)Strong proficiency in scripting and automation using Python, Bash, and AnsibleDeep understanding of monitoring and logging with systems and strategies (AWS Cloudwatch, Datadog, Azure Monitor)Hands-on experience with high-availability architectures and auto-scaling strategiesSolid grasp of AWS security best practices: IAM, encryption, Secrets Manager, and security auditingExperience with databases (MySQL, Postgres, Redshift)Familiarity with serverless architectures (AWS Lambda, Fargate)Knowledge of database replication strategiesPreferred Skills and ExperienceExperience in a high-growth, regulated industry (Fintech), specifically in architecting and scaling infrastructure to maintain reliability and compliance under rapid user and transaction volume growthProven technical leadership experience, including improving infrastructure processesAWS Professional-level certifications (Solutions Architect Professional, DevOps Engineer Professional)Knowledge of event-driven architectures (SNS, SQS, EventBridge)Experience in cost optimization strategies for AWS environmentsExperience in compliance frameworks (SOC, NIST, ISO, CCPA, GDPR)Experience integrating AWS services with third-party tools for observability and securityExperience with single-tenant and multi-tenant architectures, as well as client on-premises deployment systemsFamiliarity with scalable, reproducible ML pipelines, tools, and frameworks (Kubeflow, MLflow, Amazon SageMaker)Data engineering experience and large-scale data processing and storageKnowledge of Jira and Confluence, including best practices for ticket management and KPI tracking Background and EEOCCandidly offers for employment are conditioned upon satisfactory completion of our employment screening process (including, but not limited to, a review of past employment and education records, background investigation, and/or credit check & fingerprints).Candidly strives to foster an environment where every employee can succeed. As an Equal Opportunity Employer we do not discriminate on the basis of race, religion, color, sex, sexual orientation, gender identity, gender expression, national origin, age, non-disqualifying physical or mental disability, veteran status, or any other basis covered by applicable law. All employment is decided on the basis of qualifications, merit, and business need.