{"schemaVersion":"jobsearcher.job.v1","id":"f784f2bed6ffd6e7838f0a97","url":"https://jobsearcher.com/jobs/f784f2bed6ffd6e7838f0a97","canonicalUrl":"https://jobsearcher.com/jobs/f784f2bed6ffd6e7838f0a97","title":"Site Reliability Engineer (PostgreSQL)","description":"Site Reliability Engineer - Data Center (Level 3) - PostgreSQL Job Summary We are seeking an experienced Site Reliability Engineer (SRE) to join our Data Center Engineering team at Level 3. This role requires a technically strong and operationally mature engineer who will help design, scale, and maintain the reliability of our physical and virtual data center infrastructure. As a Level 3 SRE, you will be a technical leader responsible for ensuring system uptime, optimizing capacity and performance, and contributing to long-term infrastructure resiliency. Key Responsibilities • Design, implement, and maintain PostgreSQL databases, including schema design, indexing strategies, query optimization, logical/physical replication, hot standby failover, and load balancing. • Develop and execute backup and recovery strategies, including pg_dump, pg_basebackup, WAL archiving, point-in-time recovery (PITR), and disaster recovery planning. • Monitor and optimize database performance, resource utilization, and storage growth using pg_stat_statements, EXPLAIN ANALYZE, pg_top, and Prometheus/Grafana dashboards; proactively troubleshoot performance bottlenecks. • Ensure database security through role-based access control (RBAC), audit logging with pgaudit, and compliance with regulatory standards. • Implement high availability (HA) and disaster recovery (DR) solutions using Patroni, streaming replication, synchronous/asynchronous replication, and failover orchestration. • Plan and execute database version upgrades and apply security or performance patches with minimal downtime, ensuring data integrity and compatibility checks. • Collaborate with application teams, BI developers, and ETL engineers to support data pipelines, optimizing queries, and workflow performance. • Implement monitoring and alerting solutions using Prometheus, Grafana, Zabbix, or Nagios to track database health, query latency, and resource usage. • Manage database user accounts, roles, and privileges to enforce security policies and regulatory compliance, including sudo/OS-level permissions for critical operations. • Conduct capacity planning, workload forecasting, and index/partition tuning to handle anticipated growth and high-concurrency workloads. • Automate database maintenance tasks using Python, Bash, or Ansible scripts, including schema migrations, routine checks, and patch deployment. • Document procedures, configurations, operational runbooks, and PostgreSQL best practices for team knowledge sharing. • Mentor and guide team members on PostgreSQL internals, replication setups, and performance tuning techniques. • Evaluate and recommend new database tools, extensions (like TimescaleDB, pg_stat_statements), and best practices to improve efficiency, scalability, and resilience. Education and Experience • Bachelor's degree in Computer Engineering, Electrical Engineering, Information Technology, or a related technical field. • 4-7 years of experience in database administration and operations. • Experience participating in or leading incident response and postmortem analysis processes. • Previous exposure to hybrid environments integrating on-premise data centers with public or private cloud platforms is desirable. • Experienced PostgreSQL Database Administrator managing production and non-production PostgreSQL environments. • Skilled in backup and recovery, replication, performance tuning, and high availability. • Proven ability to troubleshoot critical issues, automate DBA tasks, and ensure database reliability. Expertise • 4+ years of hands-on PostgreSQL administration experience. • Strong SQL and PL/pgSQL expertise; experience with database optimization and indexing. • Hands-on experience with backup, recovery, and HA solutions. • Strong proficiency in Linux and Debian environments. • Proficiency in scripting for database automation. • Excellent analytical, problem-solving, and troubleshooting skills. • Strong communication skills for cross-team collaboration. • Understanding of Oracle and MySQL databases is a plus, but not mandatory.","company":"Salesforce","rawCompany":"salesforce","city":"Plano","state":"TX","isRemote":false,"isActive":false,"createdAt":"2026-06-18T04:13:18.682Z","occupations":[{"code":"15-1242.00","title":"Database Administrators","slug":"database-administrators"},{"code":"15-1243.00","title":"Database Architects","slug":"database-architects"},{"code":"15-1299.08","title":"Computer Systems Engineers/Architects","slug":"computer-systems-engineers-architects"}],"industries":[{"code":"541512","title":"Computer Systems Design Services","slug":"computer-systems-design-services"},{"code":"541519","title":"Other Computer Related Services","slug":"other-computer-related-services"},{"code":"518210","title":"Computing Infrastructure Providers, Data Processing, Web Hosting, and Related Services","slug":"computing-infrastructure-providers-data-processing-web-hosting-and-related-services"}],"jobPosting":{"@context":"https://schema.org","@type":"JobPosting","title":"Site Reliability Engineer (PostgreSQL)","description":"Site Reliability Engineer - Data Center (Level 3) - PostgreSQL Job Summary We are seeking an experienced Site Reliability Engineer (SRE) to join our Data Center Engineering team at Level 3. This role requires a technically strong and operationally mature engineer who will help design, scale, and maintain the reliability of our physical and virtual data center infrastructure. As a Level 3 SRE, you will be a technical leader responsible for ensuring system uptime, optimizing capacity and performance, and contributing to long-term infrastructure resiliency. Key Responsibilities • Design, implement, and maintain PostgreSQL databases, including schema design, indexing strategies, query optimization, logical/physical replication, hot standby failover, and load balancing. • Develop and execute backup and recovery strategies, including pg_dump, pg_basebackup, WAL archiving, point-in-time recovery (PITR), and disaster recovery planning. • Monitor and optimize database performance, resource utilization, and storage growth using pg_stat_statements, EXPLAIN ANALYZE, pg_top, and Prometheus/Grafana dashboards; proactively troubleshoot performance bottlenecks. • Ensure database security through role-based access control (RBAC), audit logging with pgaudit, and compliance with regulatory standards. • Implement high availability (HA) and disaster recovery (DR) solutions using Patroni, streaming replication, synchronous/asynchronous replication, and failover orchestration. • Plan and execute database version upgrades and apply security or performance patches with minimal downtime, ensuring data integrity and compatibility checks. • Collaborate with application teams, BI developers, and ETL engineers to support data pipelines, optimizing queries, and workflow performance. • Implement monitoring and alerting solutions using Prometheus, Grafana, Zabbix, or Nagios to track database health, query latency, and resource usage. • Manage database user accounts, roles, and privileges to enforce security policies and regulatory compliance, including sudo/OS-level permissions for critical operations. • Conduct capacity planning, workload forecasting, and index/partition tuning to handle anticipated growth and high-concurrency workloads. • Automate database maintenance tasks using Python, Bash, or Ansible scripts, including schema migrations, routine checks, and patch deployment. • Document procedures, configurations, operational runbooks, and PostgreSQL best practices for team knowledge sharing. • Mentor and guide team members on PostgreSQL internals, replication setups, and performance tuning techniques. • Evaluate and recommend new database tools, extensions (like TimescaleDB, pg_stat_statements), and best practices to improve efficiency, scalability, and resilience. Education and Experience • Bachelor's degree in Computer Engineering, Electrical Engineering, Information Technology, or a related technical field. • 4-7 years of experience in database administration and operations. • Experience participating in or leading incident response and postmortem analysis processes. • Previous exposure to hybrid environments integrating on-premise data centers with public or private cloud platforms is desirable. • Experienced PostgreSQL Database Administrator managing production and non-production PostgreSQL environments. • Skilled in backup and recovery, replication, performance tuning, and high availability. • Proven ability to troubleshoot critical issues, automate DBA tasks, and ensure database reliability. Expertise • 4+ years of hands-on PostgreSQL administration experience. • Strong SQL and PL/pgSQL expertise; experience with database optimization and indexing. • Hands-on experience with backup, recovery, and HA solutions. • Strong proficiency in Linux and Debian environments. • Proficiency in scripting for database automation. • Excellent analytical, problem-solving, and troubleshooting skills. • Strong communication skills for cross-team collaboration. • Understanding of Oracle and MySQL databases is a plus, but not mandatory.","datePosted":"2026-06-18T04:13:18.682Z","dateModified":"2026-06-18T04:13:18.682Z","hiringOrganization":{"@type":"Organization","name":"Salesforce","sameAs":"https://jobsearcher.com"},"jobLocation":{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Plano","addressRegion":"TX","addressCountry":"US"}},"identifier":{"@type":"PropertyValue","name":"JobSearcher","value":"f784f2bed6ffd6e7838f0a97"},"url":"https://jobsearcher.com/jobs/f784f2bed6ffd6e7838f0a97"}}