The Wikimedia Foundation is the nonprofit organization that hosts and operates Wikipedia and the other Wikimedia free knowledge projects. Our vision is a world in which every single human can freely share in the sum of all knowledge. We believe that everyone has the potential to contribute something to our shared knowledge, and that everyone should be able to access that knowledge, free of interference.
As a Senior Site Reliability Engineer for databases at the Wikimedia Foundation, you will be part of a small, focused team of skilled and experienced engineers. In this role, you will be responsible for ensuring the health of our database systems - including their availability and performance.
Your responsibilities will include troubleshooting issues, planning for disaster recovery, and enhancing and maintaining backups.
You do not have to be a database expert but must be willing to be trained to be one.
The work we do is crucial and is used by hundreds of millions of people. This is a unique opportunity to have a huge impact.
Responsibilities
Implementation, maintenance and troubleshooting of relational database systems in production and staging environments
Database performance tuning, high availability, replication, backups, and general optimization
Supporting the development and deployment of new services and systems
Handling configuration management, (Debian) package maintenance, patching and building, working with upstream on bug identification and resolution
Improving observability (alerting, metrics, monitoring) of database infrastructure
Multi-datacenter design, capacity and infrastructure planning
Taking part in incident response, diagnosis and follow-up on system outages or alerts across Wikimedia’s production infrastructure, and participating in an on-call rotation
Sharing our values and working in accordance with them
Qualifications
5+ years of experience in a DBA/SRE/Operations/DevOps role as part of a team
Experience with Open Source configuration management and orchestration tools (Puppet, Ansible, Chef, SaltStack, etc.), as well as modern observability infrastructure (Prometheus, Grafana, Graphite, Logstash/Kibana, Icinga/Nagios, etc.)
Advanced knowledge of Linux and IO/data storage concepts, internals and troubleshooting
Experience remotely managing both bare-metal servers and virtualized environments
Proficiency in automation, programming, and scripting
Experience with high traffic and highly available website architectures and operations
Strong English language skills
Ability to work independently in a fast-paced environment and as an effective part of a globally distributed team, using ticket tracking systems and asynchronous communication tools
B.S. or M.S. in Computer Science or equivalent work experience
Optional qualifications
Advanced level of experience with MariaDB or MySQL database administration and replication topologies at scale
Proficiency in SQL
Solid knowledge of relational database concepts and working experience with storage systems and architectures
Experience with LAMP stack technologies (PHP/HHVM, memcached/Redis) - MediaWiki experience is a definite plus
Experience with advanced distributed storage and database systems (Swift, Ceph, Cassandra, etc.) or graph databases (Titan, Blazegraph, etc.) is a big plus
Experience in architecture, design, and implementation of persistent data storage & query infrastructure
Strong track record of open source contributions is a major plus